Differences between lexical features and orthographic features in NLP?


Features are used for model training and testing. What are the differences between lexical features and orthographic features in natural language processing? Examples are preferred.

I am not aware of such a distinction. Most of the time, when people talk about lexical features, they are talking about using the word itself, in contrast to using other features, e.g. its part-of-speech.

Here is an example of a paper that means "whole word orthography" when it refers to lexical features.

One could venture that orthographic features mean something more abstract than the sequence of characters themselves, for example whether the sequence is capitalized / titlecased / camelcased / etc. But we already have the useful and well-understood "shape feature" denomination for that.

As such, I would recommend distinguishing features like this:

Lexical features: whole word, prefix/suffix (various lengths possible), stemmed word, lemmatized word

Shape features: uppercase, titlecase, camelcase, lowercase

Grammatical and syntactic features: POS tag, part of a noun phrase, head of a verb phrase, complement of a prepositional phrase, etc.
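To make the first two categories concrete, here is a minimal sketch in plain Python. The function names (`shape_category`, `lexical_features`, `token_features`) and the feature keys are hypothetical, chosen only for illustration, and the camelcase test (a lowercase letter immediately followed by an uppercase one) is an assumed heuristic. Stemmed and lemmatized forms, as well as the grammatical and syntactic features, are left out because they need a stemmer, lemmatizer, tagger, or parser.

```python
import re


def shape_category(token: str) -> str:
    """Return a coarse shape label for a token (labels are illustrative)."""
    if token.isupper():
        return "uppercase"
    if token.islower():
        return "lowercase"
    if token.istitle():
        return "titlecase"
    # Assumed heuristic: a lowercase letter directly followed by an
    # uppercase one, as in "camelCase" or "iPhone".
    if re.search(r"[a-z][A-Z]", token):
        return "camelcase"
    return "other"


def lexical_features(token: str, max_affix: int = 3) -> dict:
    """Whole word plus prefixes/suffixes of length 1..max_affix."""
    feats = {"word": token}
    for n in range(1, max_affix + 1):
        # Skip affixes that would cover the whole token.
        if len(token) > n:
            feats[f"prefix{n}"] = token[:n]
            feats[f"suffix{n}"] = token[-n:]
    return feats


def token_features(token: str) -> dict:
    """Combine lexical and shape features into one feature dictionary."""
    feats = lexical_features(token)
    feats["shape"] = shape_category(token)
    return feats


if __name__ == "__main__":
    for tok in ["London", "NLP", "camelCase", "running"]:
        print(tok, token_features(tok))
```

Keeping the shape label separate from the lexical features means a model can generalize over capitalization patterns even for words it never saw during training.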

This is not an exhaustive list of possible features and feature categories, but it might help you categorize linguistic features in a clearer and more widely accepted way.

