./textproc/p5-Lingua-EN-Tagger, Part-of-speech tagger for English natural language processing

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: pkgsrc-2013Q2, Version: 0.20nb2, Package name: p5-Lingua-EN-Tagger-0.20nb2, Maintainer: pkgsrc-users

The module is a probability based, corpus-trained tagger that assigns POS
tags to English text based on a lookup dictionary and a set of probability
values. The tagger assigns appropriate tags based on conditional
probabilities - it examines the preceding tag to determine the appropriate
tag for the current word. Unknown words are classified according to word
morphology or can be set to be treated as nouns or other parts of speech.

The tagger also extracts as many nouns and noun phrases as it can, using a
set of regular expressions.


Required to run:
[www/p5-HTML-Tagset] [www/p5-HTML-Parser] [devel/p5-Memoize-ExpireLRU] [lang/perl5] [textproc/p5-Lingua-Stem]

Master sites: (Expand)

SHA1: c46628a9dfdb54567cd64c65fff43a49ea36452c
RMD160: 0f64670807a0ca4ecde3789ecb2d7b69ba6bac4b
Filesize: 256.499 KB

Version history: (Expand)