./textproc/py-Whoosh, Fast, featureful full-text indexing and searching library implemented

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: pkgsrc-2015Q4, Version: 2.5.7, Package name: py27-Whoosh-2.5.7, Maintainer: pkgsrc-users

Whoosh is a fast, featureful full-text indexing and searching
library implemented in pure Python.

Some of Whoosh's features include:

* Pythonic API.
* Pure-Python. No compilation or binary packages needed, no
mysterious crashes.
* Fielded indexing and search.
* Fast indexing and retrieval -- much faster than any other
pure-Python solution.
* Pluggable scoring algorithm (including BM25F), text analysis,
storage, posting format, etc.
* Powerful query language parsed by pyparsing.
* Pure Python spell-checker (as far as I know, the only one).

Whoosh takes much inspiration (and sometimes translates code) from
other open-source search engines. The fundamental design is similar
to (but does not entirely work like and is not compatible with)
Lucene, but uses KinoSearch's indexing algorithm, some scoring
algorithms from Terrier, and the English morphological variation
generator from Minion.


Required to run:
[lang/python27] [devel/py-setuptools]

Master sites:

SHA1: 915828152ea97823345348530614455202fbcd5c
RMD160: e5bd168af8cc5cd601af9d3aff6c6cb85c38d61c
Filesize: 1016.367 KB

Version history: (Expand)