./textproc/py-Whoosh, Fast, featureful full-text indexing and searching library implemented

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: pkgsrc-2021Q1, Version: 2.7.4, Package name: py38-Whoosh-2.7.4, Maintainer: pkgsrc-users

Whoosh is a fast, featureful full-text indexing and searching
library implemented in pure Python.

Some of Whoosh's features include:

* Pythonic API.
* Pure-Python. No compilation or binary packages needed, no
mysterious crashes.
* Fielded indexing and search.
* Fast indexing and retrieval -- much faster than any other
pure-Python solution.
* Pluggable scoring algorithm (including BM25F), text analysis,
storage, posting format, etc.
* Powerful query language parsed by pyparsing.
* Pure Python spell-checker (as far as I know, the only one).

Whoosh takes much inspiration (and sometimes translates code) from
other open-source search engines. The fundamental design is similar
to (but does not entirely work like and is not compatible with)
Lucene, but uses KinoSearch's indexing algorithm, some scoring
algorithms from Terrier, and the English morphological variation
generator from Minion.


Master sites:

SHA1: 415c711099d09fbb6fa4f652575d96c6d7f79f67
RMD160: f60b9fe6277f4b9a7aa48e3c7df2d6fff6390a36
Filesize: 946.036 KB

Version history: (Expand)