./textproc/py-acora, Fast multi-keyword search engine for text strings in Python

Branch: pkgsrc-2018Q1, Version: 2.1, Package name: py27-acora-2.1, Maintainer: pkgsrc-users

Acora is 'fgrep' for Python, a fast multi-keyword text search engine.

Based on a set of keywords, it generates a search automaton (DFA) and runs it
over string input, either unicode or bytes.

It is based on the Aho-Corasick algorithm and an NFA-to-DFA powerset
construction.
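
As a rough illustration of the Aho-Corasick idea mentioned above, the following
is a minimal pure-Python sketch. It is not Acora's code or API: the names
build_automaton and find_all are hypothetical, and it follows failure links at
search time instead of performing the NFA-to-DFA powerset construction that the
description refers to. It accepts either unicode strings or bytes, and assumes
that the keywords are non-empty.

```python
from collections import deque


def build_automaton(keywords):
    """Build a keyword trie with Aho-Corasick failure links (illustrative only)."""
    goto = [{}]    # goto[state][symbol] -> next state
    fail = [0]     # fail[state] -> state to fall back to on a mismatch
    output = [[]]  # output[state] -> keywords that end at this state

    # Phase 1: insert every keyword into the trie.
    for keyword in keywords:
        state = 0
        for symbol in keyword:
            if symbol not in goto[state]:
                goto.append({})
                fail.append(0)
                output.append([])
                goto[state][symbol] = len(goto) - 1
            state = goto[state][symbol]
        if keyword not in output[state]:
            output[state].append(keyword)

    # Phase 2: compute failure links breadth-first, so that shallower
    # states are always finished before the deeper states that use them.
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for symbol, nxt in goto[state].items():
            queue.append(nxt)
            fallback = fail[state]
            while fallback and symbol not in goto[fallback]:
                fallback = fail[fallback]
            fail[nxt] = goto[fallback].get(symbol, 0)
            # Inherit matches that end at the fallback state (proper suffixes).
            output[nxt] = output[nxt] + output[fail[nxt]]
    return goto, fail, output


def find_all(keywords, text):
    """Return (keyword, start_position) pairs for every match in text."""
    goto, fail, output = build_automaton(keywords)
    matches = []
    state = 0
    for pos, symbol in enumerate(text):
        while state and symbol not in goto[state]:
            state = fail[state]
        state = goto[state].get(symbol, 0)
        for keyword in output[state]:
            matches.append((keyword, pos - len(keyword) + 1))
    return matches
```

The input is scanned once, and overlapping matches are reported as they are
found. A powerset-constructed DFA would precompute the fallback steps into a
transition table, trading construction time and memory for a simpler search
loop.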

Acora comes with both a pure Python implementation and a fast binary module
written in Cython. Note, however, that the current construction algorithm is
not suitable for very large sets of keywords (i.e. more than a couple of
thousand).


Required to run:
[lang/python27] [devel/py-cython] [devel/py-setuptools]

Master sites:

SHA1: dabbfcef46185a18a3e0f6eb1880d6af5f37dac6
RMD160: 703bfb93dc9c782a730a29d1452a93e752b1af11
Filesize: 191.95 KB
