./textproc/py-acora, Fast multi-keyword search engine for text strings in Python

Branch: CURRENT, Version: 2.0, Package name: py27-acora-2.0, Maintainer: rodent

Acora is 'fgrep' for Python, a fast multi-keyword text search engine.

Based on a set of keywords, it generates a search automaton (DFA) and runs it
over string input, either unicode or bytes.

It is based on the Aho-Corasick algorithm and an NFA-to-DFA powerset
construction.
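
To illustrate the general idea of multi-keyword matching, here is a minimal pure-Python sketch of the classic Aho-Corasick approach (a keyword trie with failure links). It is not Acora's implementation: Acora builds a DFA via powerset construction, while this sketch follows failure links at search time. The names `build_automaton` and `findall` are hypothetical helpers chosen for this example.

```python
from collections import deque


def build_automaton(keywords):
    """Build a keyword trie with failure links (illustrative sketch only)."""
    goto = [{}]   # goto[node][char] -> child node
    fail = [0]    # fail[node] -> node to fall back to on a mismatch
    out = [[]]    # out[node] -> keywords that end at this node

    # Phase 1: insert every keyword into the trie.
    for kw in keywords:
        node = 0
        for ch in kw:
            nxt = goto[node].get(ch)
            if nxt is None:
                nxt = len(goto)
                goto[node][ch] = nxt
                goto.append({})
                fail.append(0)
                out.append([])
            node = nxt
        out[node].append(kw)

    # Phase 2: breadth-first pass to compute failure links.
    queue = deque(goto[0].values())
    while queue:
        node = queue.popleft()
        for ch, child in goto[node].items():
            f = fail[node]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[child] = goto[f].get(ch, 0)
            # Inherit keywords that end at the fallback node.
            out[child] = out[child] + out[fail[child]]
            queue.append(child)
    return goto, fail, out


def findall(keywords, text):
    """Return (keyword, start_position) pairs for every match in text."""
    goto, fail, out = build_automaton(keywords)
    node = 0
    matches = []
    for pos, ch in enumerate(text):
        while node and ch not in goto[node]:
            node = fail[node]
        node = goto[node].get(ch, 0)
        for kw in out[node]:
            matches.append((kw, pos - len(kw) + 1))
    return matches


print(findall(["ab", "bc", "c"], "abcd"))
```

The text is scanned once regardless of how many keywords are given, and overlapping matches are all reported.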

Acora comes with both a pure Python implementation and a fast binary module
written in Cython. However, note that the current construction algorithm is not
suitable for really large sets of keywords (i.e. more than a couple of
thousand).


Required to run:
[devel/py-setuptools] [devel/py-cython] [lang/python27]

Required to build:
[pkgtools/cwrappers]

Master sites:

SHA1: 783a3367609bf118b34c86cf96e78ccea1b5e5cc
RMD160: 48327b8551511900397e8a21b53e2bd9a7efff7b
Filesize: 162.387 KB

CVS history:


   2017-01-12 01:45:31 by Blue Rats | Files touched by this commit (1)
Log message:
DEPENDS on devel/py-cython.
   2017-01-12 01:36:53 by Blue Rats | Files touched by this commit (3)
Log message:
2.0 [2016-03-17]

    rewrite of the construction algorithm to speed it up and save memory

1.9 [2015-10-10]

    recompiled with Cython 0.23.4 for better compatibility with recent Python versions.
   2016-06-08 19:43:49 by Thomas Klausner | Files touched by this commit (356)
Log message:
Switch to MASTER_SITES_PYPI.
   2015-11-04 03:00:17 by Alistair G. Crooks | Files touched by this commit (797)
Log message:
Add SHA512 digests for distfiles for textproc category

Problems found locating distfiles:
	Package cabocha: missing distfile cabocha-0.68.tar.bz2
	Package convertlit: missing distfile clit18src.zip
	Package php-enchant: missing distfile php-enchant/enchant-1.1.0.tgz

Otherwise, existing SHA1 digests verified and found to be the same on
the machine holding the existing distfiles (morden).  All existing
SHA1 digests retained for now as an audit trail.
   2014-02-13 01:50:13 by Blue Rats | Files touched by this commit (4)
Log message:
Import py27-acora-1.8 as textproc/py-acora.

Acora is 'fgrep' for Python, a fast multi-keyword text search engine.

Based on a set of keywords, it generates a search automaton (DFA) and runs it
over string input, either unicode or bytes.

It is based on the Aho-Corasick algorithm and an NFA-to-DFA powerset
construction.

Acora comes with both a pure Python implementation and a fast binary module
written in Cython. However, note that the current construction algorithm is not
suitable for really large sets of keywords (i.e. more than a couple of
thousand).