./textproc/py-acora, Fast multi-keyword search engine for text strings in Python

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 2.1, Package name: py27-acora-2.1, Maintainer: pkgsrc-users

Acora is 'fgrep' for Python, a fast multi-keyword text search engine.

Based on a set of keywords, it generates a search automaton (DFA) and runs it
over string input, either unicode or bytes.

It is based on the Aho-Corasick algorithm and an NFA-to-DFA powerset
construction.

Acora comes with both a pure Python implementation and a fast binary module
written in Cython. However, note that the current construction algorithm is not
suitable for really large sets of keywords (i.e. more than a couple of
thousand).


Required to run:
[devel/py-setuptools] [devel/py-cython] [lang/python27]

Required to build:
[pkgtools/cwrappers]

Master sites:

SHA1: dabbfcef46185a18a3e0f6eb1880d6af5f37dac6
RMD160: 703bfb93dc9c782a730a29d1452a93e752b1af11
Filesize: 191.95 KB

Version history: (Expand)


CVS history: (Expand)


   2017-12-21 15:09:13 by Adam Ciarcinski | Files touched by this commit (2) | Package updated
Log message:
py-acora: updated to 2.1

2.1:
* fix handling of empty engines
   2017-09-16 21:27:31 by Thomas Klausner | Files touched by this commit (372)
Log message:
Reset maintainer
   2017-09-03 10:53:18 by Thomas Klausner | Files touched by this commit (165)
Log message:
Follow some redirects.
   2017-01-12 01:45:31 by Blue Rats | Files touched by this commit (1)
Log message:
DEPENDS on devel/py-cython.
   2017-01-12 01:36:53 by Blue Rats | Files touched by this commit (3)
Log message:
2.0 [2016-03-17]

    rewrite of the construction algorithm to speed it up and save memory

1.9 [2015-10-10]

    recompiled with Cython 0.23.4 for better compatibility with recent Python \ 
versions.
   2016-06-08 19:43:49 by Thomas Klausner | Files touched by this commit (356)
Log message:
Switch to MASTER_SITES_PYPI.
   2015-11-04 03:00:17 by Alistair G. Crooks | Files touched by this commit (797)
Log message:
Add SHA512 digests for distfiles for textproc category

Problems found locating distfiles:
	Package cabocha: missing distfile cabocha-0.68.tar.bz2
	Package convertlit: missing distfile clit18src.zip
	Package php-enchant: missing distfile php-enchant/enchant-1.1.0.tgz

Otherwise, existing SHA1 digests verified and found to be the same on
the machine holding the existing distfiles (morden).  All existing
SHA1 digests retained for now as an audit trail.
   2014-02-13 01:50:13 by Blue Rats | Files touched by this commit (4)
Log message:
Import py27-acora-1.8 as textproc/py-acora.

Acora is 'fgrep' for Python, a fast multi-keyword text search engine.

Based on a set of keywords, it generates a search automaton (DFA) and runs it
over string input, either unicode or bytes.

It is based on the Aho-Corasick algorithm and an NFA-to-DFA powerset
construction.

Acora comes with both a pure Python implementation and a fast binary module
written in Cython. However, note that the current construction algorithm is not
suitable for really large sets of keywords (i.e. more than a couple of
thousand).