./devel/RE-flex, Fast lexical analyzer generator for C++ with full Unicode support

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 5.2.2, Package name: RE-flex-5.2.2, Maintainer: pkgsrc-users

RE-flex is a regex-centric, fast lexical analyzer generator for C++
with full Unicode support.

Faster than Flex and other regex matchers. Accepts Flex
specifications. Generates reusable source code that is easy to
understand. Easily integrates with Bison and other parsers. Includes
a fast stand-alone regex engine and library.

It also can use regex matchers from boost or pcre2, as well as
visualizing FSM graphs with graphviz.


Required to run:
[devel/bison]

Required to build:
[pkgtools/cwrappers]

Package options: examples

Master sites:

Filesize: 7125.935 KB

Version history: (Expand)


CVS history: (Expand)


   2025-02-16 10:40:42 by Makoto Fujiwara | Files touched by this commit (1)
Log message:
(devel/RE-flex) Correction to intentional extra space on COMMENT= , sorry
   2025-02-16 10:35:16 by Makoto Fujiwara | Files touched by this commit (1) | Package updated
Log message:
(devel/RE-flex) Updated 4.1.2 to 5.2.2

 I did send wrong commit message for above, let me add space after COMMENT
 I'll correct it later, sorry

 https://github.com/Genivia/RE-flex/releases/
-------------------------------
reflex v5.2.2 --- 03 Feb 18:46, 2025
What's new?
  • Matcher::find() speed improvements for certain regex patterns that do not
    match the input.
-------------------------------
reflex v5.2.1 --- 27 Jan 16:06, 2025
What's new?
  • fix a regression issue in v5.2 with non-SIMD-optimized search with
    Matcher::find()
-------------------------------
reflex v5.2 --- 24 Jan 17:05, 2025
What's new?
  • new file encoding type null_data to read NUL as LF and vice versa; supports
    reading xargs -0 output for example
-------------------------------
reflex v5.1.1 --- 10 Jan 02:04, 2025
What's new?

  • fix a minor issue with case-insensitive matching when regex patterns are
    specified in certain ways in combination with "string-like" \ 
patterns such
    that one ore more alternating sub-patterns overlap from the start.
-------------------------------
reflex v5.1 --- 29 Nov 21:31, 2024
What's new?

  • usability improvements
  • update the regex converter of reflex::Matcher (and reflex::FuzzyMatcher by
    inheritance) to accept \0 as NUL #216
  • appease compiler warnings #217
-------------------------------
reflex v5.0.1 --- 29 Oct 23:29, 2024

Minor update to fix compilation issue #214.
-------------------------------
reflex v5.0 --- 29 Oct 17:07, 2024
What's new?

  • faster SIMD regex search methods, see also the ugrep project update
  • improved anchor and word boundary matching speed and support
  • new std::string_view strview() matcher method
-------------------------------
reflex v4.5.0 --- 01 Aug 16:15, 2024
What's new?

Minor speed and usability improvements.
-------------------------------
reflex v4.4.0 --- 06 Jun 18:46, 2024
What's new?

  • upgraded reflex::Matcher and reflex::FuzzyMatcher to respect Unicode word
    boundaries instead of only ASCII \<, \>, \b, \B
  • upgraded regex Unicode converters to support Unicode [::] character classes
    instead of only ASCII [[:alpha:]] etc., note that \p{Class} always supports
    Unicode (a regex converter is invoked with reflex::Matcher::convert(...) to
    support Unicode and other syntax constructs that the lower-level
    reflex::Matcher does not support, see documentation)
  • improved FSM code generation
-------------------------------
reflex v4.3.0 --- 12 May 17:14, 2024
What's new?

  • faster Matcher::find() with refactored SIMD (SSE2/AVX2/AVX512BW/NEON/
    AArch64) code like ugrep 6.0 and with run-time AVX2/AVX512BW detection
  • increased default internal input buffer size

Note
When using cmake to build, please add a missing lib/simd.cpp lib source to
CMakeLists.txt:
   2025-02-16 02:23:12 by Makoto Fujiwara | Files touched by this commit (4)
Log message:
(lang/rakudo) Updated 2024.10 to 2025.01

 Rakudo compiler, Release #179 (2025.01)

2025-01-25

On behalf of the Rakudo development team, I’m very happy to announce
the January 2025 release of Rakudo #179. Rakudo is an implementation
of the Raku1 language.

The source tarball for this release is available from
https://rakudo.org/files/rakudo. Pre-compiled archives will be
available shortly.

New in 2025.01:

    Improvements:
        Add lower-bounding for the revision-gated trait [7c7ad5bd][0e9a2d73]
        Pod::To::Text: Sort enum pairs by value [c7004109]

        Improve throughput and reliability of
        CI::PrecompilationRepository [b6242f4b][4919b647]

        Get PRs working for everyone [b6ea5783]

        Make native arrays Cool (e.g. allow native arrays to be
        endpoints for Ranges [17fe51b8][8f38af87]

        Add proper support for regexes to multi-needle .split [77c4c64e][bea1f099]
        Final step of re-imaginining .trans [d7feae2f][e5a715ab][bcd7beaf][f055c31e]
        Update documentation of recently added syscalls [3265ff3b]
        Optimize Str.(words|lines) [603b07ac]

    Additions:
        Add VM.ownup to produce backtraces of all threads and exit \ 
[7ef44ef5][70d66b98]
        Provide adverbial forms of map [24b4001e][720cd014]

    Fixes:
        Fix obsolete reference to CompUnit::PlationStore::File [b2dd1185]
        Make sure strarray.raku doesn't die [c0c2e01f]
        Move precomp docker host ubuntu 24.04 -> 20.04 [eba9332d]
        Use .raku suffix in doc of install-dist instead of .p6 [a66b2e16]

    Internal:
        Run full spectest in CI (#5747) [ebecbeaf]
        Remove unnecessary stubs [3559bb9b]
        Move test file that dies on JVM to t/09-moar [4aa8d331]
        [JVM] Add extra check for GLOBAL being null (#5710) [e7f9223c]
        [JVM] Reduce backend-specific code in find_best_dispatchee (#5714) [8991173f]
        Remove playful "just yet" in error messages [87f0e37f]
        Remove now unnecessary R:I:ALL_TYPE method [26fbd4c3]
        Only call ensure-sc if there is a descriptor [70c268ba]

    RakuAST Development:
        Make A, P, and Δ = Capital Greek Delta) produce AST like markup X [f57a989a]
   2024-03-27 14:59:35 by Thomas Klausner | Files touched by this commit (2)
Log message:
RE-flex: fix doxygen option

Fix configure argument.

From Malte Dehling.
   2024-03-21 21:44:41 by Thomas Klausner | Files touched by this commit (13) | Package updated
Log message:
devel/RE-flex: Update to v4.1.2

Prepared by Malte Dehling in wip.

pkgsrc change: turn off cpu-optimization option by default.

Dec 10, 2021: 3.1.0 improved buffering mechanism with reduced buffer sizes; \ 
faster columno() for long lines; fix CP-1251 table typo.
Feb 20, 2022: 3.2.0 new %option ctorinit; faster compilation of regular \ 
expressions to tables and direct code DFAs; refactored SIMD source code to \ 
enable AVX2 and AVX512BW optimizations in multi-version matcher code; updated \ 
Windows binary file opening.
Feb 23, 2022: 3.2.1 regression bug in 3.2.0 fix.
Mar 13, 2022: 3.2.2 minor improvements.
Apr 2, 2022: 3.2.3 fixed C++17 compliance and dso build.
Apr 29, 2022: 3.2.4 new %begin directive; new --batch=SIZE option argument.
Apr 30, 2022: 3.2.5 minor correction.
May 6, 2022: 3.2.6 improved fuzzy matcher.
May 10, 2022: 3.2.7 improved fuzzy matcher, see also \ 
https://github.com/Genivia/FuzzyMatcher.
Jul 6, 2022: 3.2.8 minor update to correct a lexer file parsing issue in the \ 
reflex code generator tool.
Aug 12, 2022: 3.2.9 add cmake rules for Windows builds.
Aug 21, 2022: 3.2.10 bug fix in regular expression converter.
Aug 29, 2022: 3.2.11 new lua2lisp transpiler example with Bison complete symbols \ 
and locations.
Jan 6, 2023: 3.2.12 regression bug fix in trailing context pattern matching, \ 
e.g. xy and x/x patterns collided when they should not; updated yaml parser \ 
example.
Jan 24, 2023: 3.3.0 overall update; corrected a problem with Unicode characters \ 
in regex patterns adjacent to curly braces; permit cxx extension in FSM filename \ 
output.
Mar 6, 2023: 3.3.1 for consistency include NAME in default header file name, \ 
tables file name, and graphs file name when option --prefix is specified.
Mar 17, 2023: 3.3.2 fix a performance issue with case-insensitive pattern \ 
construction.
May 28, 2023: 3.3.3 fix yyrestart dropping the first character; faster \ 
Matcher::find().
May 31, 2023: 3.3.4 fix Matcher::find() initialization issue in 3.3.3.
Jun 12, 2023: 3.3.5 improve source code output of lexer class definitions.
Jul 11, 2023: 3.3.6 faster Matcher::find(); improved --stdout to include tables.
Jul 17, 2023: 3.3.7 faster Matcher::find().
Aug 4, 2023: 3.3.8 minor update to sync up the code base with the ugrep project.
Aug 16, 2023: 3.3.9 fix avx512bw compilation error; new LineMatcher matching engine.
Sep 16, 2023: 3.4.0 fix FuzzyMatcher::DEL flag when this is the only flag \ 
selected; fix FuzzyMatcher::matches() bug that incorrectly matched an extra \ 
character before the end of the input; optimize find(); updated saving the FSM \ 
pred[] hashes to a file, which has changed; increase default buffer size \ 
REFLEX_BUFSZ to 128K for best throughput performance.
Sep 25, 2023: 3.4.1 make word boundaries \b, \B, \< and \> applicable \ 
anywhere in a pattern.
Oct 7, 2023: 3.5.0 updated to Unicode 15.1; clarify . (dot) with %unicode \ 
enabled, which is a catch-all pattern; update \X to match only valid Unicode \ 
characters.
Nov 5, 2023: 3.5.1 minor improvements.
Feb 17, 2024: 4.0.0 faster Matcher::find() with a new DFA cut algorithm to \ 
optimize match prediction speed and accuracy, see also ugrep 5.0; apply Unicode \ 
pattern canonicalization with reflex::convert(..., \ 
reflex::convert_flag::unicode).
Feb 23, 2024: 4.0.1 new rawk example to demonstrate awk-like fast search in C++; \ 
enable <<EOF>> rules for option find to generate a fast search \ 
engine.
Mar 5, 2024: 4.1.0 improved lazy quantifiers for POSIX regex lazy matching in \ 
linear time using an advanced DFA transformation algorithm introduced in RE/flex \ 
in 2016.
Mar 11, 2024: 4.1.1 minor update to correct a DFA construction problem for POSIX \ 
regex lazy quantifiers matching too much in some cases.
Mar 17, 2024: 4.1.2 updated configure scripts; cast negative ctype function \ 
arguments (problem detected on NetBSD 10).
   2021-11-28 21:22:33 by Sean Cole | Files touched by this commit (5) | Package updated
Log message:
Updated to 3.0.12

Sep 20, 2020: 2.1.5 added matcher method lineno(n) to set or change the line \ 
number to n; added yyset_lineno(n,s) to flexlexer.h; updated Mini C compiler \ 
example.
Oct 12, 2020: 3.0.0 fixed a regression bug since v2.x; redesigned internals to \ 
increase IO efficiency and regex pattern search speed.
Oct 24, 2020: 3.0.1 improved handling of UTF-16/32 file encodings on Windows to \ 
prevent ^Z eof when files are opened in text mode.
Mar 16, 2021: 3.0.2 minor improvements and additions; fixed MinGW builds.
Apr 29, 2021: 3.0.3 fixed reflex::Input copy constructor; minor improvements.
May 27, 2021: 3.0.4 improvements and fixes for minor issues.
Jun 1, 2021: 3.0.5 minor usability improvements; changed \d to match Unicode \ 
when option unicode is enabled.
Jun 4, 2021: 3.0.6 minor improvements.
Jun 6, 2021: 3.0.7 updated examples to fix build problems.
Jul 15, 2021: 3.0.8 minor improvements.
Jul 22, 2021: 3.0.9 SIMD code split into simd_avx2.cpp and simd_avx512bw.cpp to \ 
support runtime CPU ID checking when the library is built with ./configure; \ 
make, disable AVX with ./configure --disable-avx, disable SSE2 with ./configure \ 
--disable-sse2; UTF-16LE BOM detection correction.
Aug 14, 2021: 3.0.10 fixed missing simd.h after installation, added \ 
REFLEX_BUFFER_SIZE to customize the initial size and growth of the input buffer.
Sep 14, 2021: 3.0.11 minor change to apply --prefix to the generated \ 
REFLEX_code_[PREFIX]STATE code.
Oct 11, 2021: 3.0.12 updated to Unicode 14; fixed a compilation issue with \ 
--params when used with --flex.
   2021-10-26 12:20:11 by Nia Alarie | Files touched by this commit (3016)
Log message:
archivers: Replace RMD160 checksums with BLAKE2s checksums

All checksums have been double-checked against existing RMD160 and
SHA512 hashes

Could not be committed due to merge conflict:
devel/py-traitlets/distinfo

The following distfiles were unfetchable (note: some may be only fetched
conditionally):

./devel/pvs/distinfo pvs-3.2-solaris.tgz
./devel/eclipse/distinfo eclipse-sourceBuild-srcIncluded-3.0.1.zip
   2021-10-07 15:44:44 by Nia Alarie | Files touched by this commit (3017)
Log message:
devel: Remove SHA1 hashes for distfiles