Log message:
cmark: updated to 0.31.0
0.31.0
* Update to 0.31.2 spec.txt.
* Treat unicode Symbols like Punctuation, as per the 0.31 spec.
* Add a new function to `utf8.h`:
`int cmark_utf8proc_is_punctuation_or_symbol(int32_t uc)`.
The old `cmark_utf8proc_is_punctuation` has been kept for
now, but it is no longer used.
* Add new exported function `cmark_parser_new_with_mem_into_root`
(API change) (John Ericson).
* Avoid repeated `language-` in info string
* Fix quadratic behavior in `S_insert_emph` (Nick Wellnhofer).
Fixes part of GHSA-66g8-4hjf-77xh.
* Fix quadratic behavior in `check_open_blocks` (Nick Wellnhofer).
Fixes part of GHSA-66g8-4hjf-77xh.
* Track underscore bottom separately mod 3, like asterisk (Michael
Howell). This was already implemented correctly for asterisks,
but not for underscore.
* Use `fwrite` instead of `printf` to print results in main
This avoids a massive slowdown in MSYS2.
* commonmark writer: less aggressive escaping for `!`
* Update libFuzzer build (Nick Wellnhofer):
+ Move fuzzing files into their own directory.
+ Use libFuzzer the modern way by compiling and linking with
`-fsanitize=fuzzer(-no-link)` without requiring `LIB_FUZZER_PATH`.
+ Update the `libFuzzer` rule in Makefile and the README.md.
* CMake build changes (Saleem Abdulrasool).
+ Inline multiple variables in CMake, following CMake recommendations.
+ Simplify the version computation.
+ Remove the `CMARK_STATIC` and `CMARK_SHARED` options as one of the two
must be enabled always as the cmark executable depends on the library.
Instead of having a custom flag to discern between the
library type, use the native CMake option `BUILD_SHARED_LIBS`,
allowing the user to control which library to build. This matches
CMake recommendations to only build a single copy of the library.
+ Introduce an author warning for the use of `CMARK_SHARED` and
`CMARK_STATIC` to redirect the author of the dependent package to
`BUILD_SHARED_LIBS`.
+ Permit incremental linking on Windows. Although incremental linking
does introduce padding in the binary for incremental links, that
should not matter for release mode builds in theory as `/OPT:REF`
and `/OPT:ICF` will trigger full links, which is the default in
release mode.
+ Hoist the CMake module inclusion to the top level.
+ Minor tweaks for speeding up the configure phase.
Restructure the file layout to place the custom modules into the
`cmake/modules` directory that is the common layout for CMake based
projects.
+ Squelch C4232 warnings on MSVC builds.
+ Remove check for `__builtin_expect`. Use `__has_builtin` to check
at compile time if the feature is supported.
This macro is supported by both clang and GCC (as of 10).
In the case that the compiler in use is not new enough, we still
provide the fallback so that the code will compile but without the
additional hints for the branch probability. `config.h` has been
removed from the code base as it is no longer needed.
+ Remove `/TP` usage on MSVC and replace `CMARK_INLINE` with `inline`.
These were workarounds for pre-VS2015 compilers, which are no longer
supported.
+ Hoist the C visibility settings to top level
+ Clean up C4267 warnings on MSVC builds.
+ Remove some compiler compatibility checks that are no longer
needed because VS 2013 is no longer supported
+ Adjust the policy to silence warnings on MSVC builds
CMake 3.15+ remove `/W3` from the language flags under MSVC with
CMP0092. Set the policy to new to avoid the D9025 warning.
+ Reflow some text to match CMake documentation style
+ Use generator expression for path computation.
+ Use CMake to propagate `CMARK_STATIC_DEFINE`.
+ Clean up an obsoleted variable (NFC).
+ Hoist the policy settings. Policy settings may impact how
`project` functions. They should be set immediately after
`cmake_minimum_required` (which implicitly sets policies).
Use the `POLICY` check to see if a policy is defined rather
than using a version check.
+ Replace `CMARK_TESTS` with CMake sanctioned `BUILD_TESTING`.
+ Correct typo and adjust command invocation. Use the proper
generator expression for the python interpreter and adjust
a typo in the component name.
+ Add an upgrade path for newer CMake.
CMake 3.12 deprecated `FindPythonInterp`, and with CMake 3.27, were
obsoleted with CMP0148. Add a version check and switch to the new
behaviour to allow building with newer releases.
* Fix regex syntax warnings in `pathological_tests.py` (Nick Wellnhofer).
* `test/cmark.py`: avoid star imports (Jakub Wilk).
* `spec_tests.py`: Add option to generate fuzz corpus (Nick Wellnhofer).
Add an option `--fuzz-corpus` that writes the test cases to separate
files including the options header, so they can be used as seed corpus
for fuzz testing.
* Fix some cmark.3 man rendering issues so we can do a clean regen
(John Ericson).
* Update Windows compilation instructions
|
Log message:
cmark: updated to 0.30.1
[0.30.1]
* Properly indent block-level contents of list items in man.
This handles nested lists as well as items with multiple paragraphs.
The change requires addition of a new field block_number_in_list_item
to cmark_renderer, but this does not change the public API.
* Fix quadratic behavior when parsing emphasis (Nick
Wellnhofer). Delimiters can be deleted, so store delimiter positions
instead of pointers in `openers_bottom`. Besides causing undefined
behavior when reading a dangling pointer, this could also result
in quadratic behavior when parsing emphasis.
* Fix quadratic behavior when parsing smart quotes (Nick Wellnhofer).
Remove matching smart quote delimiters. Otherwise, the same opener
could be found over and over, preventing the `openers_bottom`
optimization from kicking in and leading to quadratic behavior when
processing lots of quotes.
* Modify CMake configuration so that the project can be built with
older versions of CMake (Saleem Abdulrasool). (In 0.30.0,
some features were used that require CMake >= 3.3.) The cost of this
backwards compatibility is that developers must now explicitly invoke
`cmark_add_compile_options` when a new compilation target is added.
* Remove a comma at the end of an enumerator list, which was flagged
by clang as a C++11 extension.
* make_man_page.py: use absolute path with CDLL. This avoids the error
"file system relative paths not allowed in hardened programs."
* Include cmark version in cmark(3) man page (instead of LOCAL).
[0.30.0]
* Use official 0.30 spec.txt.
* Add `cmark_get_default_mem_allocator()`. API change: this
adds a new exported function in cmark.h.
* An optimization we used for emphasis parsing was
too aggressive, causing us to miss some emphasis that was legal
according to the spec. We fix this by indexing the `openers_bottom`
table not just by the type of delimiter and the length of the
closing delimiter mod 3, but by whether the closing delimiter
can also be an opener. (The algorithm for determining emphasis
matching depends on all these factors.) Add regression test.
* Fix quadratic behavior with inline HTML (Nick Wellnhofer).
Repeated starting sequences like `<?`, `<!DECL ` or `<![CDATA[` could
lead to quadratic behavior if no matching ending sequence was found.
Separate the inline HTML scanners. Remember if scanning the whole input
for a specific ending sequence failed and skip subsequent scans.
* Speed up hierarchy check in tree manipulation API (Nick Wellnhofer).
Skip hierarchy check in the common case that the inserted child has
no children.
* Fix quadratic behavior when parsing inlines (Nick Wellnhofer).
The inline parsing code would call `cmark_node_append_child` to append
nodes. This public function has a sanity check which is linear in the
depth of the tree. Repeated calls could show quadratic behavior in
degenerate trees. Use a special function to append nodes without this
check. (Issue found by OSS-Fuzz.)
* Replace invalid characters in XML output (Nick wellnhofer).
Control characters, U+FFFE and U+FFFF aren't allowed in XML 1.0, so
replace them with U+FFFD (replacement character). This doesn't solve
the problem how to roundtrip these characters, but at least we don't
produce invalid XML.
* Avoid quadratic output growth with reference links (Nick Wellnhofer).
Keep track of the number bytes added through expansion of reference
links and limit the total to the size of the input document. Always
allow a minimum of 100KB. Unfortunately, cmark has no error handling,
so all we can do is to stop expanding reference links without returning
an error. This should never be an issue in practice though. The 100KB
minimum alone should cover all real-world cases.
* Fix issue with type-7 HTML blocks interrupting paragraphs
(see commonmark/commonmark.js).
* Treat `textarea` like `script`, `style`, `pre` (type 1 HTML block),
in accordance with spec change.
* Define whitespace per spec (Asherah Conor).
* Add `MAX_INDENT` for xml. Otherwise we can get quadratic
increase in size with deeply nested structures.
* Fix handling of empty strings when creating XML/HTML output
(Steffen Kieß).
* Commonmark renderer: always use fences for code.
This solves problems with adjacent code blocks being merged.
* Improve rendering of commonmark code spans with spaces.
* Cleaner approach to max digits for numeric entities.
This modifies unescaping in `houdini_html_u.c` rather than
the entity handling in `inlines.c`. Unlike the other,
this approach works also in e.g. link titles.
* Fix entity parser (and api test) to respect length limit on
numeric entities.
* Don't allow link destinations with unbalanced unescaped parentheses.
See commonmark/commonmark.js.
* `print_usage()`: Minor grammar fix, swap two words (Øyvind A. Holm).
* Don't call `memcpy` with `NULL` as first parameter.
This is illegal according to the C standard, sec. 7.1.4.
See <https://www.imperialviolet.org/2016/06/26/nonnull.html>.
* Add needed include in `blocks.c`.
* Fix unnecessary variable assignment.
* Skip UTF-8 BOM if present at beginning of buffer.
* Fix URL check in `is_autolink` (Nick Wellnhofer). In a recent commit,
the check was changed to `strcmp`, but we really have to use `strncmp`.
* Fix null pointer deref in `is_autolink` (Nick Wellnhofer).
Introduced by a recent commit. Found by OSS-Fuzz.
* Rearrange struct cmark_node (Nick Wellnhofer). Introduce multi-purpose
data/len members in struct cmark_node. This is mainly used to store
literal text for inlines, code and HTML blocks.
Move the content strbuf for blocks from `cmark_node` to `cmark_parser`.
When finalizing nodes that allow inlines (paragraphs and headings),
detach the strbuf and store the block content in the node's data/len
members. Free the block content after processing inlines.
Reduces size of struct `cmark_node` by 8 bytes.
* Improve packing of `struct cmark_list` (Nick Wellnhofer).
* Use C string instead of chunk in a number of contexts (Nick Wellnhofer).
The node struct never references memory of other nodes now.
Node accessors don't have to check for delayed creation of C strings,
so parsing and iterating all literals using the public API should
actually be faster than before. These changes also reduce the size
of `struct cmark_node`.
* Add casts for MSVC10 (from kivikakk in cmark-cfm).
* commonmark renderer: better escaping in smart mode. When
`CMARK_OPT_SMART` is enabled, we escape literal `-`, `.`, and quote
characters when needed to avoid their being "smartified."
* Add options field to `cmark_renderer`.
* commonmark.c - use `size_t` instead of `int`.
* Include `string.h` in `cmark-fuzz.c`.
* Fix (hash collisions for references) (Vicent Marti via cmark-gfm).
Reimplemented reference storage as follows:
1. New references are always inserted at the end of a linked list. This
is an O(1) operation, and does not check whether an existing (duplicate)
reference with the same label already exists in the document.
2. Upon the first call to `cmark_reference_lookup` (when it is expected
that no further references will be added to the reference map), the
linked list of references is written into a fixed-size array.
3. The fixed size array can then be efficiently sorted in-place in O(n
log n). This operation only happens once. We perform this sort in a
_stable_ manner to ensure that the earliest link reference in the
document always has preference, as the spec dictates. To accomplish
this, every reference is tagged with a generation number when initially
inserted in the linked list.
4. The sorted array is then compacted in O(n). Since it was sorted in a
stable way, the first reference for each label is preserved and the
duplicates are removed, matching the spec.
5. We can now simply perform a binary search for the current
`cmark_reference_lookup` query in O(log n). Any further lookup calls
will also be O(log n), since the sorted references table only needs to
be generated once.
The resulting implementation is notably simple (as it uses standard
library builtins `qsort` and `bsearch`), whilst performing better than
the fixed size hash table in documents that have a high number of
references and never becoming pathological regardless of the input.
* Comment out unused function `cmark_strbuf_cstr` in `buffer.h`.
* Re-add `--safe` command-line option as a no-op, for backwards
compatibility.
* Update to Unicode 13.0
* Generate and install cmake-config file (Reinhold Gschweicher).
Add full cmake support. The project can either be used with
`add_subdirectory` or be installed into the system (or some other
directory) and be found with `find_package(cmark)`. In both cases the
cmake target `cmark::cmark` and/or `cmark::cmark_static` is all that
is needed to be linked. Previously the `cmarkConfig.cmake` file
was generated, but not installed. As additional bonus of generation
by cmake we get a generated `cmake-config-version.cmake` file for
`find_package()` to search for the same major version.
The generated config file is position independent, allowing the
installed directory to be copied or moved and still work.
The following four files are generated and installed:
`lib/cmake/cmark/cmark-config.cmake`,
`lib/cmake/cmark/cmark-config-version.cmake`,
`lib/cmake/cmark/cmark-targets.cmake`,
`lib/cmake/cmark/cmark-targets-release.cmake`.
* Adjust the MinGW paths for MinGW64 (Daniil Baturin).
* Fix CMake generator expression checking for MSVC (Nick Wellnhofer).
* Fix `-Wconst-qual` warning (Saleem Abdulrasool). This enables building
with `/Zc:strictString` with MSVC as well.
* Improve and modernize cmake build (Saleem Abdulrasool).
+ Build: add exports targets for build tree usage.
+ Uuse target properties for include paths.
+ Remove the unnecessary execute permission on CMakeLists.txt.
+ Reduce property computation in CMake.
+ Use `CMAKE_INCLUDE_CURRENT_DIRECTORY`.
+ Improve man page installation.
+ Only include `GNUInstallDirs` once.
+ Replace `add_compile_definitions` with `add_compile_options`
since the former was introduced in 3.12.
+ Cleanup CMake.
+ Inline a variable.
+ Use `LINKER_LANGUAGE` property for C++ runtime.
+ Use CMake to control C standard.
+ Use the correct variable.
+ Loosen the compiler check
+ Hoist shared flags to top-level CMakeLists
+ Remove duplicated flags.
+ Use `add_compile_options` rather than modify `CMAKE_C_FLAGS`.
+ Hoist sanitizer flags to global state.
+ Hoist `-fvisibilty` flags to top-level.
+ Hoist the debug flag handling.
+ Hoist the profile flag handling.
+ Remove incorrect variable handling.
+ Remove unused CMake includes.
* Remove "-rdynamic" flag for static builds (Eric Pruitt).
* Fixed installation on other than Ubuntu GNU/Linux distributions
(Vitaly Zaitsev).
* Link executable with static or shared library (Nick Wellnhofer).
If `CMARK_STATIC` is on (default), link the executable with the static
library. This produces exactly the same result as compiling the library
sources again and linking with the object files.
If `CMARK_STATIC` is off, link the executable with the shared library.
This wasn't supported before and should be the preferred way to
package cmark on Linux distros.
Building only a shared library and a statically linked executable
isn't supported anymore but this doesn't seem useful.
* Reintroduce version check for MSVC /TP flag (Nick Wellnhofer).
The flag is only required for old MSVC versions.
* normalize.py: use `html.escape` instead of `cgi.escape`.
* Fix pathological_tests.py on Windows (Nick Wellnhofer).
When using multiprocessing on Windows, the main program must be
guarded with a `__name__` check.
* Remove useless `__name__` check in test scripts (Nick Wellnhofer).
* Add CIFuzz (Leo Neat).
* cmark.1 - Document --unsafe instead of --safe.
* cmark.1: remove docs for `--normalize` which no longer exists.
* Add lint target to Makefile.
* Add uninstall target to Makefile.
* Update benchmarks.
* Fix typo in documentation (Tim Gates).
* Increase timeout for pathological tests to avoid CI failure.
* Update the Racket wrapper with the safe -> unsafe flag change
|