./converters/py-html2text, Convert HTML into easy-to-read plain ASCII text

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 2024.2.26, Package name: py311-html2text-2024.2.26, Maintainer: schmonz

html2text is a Python script that convers a page of HTML into clean,
easy-to-read plain ASCII text. Better yet, that ASCII also happens
to be valid Markdown (a text-to-HTML format).


Required to run:
[devel/py-setuptools] [lang/python310]

Master sites:

Filesize: 55.202 KB

Version history: (Expand)


CVS history: (Expand)


   2024-02-28 13:34:10 by Thomas Klausner | Files touched by this commit (1)
Log message:
py-html2text: also needs py-wheel to build
   2024-02-28 12:56:39 by Amitai Schleier | Files touched by this commit (1)
Log message:
py-html2text: needs py-setuptools to build.
   2024-02-27 20:34:05 by Amitai Schleier | Files touched by this commit (2) | Package updated
Log message:
py-html2text: update to 2024.2.26. Changes:

- Fix IndexError on empty strong mark on version.
   2024-02-25 23:31:25 by Amitai Schleier | Files touched by this commit (3) | Package updated
Log message:
py-html2text: update to 2024.2.25. Changes:

* Fix #332: Insert at most one space for multiple emphasis
* Feature #318: Make padded tables more similar to pandoc's pipe_tables.
* Add support for Python 3.9.
* Fix extra line breaks inside html link text (between '[' and ']')
* Fix #344: indent ``<ul>`` inside ``<ol>`` three spaces instead of two
  to comply with CommonMark, GFM, etc.
* Fix #324: unnecessary spaces around ``<b>``, ``<em>``, and \ 
``strike`` tags.
* Don't wrap tables by default and add a ``--wrap-tables`` config option.
* Feature #198: Ignore ``<p>`` tags inside table rows.
* Don't wrap tables by default and add a ``--wrap-tables`` config option
* Remove support for Python ≤ 3.5. Now requires Python 3.6+.
* Support for Python 3.10+.
* Fix #320 padding empty tables and tables with no ``</tr>`` tags.
* Add ``ignore_mailto_links`` config option to ignore ``mailto:``
  style links.
* Feature #407: Support the superscript and subscript tags.
* Fix #373: ``\n`` inside text of a Markdown link.
* Feature #406: Improve support for null atttibute values.
   2022-01-05 16:41:32 by Thomas Klausner | Files touched by this commit (289)
Log message:
python: egg.mk: add USE_PKG_RESOURCES flag

This flag should be set for packages that import pkg_resources
and thus need setuptools after the build step.

Set this flag for packages that need it and bump PKGREVISION.
   2022-01-04 21:55:40 by Thomas Klausner | Files touched by this commit (1595)
Log message:
*: bump PKGREVISION for egg.mk users

They now have a tool dependency on py-setuptools instead of a DEPENDS
   2021-10-26 13:23:42 by Nia Alarie | Files touched by this commit (1161)
Log message:
textproc: Replace RMD160 checksums with BLAKE2s checksums

All checksums have been double-checked against existing RMD160 and
SHA512 hashes

Unfetchable distfiles (fetched conditionally?):
./textproc/convertlit/distinfo clit18src.zip
   2021-10-07 17:02:49 by Nia Alarie | Files touched by this commit (1162)
Log message:
textproc: Remove SHA1 hashes for distfiles