./textproc/py-feedparser, Parse RSS and Atom feeds in Python

[ CVSweb ] [ Homepage ] [ RSS ] [ Required by ] [ Add to tracker ]


Branch: CURRENT, Version: 5.2.1, Package name: py27-feedparser-5.2.1, Maintainer: schmonz

Universal Feed Parser is a Python module for downloading and parsing
syndicated feeds. It can handle RSS 0.90, Netscape RSS 0.91, Userland
RSS 0.91, RSS 0.92, RSS 0.93, RSS 0.94, RSS 1.0, RSS 2.0, Atom 0.3,
Atom 1.0, and CDF feeds. It also parses several popular extension
modules, including Dublin Core and Apple's iTunes extensions.

To use Universal Feed Parser, you will need Python 2.1 or later.
Universal Feed Parser is not meant to run standalone; it is a module
for you to use as part of a larger Python program.

Universal Feed Parser is easy to use; the module is self-contained
in a single file, feedparser.py, and it has one primary public
function, parse. parse takes a number of arguments, but only one
is required, and it can be a URL, a local filename, or a raw string
containing feed data in any format.


Required to run:
[devel/py-setuptools] [lang/python27]

Master sites:

SHA1: 84f05bf1d1443cc55b9893e2cb5aa2b16ab55dec
RMD160: d48648b794bdd8af38e77b8337ac2241d79bca4c
Filesize: 247.027 KB

Version history: (Expand)


CVS history: (Expand)


   2015-10-31 15:18:32 by Amitai Schlair | Files touched by this commit (2) | Package updated
Log message:
Update to 5.2.1. From the changelog:

* Fix #22 (pip package keeps upgrading all the time)
* Support PyPy
* Remove the HTTP Status 9001 test that caused unit test tracebacks
* Remove the completely-untested HTML tidy code
* Remove BeautifulSoup as a dependency
* Remove the XFN microformat parsing code
* Remove the rel_enclosure microformat parsing code
* Remove the rel_hcard microformat parsing code
* Remove the rel_tag microformat parsing code
* Replace the regex-based RFC 822 date parser with a procedural one
* Replace the Python-licensed W3DTF date parser
* Support HTML5 audio/source/video element relative URL's
* Remove the unparsed itunes_keywords key from the result dictionary
* Fix issue 321 just a little more (yet another code path was missed)
* Issue 62 (support georss and gml namespaces)
* Issue 296 (GUID's are always treated like relative URI's)
* Issue 334 (media:restriction element content is not returned)
* Issue 335 (sub-elements of media:group are not parsed and returned)
* Issue 342 (support multiple dc:creator elements)
* Issue 357 (loose parser breaks ampersands in link element URL's)
* Issue 374 (support the Podlove Simple Chapters namespace)
* Issue 380 (support media:rating element)
* Issue 384 (fix chardet support in Python 3)
* Issue 389 (elements in unknown uppercase namespaces are ignored)
* Issue 392 (tags element subverts 'tags' key in result dictionary)
* Issue 396 (Podlove Simple Chapters version 1.0 causes a KeyError)
* Issue 399 (docs call `request_headers` parameter `extra_headers`)
* Issue 401 (support additional dcterms and media namespaces elements)
* Issue 404 (support asctime datetime strings with timezone information)
* Issue 407 (decode forward slashes encoded as character entities)
* Issue 421 (delay chardet invocation as long as possible)
* Issue 422 (add return types docstrings)
* Issue 433 (update the list of allowed MathML elements and attributes)
   2014-01-25 11:30:32 by Thomas Klausner | Files touched by this commit (533) | Package updated
Log message:
Mark packages as not ready for python-3.x where applicable;
either because they themselves are not ready or because a
dependency isn't. This is annotated by
PYTHON_VERSIONS_INCOMPATIBLE=  33 # not yet ported as of x.y.z
or
PYTHON_VERSIONS_INCOMPATIBLE=  33 # py-foo, py-bar
respectively, please use the same style for other packages,
and check during updates.

Use versioned_dependencies.mk where applicable.
Use REPLACE_PYTHON instead of handcoded alternatives, where applicable.
Reorder Makefile sections into standard order, where applicable.

Remove PYTHON_VERSIONS_INCLUDE_3X lines since that will be default
with the next commit.

Whitespace cleanups and other nits corrected, where necessary.
   2013-06-06 03:57:55 by Amitai Schlair | Files touched by this commit (1)
Log message:
The revived rss2email (not yet in pkgsrc) requires Python 3.2 or
higher, and depends on this, so this must work with Python 3.2 or
higher.
   2013-01-14 15:03:59 by Amitai Schlair | Files touched by this commit (2) | Package updated
Log message:
Update to 5.1.3. From the changelog:

* Consolidated and simplified the character encoding detection code
* Issue 346 (the gb2312 encoding isn't always upgraded to gb18030)
* Issue 350 (HTTP Last-Modified example is incorrect in documentation)
* Issue 352 (importing lxml.etree changes what exceptions libxml2 throws)
* Issue 356 (add support for the HTML5 attributes `poster` and `preload`)
* Issue 364 (enclosure-sniffing microformat code can throw ValueError)
* Issue 373 (support RFC822-ish dates with swapped days and months)
* Issue 376 (uppercase 'X' in hex character references cause ValueError)
* Issue 382 (don't strip inline user:password credentials from FTP URL's)
   2012-10-25 08:57:09 by Aleksej Saushev | Files touched by this commit (587)
Log message:
Drop superfluous PKG_DESTDIR_SUPPORT, "user-destdir" is default these days.
   2012-05-26 18:51:59 by Amitai Schlair | Files touched by this commit (2) | Package updated
Log message:
Update to 5.1.2. From the changelog:

* Minor changes to the documentation
* Strip potentially dangerous ENTITY declarations in encoded feeds
* feedparser will now try to continue parsing despite compression errors
* Fix issue 321 a little more (the initial fix missed a code path)
* Issue 337 (`_parse_date_rfc822()` returns None on single-digit days)
* Issue 343 (add magnet links to the ACCEPTABLE_URI_SCHEMES)
* Issue 344 (handle deflated data with no headers nor checksums)
* Issue 347 (support `itunes:image` elements with a `url` attribute)
* Fix mistakes, typos, and bugs in the unit test code
* Fix crash in Python 2.4 and 2.5 if the feed has a UTF_32 byte order mark
* Replace the RFC822 date parser for more extensibility
* Issue 304 (handle RFC822 dates with timezones like GMT+00:00)
* Issue 309 (itunes:keywords should be split by commas, not whitespace)
* Issue 310 (pubDate should map to `published`, not `updated`)
* Issue 313 (include the compression test files in MANIFEST.in)
* Issue 314 (far-flung RFC822 dates don't throw OverflowError on x64)
* Issue 315 (HTTP server for unit tests runs on 0.0.0.0)
* Issue 321 (malformed URIs can cause ValueError to be thrown)
* Issue 322 (HTTP redirect to HTTP 304 causes SAXParseException)
* Issue 323 (installing chardet causes 11 unit test failures)
* Issue 325 (map `description_detail` to `summary_detail`)
* Issue 326 (Unicode filename causes UnicodeEncodeError if locale is ASCII)
* Issue 327 (handle RFC822 dates with extraneous commas)
* Issue 328 (temporarily map `updated` to `published` due to issue 310)
* Issue 329 (escape backslashes in Windows path in docs/introduction.rst)
* Issue 331 (don't escape backslashes that are in raw strings in the docs)
   2012-02-04 13:45:36 by OBATA Akio | Files touched by this commit (1)
Log message:
No compiler is requilred.
   2012-01-12 19:28:30 by Joerg Sonnenberger | Files touched by this commit (2)
Log message:
Simplify. Don't allow Python 3 due to unsupported setuptools dependency.