NOTICE: This package has been removed from pkgsrc

./wip/hs-tagsoup, Haskell library for parsing unstructured HTML code


Branch: CURRENT, Version: 0.13.1, Package name: hs-tagsoup-0.13.1, Maintainer: pho

TagSoup is a library for parsing HTML/XML. It supports the HTML 5
specification, and can be used to parse either well-formed XML, or
unstructured and malformed HTML from the web. The library also provides
useful functions to extract information from an HTML document, making it
ideal for screen-scraping.

Users should start from the Text.HTML.TagSoup module.
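
As a rough illustration of the screen-scraping use described above, the
following sketch collects the href attribute of every <a> tag in a document.
It assumes the tagsoup package is installed; linkTargets is a hypothetical
helper name chosen for this example, not part of the library.

```haskell
module LinkTargets (linkTargets) where

import Text.HTML.TagSoup (fromAttrib, isTagOpenName, parseTags)

-- Hypothetical helper: return the href value of every opening <a> tag.
-- parseTags does not fail on malformed input, so unclosed or badly
-- nested markup still yields a flat list of tags to filter.
-- An <a> tag with no href attribute contributes an empty string.
linkTargets :: String -> [String]
linkTargets html =
  [ fromAttrib "href" tag
  | tag <- parseTags html
  , isTagOpenName "a" tag
  ]
```

Because parseTags yields a flat list of tags rather than a tree, ordinary
list functions are enough for simple extraction tasks like this one.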


Required to run:
[wip/ghc]

Master sites:

SHA1: 4066f9e949dacf8111fb187bf01b1f214f5ee7d8
RMD160: 612408151a172be0a461c0d773355073896b9dfd
Filesize: 41.754 KB

CVS history:


   2014-08-03 23:35:39 by Ryosuke Moro | Files touched by this commit (23) | Package removed
Log message:
Remove hs-cgrep, hs-tagsoup, hs-xml, imported to pkgsrc/textproc.
   2014-05-18 23:33:25 by Ryosuke Moro | Files touched by this commit (119)
Log message:
Remove hs-data-default-class, hs-dlist, hs-text, hs-utf8-string,
imported to pkgsrc/devel.
   2014-04-15 12:59:40 by Ryosuke Moro | Files touched by this commit (53)
Log message:
- ready for HASKELL_ENABLE_HADDOCK_DOCUMENTATION= yes
   2014-01-28 23:56:29 by Ryosuke Moro | Files touched by this commit (3)
Log message:
Update to 0.13.1

CHANGES:
0.13.1
 - #562, treat <script> specially as per HTML5
( #562 => http://code.google.com/p/ndmitchell/issues/detail?id=562 )
   2013-09-19 12:29:19 by Ryosuke Moro | Files touched by this commit (4)
Log message:
Update to 0.13

CHANGES:
0.13
    #616, extend to all HTML5 entities
    Optimise lookupNamedEntity
    Replace escapeXMLChar with escapeXML
    Change all Entity functions to return String, not Int or Char
   2013-08-21 17:44:27 by Ryosuke Moro | Files touched by this commit (5)
Log message:
Update to 0.12.8

CHANGES:
0.12.8
    #560, space leak (fixed)
0.12.7
    Allow deepseq-1.3
    Allow QuickCheck-2.5
    Support bytestring-0.10 (NFData instances added)
0.12.6
    #515, don't crash on malformed characters (use ? instead)
0.12.5
    Add optRawTag to Render, to ensure scripts don't get escaped
0.12.4
    #487, fix the behaviour of ~== for TagComment and others
0.12.3
    GHC 7.2 compatibility
0.12.2
    Add StringLike instances for Text
0.12.1
    Add parseOptionsEntities and improve documentation
0.12
    Upgrade to QuickCheck 2.4.*
    Export toTagRep
    Make the -download flag off by default
    Eliminate HTTP dependency
    Eliminate mtl dependency
0.11.1
    Support --flags=-download to eliminate the network dependency
0.11
    #326, <a "foo"> is no longer treated as an attribute
    Add Eq/Ord instances to Tree
    Don't mark Text.HTML.TagSoup.Tree as preliminary
    #325, \r should be treated as whitespace
0.10.1
    #322, don't change ' to &apos; in render (do what the docs say)
0.10
    Improve the cabal file, make the test program off by default
    Expose Text.HTML.TagSoup.Match again (was hidden accidentally)
0.9
    #294, let optEntityData see if there was a ';' (CHANGES API)
    Numeric/hex entities in attributes were misparsed
    #149, add escapeHTML function
   2010-02-09 05:12:57 by PHO / phonohawk | Files touched by this commit (5) | Package updated
Log message:
Upstream update to tagsoup-0.8

   2009-10-11 12:44:40 by Thomas Klausner | Files touched by this commit (286)
Log message:
Remove obsolete @dirrm lines.