2005-11-23 23:17:07 by Thomas Klausner | Files touched by this commit (2) |
Log message:
Update to 3.46:
2005-10-24 Gisle Aas
Release 3.46
Don't try to treat an literal as space.
This breaks Unicode parsing.
https://rt.cpan.org/Ticket/Display.html?id=15068
The unbroken_text option is now by default on
for HTML::TokeParser.
HTML::Entities::encode will no encode "'" by default.
Improved report/ignore_tags documentation by
Norbert Kiesel
Test suite now use Test::More, by
Norbert Kiesel
Fix HTML::Entities typo spotted by
Stefan Funke
Faster load time with XSLoader (perl-5.6 or better now required).
Fixed POD markup errors in some of the modules.
|
2005-08-06 08:19:41 by Johnny C. Lam | Files touched by this commit (634) | |
Log message:
Bump the PKGREVISIONs of all (638) packages that hardcode the locations
of Perl files to deal with the perl-5.8.7 update that moved all
pkgsrc-installed Perl files into the "vendor" directories.
|
2005-07-13 20:01:49 by Johnny C. Lam | Files touched by this commit (578) |
Log message:
Turn PERL5_PACKLIST into a relative path instead of an absolute path.
These paths are now relative to PERL5_PACKLIST_DIR, which currently
defaults to ${PERL5_SITEARCH}. There is no change to the binary
packages.
|
2005-04-11 23:48:17 by Todd Vierling | Files touched by this commit (3539) |
Log message:
Remove USE_BUILDLINK3 and NO_BUILDLINK; these are no longer used.
|
2005-02-24 15:08:42 by Thomas Klausner | Files touched by this commit (277) |
Log message:
Add RMD160 checksums.
|
2005-02-07 16:56:19 by Thomas Klausner | Files touched by this commit (2) |
Log message:
Update to 3.45:
2005-01-06 Gisle Aas
Release 3.45
Fix stack memory leak caused by missing PUTBACK. Only
code that used $p->parse(\&cb) form was affected.
Fix provided by Gurusamy Sarathy
2004-12-28 Gisle Aas
Release 3.44
Fix confusion about nested quotes in <script> and <style> text.
2004-12-06 Gisle Aas
Release 3.43
The SvUTF8 flag was not propagated correctly when replacing
unterminated entities.
Fixed test failure because of missing binmode on Windows.
|
2004-12-20 12:31:14 by grant beattie | Files touched by this commit (467) |
Log message:
since perl is now built with threads on most platforms, the perl archlib
module directory has changed (eg. "darwin-2level" vs.
"darwin-thread-multi-2level").
binary packages of perl modules need to be distinguishable between
being built against threaded perl and unthreaded perl, so bump the
PKGREVISION of all perl module packages and introduce
BUILDLINK_RECOMMENDED for perl as perl>=5.8.5nb5 so the correct
dependencies are registered and the binary packages are distinct.
addresses PR pkg/28619 from H. Todd Fujinaka.
|
2004-12-05 19:38:58 by Havard Eidnes | Files touched by this commit (3) |
Log message:
Update p5-HTML-Parser from version 3.35 to 3.42.
Change HOMEPAGE to author-independent link on search.cpan.org.
Change log:
2004-12-04 Gisle Aas <gisle@ActiveState.com>
Release 3.42
Avoid sv_catpvn_utf8_upgrade() as that macro was not
available in perl-5.8.0.
Patch by Reed Russell <Russell.Reed@acxiom.com>.
Add casts to suppress compilation warnings for char/U8
mismatches.
HTML::HeadParser will always push new header values.
This make sure we never loose old header values.
2004-11-30 Gisle Aas <gisle@ActiveState.com>
Release 3.41
Fix unresolved symbol error with perl-5.005.
2004-11-29 Gisle Aas <gisle@ActiveState.com>
Release 3.40
Make utf8_mode only available on perl-5.8 or better. It produced
garbage with older versions of perl.
Emit warning if entities are decoded and something in the first
chunk looks like hibit UTF-8. Previously this warning was only
triggered for documents with BOM.
2004-11-23 Gisle Aas <gisle@ActiveState.com>
Release 3.39_92
More documentation of the Unicode issues. Moved around HTML::Parser
documentation a bit.
New boolean option; $p->utf8_mode to allow parsing of raw UTF-8.
Documented that HTML::Entities::decode_entities() can take multiple
arguments.
Unterminated entities are now decoded in text (compatibility
with MSIE misfeature).
Document HTML::Entities::_decode_entities(); this variation of the
decode_entities() function has been available for a long time, but
have not been documented until now.
HTML::Entities::_decode_entities() can now be told to try to
expand unterminated entities.
Simplified Makefile.PL
2004-11-23 Gisle Aas <gisle@ActiveState.com>
Release 3.39_91
The HTML::HeadParser will skip Unicode BOM. Previously it
would consider the <head> section done when it saw the BOM.
The parser will look for Unicode BOM and give appropriate
warnings if the form found indicate trouble.
If no matching end tag is found for <script>, <style>, <xmp>
<title>, <textarea> then generate one where the next tag
starts.
For <script> and <style> recognize quoted strings and don't
consider end element if the corresponding end tag is found
inside such a string.
2004-11-17 Gisle Aas <gisle@ActiveState.com>
Release 3.39_90
The <title> element is now parsed in literal mode, which
means that other tags are not recognized until </title> has
been seen.
Unicode support for perl-5.8 and better.
Decoding Unicode entities always enabled; no longer a compile
time option.
Propagation of UTF8 state on strings.
Patch contributed by John Gardiner Myers <jgmyers@proofpoint.com>.
Calculate offsets and lengths in chars for Unicode strings.
Fixed link typo in the HTML::TokeParser documentation.
2004-11-11 Gisle Aas <gisle@ActiveState.com>
Release 3.38
New boolean option; $p->closing_plaintext
Contributed by Alex Kapranoff <alex@kapranoff.ru>
2004-11-10 Gisle Aas <gisle@ActiveState.com>
Release 3.37
Improved handling of HTML encoded surrogate pairs and illegally
endoded Unicode; <http://rt.cpan.org/Ticket/Display.html?id=7785>.
Patch by John Gardiner Myers <jgmyers@proofpoint.com>.
Avoid generating bad UTF8 strings when decoding entities
representing chars beyond #255 in 8-bit strings. Such bad
UTF8 sometimes made perl-5.8.5 and older segfault.
Undocument v2 style subclassing in synopsis section.
Internal cleanup:
Make 'gcc -Wall' happier.
Avoid modification of PVs during parsing of attrspec.
Another patch by John Gardiner Myers.
2004-04-01 Gisle Aas <gisle@ActiveState.com>
Release 3.36
Improved MSIE/Mozilla compatibility. If the same attribute
name repeats for a start tag, use the first value instead
of the last. Patch by Nick Duffek <html-parser@duffek.com>.
<https://rt.cpan.org/Ticket/Display.html?id=5472>
|
2004-04-25 00:46:14 by Johnny C. Lam | Files touched by this commit (109) |
Log message:
Relinquish maintainership of packages to tech-pkg@NetBSD.org.
|
2004-01-20 14:41:19 by David Brownlee | Files touched by this commit (2) |
Log message:
Updated p5-HTML-Parser to 3.35
- Documentation fixes by Paul Croome <Paul.Croome@softwareag.com>.
- Removed redundant dSP.
|