2004-01-05 23:16:25 by Johnny C. Lam | Files touched by this commit (27) |
Log message:
bl3ify
|
2003-11-27 17:34:00 by grant beattie | Files touched by this commit (2) |
Log message:
Updated p5-HTML-Parser to 3.34.
changes since 3.31:
Release 3.34
Fix segfault that happened when the parse callback caused
the stack to get reallocated. The original bug report was
<http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=217616>
Release 3.33
Perl 5.005 or better is now required. For some reason we get
a test failure with perl-5.004 and I don't really feel like
debugging that perl any more. Details about this failure can
be found at <http://rt.cpan.org/Ticket/Display.html?id=4065>.
New HTML::TokeParser method called 'get_phrase'. It returns
all current text while ignoring any phrase-level markup.
The HTML::TokeParser method 'get_text' now expands skipped
non-phrase-level tags as a single space.
Release 3.32
If the document parsed ended with some kind of unterminated markup,
then the parser state was not reset properly and this piece of markup
would show up in the beginning of the next document parsed.
<http://rt.cpan.org/Ticket/Display.html?id=3954>
The get_text and get_trimmed_text methods of HTML::TokeParser can
now take multiple end tags as argument. Patch by <siegmann@tinbergen.nl>
at <http://rt.cpan.org/Ticket/Display.html?id=3166>.
Various documentation tweaks.
Included another example program: hdump
|
2003-08-25 02:00:10 by Klaus Heinz | Files touched by this commit (3) |
Log message:
Update to 3.31.
Better compatibility with Mozilla/MSIE behaviour.
==== Changes since 3.27 ====
2003-08-19 Gisle Aas <gisle@ActiveState.com>
Release 3.31
The -DDEBUGGING fix in 3.30 was not really there :-(
2003-08-17 Gisle Aas <gisle@ActiveState.com>
Release 3.30
The previous release failed to compile on a -DDEBUGGING perl
like the one provided by Redhat 9.
Got rid of references to perl-5.7.
Further fixes to avoid warnings from Visual C.
Patch by Steve Hay <steve.hay@uk.radan.com>.
2003-08-14 Gisle Aas <gisle@ActiveState.com>
Release 3.29
Setting xml_mode now implies strict_names also for end tags.
Avoid warning from Visual C. Patch by <gsar@activestate.com>.
64-bit fix from Doug Larrick <doug@ties.org>
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=195500
Try to parse similar to Mozilla/MSIE in certain edge cases.
All these are outside of the official definition of HTML but
HTML spam often tries to take advantage of these.
- New configuration attribute 'strict_end'. Unless enabled
we will allow end tags to contain extra words or stuff
that look like attributes before the '>'. This means that
tags like these:
</foo foo="<ignored>">
</foo ignored>
</foo ">" ignored>
are now all parsed as a 'foo' end tag instead of text.
Even if the extra stuff looks like attributes they will not
be reported if requested via the 'attr' or 'tokens' argspecs
for the 'end' handler.
- Parse '</:comment>' and '</ comment>' as comments unless
strict_comment is enabled. Previous versions of the parser
would report these as text. If these comments contain
quoted words prefixed by space or '=' these words can
contain '>' without terminating the comment.
- Parse '<! "<>" foo>' as comment containing ' \
"<>" foo'.
Previous versions of the parser would terminate the comment
at the first '>' and report the rest as text.
- Legacy comment mode: Parse with comments terminated with a
lone '>' if no '-->' is found before eof.
- Incomplete tag at eof is reported as a 'comment' instead
of 'text' unless strict_comment is enabled.
2003-04-16 Gisle Aas <gisle@ActiveState.com>
Release 3.28
When 'strict_comment' is off (which it is by default)
treat anything that matches <!...> a comment.
Should now be more efficient on threaded perls.
|
2003-07-22 06:14:48 by Martti Kuparinen | Files touched by this commit (92) |
Log message:
COMMENT should start with a capital letter.
|
2003-07-18 00:57:07 by grant beattie | Files touched by this commit (1065) |
Log message:
s/netbsd.org/NetBSD.org/
|
2003-04-12 17:40:38 by Martti Kuparinen | Files touched by this commit (2) |
Log message:
Updated p5-HTML-Parser to 3.27
* Typo fixes to the documentation
* Added HTML::Entities::escape_entities_numeric
|
2002-10-20 17:06:34 by Stoned Elipot | Files touched by this commit (1) |
Log message:
Use perl5/module.mk.
|
2002-09-21 09:48:20 by Johnny C. Lam | Files touched by this commit (1) |
Log message:
buildlink1 -> buildlink2
|
2002-07-01 14:12:02 by Thomas Klausner | Files touched by this commit (2) |
Log message:
Update to 3.26:
Avoid core dump in some cases where the callback croaks.
The perl_call_method and perl_call_sv needs G_EVAL flag
to be safe.
New parser attributes; 'attr_encoded' and 'case_sensitive'.
Contributed by Guy Albertelli II <guy@albertelli.com>.
HTML::Entities
- don't encode \r by default as suggested by Sean M. Burke.
HTML::HeadParser
- ignore empty http-equiv
- allow multiple <link> elements. Patch by
Timur I. Bakeyev <timur@gnu.org>
Avoid warnings from bleadperl on the uentities test.
|