Subject: CVS commit: pkgsrc/net/wget
From: Thomas Klausner
Date: 2002-12-16 12:39:02
Message id: 20021216113902.265B4B42C@cvs.netbsd.org

Log Message:
Update to 1.8.2, with the recently discovered security problem
already fixed (patches from RedHat RPM).

Note: this package does not support IPv6 any longer, since no
updated IPv6 patch for wget-1.8.2 exists (to my knowledge).
The next major wget release is expected to contain IPv6 support.

Uservisible changes since 1.7:
** A new progress indicator is now available and used by default.
You can choose the progress bar type with `--progress=TYPE'.  Two
types are available, "bar" (the new default), and "dot" (the old
dotted indicator).  You can permanently revert to the old progress
indicator by putting `progress = dot' in your `.wgetrc'.

** You can limit the download rate of the retrieval using the
`--limit-rate' option.  For example, `wget --limit-rate=15k URL' will
tell Wget not to download the body of the URL faster than 15 kilobytes
per second.

** Recursive retrieval and link conversion have been revamped:

*** Wget now traverses links breadth-first.  This makes the
calculation of depth much more reliable than before.  Also, recursive
downloads are faster and consume *significantly* less memory than
before.

*** Links are converted only when the entire retrieval is complete.
This is the only safe thing to do, as only then is it known what URLs
have been downloaded.

*** BASE tags are handled correctly when converting links.  Since Wget
already resolves <base href="..."> when resolving handling URLs, link
conversion now makes the BASE tags point to an empty string.

*** HTML anchors are now handled correctly.  Links to an anchor in the
same document (<a href="#anchorname">), which used to confuse Wget,
are now converted correctly.

*** When in page-requisites (-p) mode, no-parent (-np) is ignored when
retrieving for inline images, stylesheets, and other documents needed
to display the page.

*** Page-requisites (-p) mode now works with frames.  In other words,
`wget -p URL-THAT-USES-FRAMES' will now download the frame HTML files,
and all the files that they need to be displayed properly.

** `--base' now works conjunction with `--input-file', providing a
base for each URL and thereby allowing the URLs in the file to be
relative.

** If a host has more than one IP address, Wget uses the other
addresses when accessing the first one fails.

** Host directories now contain port information if the URL is at a
non-standard port.

** Wget now supports the robots.txt directives specified in
<http://www.robotstxt.org/wc/norobots-rfc.txt>.

** URL parser has been fixed, especially the infamous overzealous
quoting.  Wget no longer dequotes reserved characters, e.g. `%3F' is
no longer translated to `?', nor `%2B' to `+'.  Unsafe characters
which are not reserved are still escaped, of course.

** No more than 20 successive redirections are allowed.

Files:
RevisionActionfile
1.54modifypkgsrc/net/wget/Makefile
1.6modifypkgsrc/net/wget/PLIST
1.15modifypkgsrc/net/wget/distinfo
1.6modifypkgsrc/net/wget/patches/patch-ad
1.6modifypkgsrc/net/wget/patches/patch-ae
1.4modifypkgsrc/net/wget/patches/patch-af
1.4modifypkgsrc/net/wget/patches/patch-ag
1.5addpkgsrc/net/wget/patches/patch-ac