www/p5-WWW-RobotRules
Perl 5 module database of robots.txt-derived permissions
Branch: pkgsrc-2011Q3
Version: 6.01nb4
Package name: p5-WWW-RobotRules-6.01nb4
Maintainer: pkgsrc-users

The Perl 5 module WWW::RobotRules parses /robots.txt files as specified
in "A Standard for Robot Exclusion", at
<http://www.robotstxt.org/wc/norobots.html>.
Webmasters can use the /robots.txt file to forbid conforming robots
from accessing parts of their web site.
The parsed files are kept in a WWW::RobotRules object, and this object
provides methods to check if access to a given URL is prohibited.
The same WWW::RobotRules object can be used for one or more parsed
/robots.txt files on any number of hosts.
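
The workflow described above can be sketched roughly as follows. This is a minimal illustration, not taken from the package itself: the robot name `ExampleBot/1.0`, the example.com and example.org hosts, and the inline robots.txt text are all made-up placeholders, and the file contents are supplied as strings rather than fetched over the network. It uses only the module's `new`, `parse`, and `allowed` methods.

```perl
use strict;
use warnings;
use WWW::RobotRules;

# Hypothetical robot name; rules are matched against this User-agent.
my $rules = WWW::RobotRules->new('ExampleBot/1.0');

# Illustrative robots.txt contents for two made-up hosts. In a real
# robot these strings would come from fetching each host's /robots.txt.
my %robots_txt = (
    'http://www.example.com/robots.txt' =>
        "User-agent: *\nDisallow: /private/\n",
    'http://www.example.org/robots.txt' =>
        "User-agent: *\nDisallow: /tmp/\n",
);

# One object holds the parsed rules for every host.
# parse() takes the URL the file came from, then its content.
for my $robots_url (sort keys %robots_txt) {
    $rules->parse($robots_url, $robots_txt{$robots_url});
}

# Ask the same object about URLs on either host.
for my $url ('http://www.example.com/index.html',
             'http://www.example.com/private/data.html',
             'http://www.example.org/tmp/scratch.html') {
    my $verdict = $rules->allowed($url) ? 'allowed' : 'forbidden';
    print "$url: $verdict\n";
}
```

Under these assumed rules, the first URL should be reported as allowed and the other two as forbidden. The sketch only queries hosts whose robots.txt has already been parsed; how a robot should treat a host it has no rules for is a policy decision left out here.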
Required to run: www/p5-URI
SHA1: 426920bbfc73a38dffa319dd2f53b0eb9b294b5b
RMD160: 6f2c1bef375ad2b2f171b4feae721eec8e1007ec
Filesize: 8.835 KB
Version history:
- (2011-10-04) Package added to pkgsrc.se, version p5-WWW-RobotRules-6.01nb4 (created)