Path to this page:
./
www/p5-WWW-RobotRules,
Perl 5 module database of robots.txt-derived permissions
Branch: pkgsrc-2022Q4,
Version: 6.02nb11,
Package name: p5-WWW-RobotRules-6.02nb11,
Maintainer: pkgsrc-usersThe Perl 5 module WWW::RobotRules parses /robots.txt files as specified
in "A Standard for Robot Exclusion", at
http://www.robotstxt.org/wc/norobots.htmls
Webmasters can use the /robots.txt file to forbid conforming robots
from accessing parts of their web site.
The parsed files are kept in a WWW::RobotRules object, and this object
provides methods to check if access to a given URL is prohibited.
The same WWW::RobotRules object can be used for one or more parsed
/robots.txt files on any number of hosts.
Master sites: (Expand)
Filesize: 8.847 KB
Version history: (Expand)
- (2022-12-27) Package added to pkgsrc.se, version p5-WWW-RobotRules-6.02nb11 (created)