# Robots file for farrand.net # $Id: robots.txt 151 2004-05-23 23:07:32Z jim $ # Don't index: # * The archive of the old site # * The directory of junk designed to hurt bad automata # * My subversion repos # * My home directory, which has a copy of the site and other junk # * Mediawiki metadata User-agent: * Disallow: /archive-2003/ Disallow: /content/ Disallow: /svn/ Disallow: /~jim/ Disallow: /jimbosmediawiki/ # Allow atomz to index the old site. This is because their is an Atomz search # box on the old pages and we don't want to confuse users by making it only # search new pages. User-agent: Atomz Disallow: /content/ Disallow: /svn/ Disallow: /~jim/ # Grub can't seem to behave itself. Ban it entirely. User-agent: grub-client Disallow: / # Digital Brand Protection? No thanks. User-agent: NPBot Disallow: / # "Panscient crawls the web and collects information on people and companies for # vertical search applications. Our databases can be used to augment search # engines for corporate information, sale leads, business intelligence and # genealogy. We help organizations provide complete and comprehensive web # information to their customers." # # Doesn't sound like my bag User-agent: panscient.com Disallow: /