##### This file tells search engines what they should index. #####

############# Settings for all crawlers/spiders.
User-agent: *
# An empty Disallow permits everything and closes this group, so the
# Disallow rules in later groups do not apply to every crawler.
Disallow:
#Disallow: /index.html
#Disallow: /ArquivoDigital/bin
#Disallow: /ArquivoDigital/Web.config
#Disallow: /aspnet_client
#Disallow: /WSArquivoDigital
#Disallow: /ipac20/help.html
#Disallow: /ipac20/validate.jsp
#Don't index help files
#Disallow: /hipres/
#Don't index ANYTHING!
#Disallow: /
#Disallow: /ipac20/

############# Disallow all crawling from a specific user-agent
## Uncomment the User-agent/Disallow pair for a crawler, but NOT the line
## that names it. Example: leave "#Google's crawler" commented and uncomment
## the two lines after it to tell Google not to index your site.

#Google's crawler
#User-agent: Googlebot
#Disallow: /

#Altavista's crawler
#User-agent: Scooter
#Disallow: /

#Lycos' crawler
#User-agent: Lycos_Spider_(T-Rex)
#Disallow: /

#AlltheWeb's crawler
#User-agent: FAST-WebCrawler/
#Disallow: /

#INKTOMI's crawler
#User-agent: Slurp
#Disallow: /

#Yahoo's crawler
#User-agent: Yahoo Slurp
#Disallow: /

#MSN's crawler
#User-agent: Msnbot
#Disallow: /

############# A list of some of the more useless user-agents that use up bandwidth
User-agent: Baiduspider
Disallow: /
Disallow: /ipac20/

User-agent: Black Hole
Disallow: /
Disallow: /ipac20/

User-agent: Titan
Disallow: /
Disallow: /ipac20/

User-agent: WebStripper
Disallow: /
Disallow: /ipac20/

User-agent: NetMechanic
Disallow: /
Disallow: /ipac20/

User-agent: CherryPicker
Disallow: /
Disallow: /ipac20/

User-agent: EmailCollector
Disallow: /
Disallow: /ipac20/

User-agent: EmailSiphon
Disallow: /
Disallow: /ipac20/

User-agent: WebBandit
Disallow: /
Disallow: /ipac20/

User-agent: EmailWolf
Disallow: /
Disallow: /ipac20/

User-agent: ExtractorPro
Disallow: /
Disallow: /ipac20/

User-agent: CopyRightCheck
Disallow: /
Disallow: /ipac20/

User-agent: Crescent
Disallow: /
Disallow: /ipac20/

User-agent: NICErsPRO
Disallow: /
Disallow: /ipac20/