Impedir acesso de bots com o .htaccess
Se os bots estão sempre a aparecer nos vossos logs de visitas do site e não vos interessa pois querem reduzir o tráfego inútil e evitar o spam, basta criarem o ficheiro .htaccess com o seguinte conteúdo:
SetEnvIfNoCase User-Agent "^abot" unwanted_bot SetEnvIfNoCase User-Agent "^aipbot" unwanted_bot SetEnvIfNoCase User-Agent "^EI" unwanted_bot SetEnvIfNoCase User-Agent "^libwww-perl" unwanted_bot SetEnvIfNoCase User-Agent "^LWP" unwanted_bot SetEnvIfNoCase User-Agent "^nameprotect" unwanted_bot SetEnvIfNoCase User-Agent "^PlantyNet_WebRobot" unwanted_bot (...) order allow,deny allow from all deny from env=unwanted_bot
Onde estão os parêntesis basta colocarem mais bots que vos interessa bloquear. Para terem uma ideias do que não interessam podem visitar esta lista.


3. April 2008 at 08:52
Hello,
not sure I understand your post, but MSIECrawler is a legit browser substring from what I know. It’s used by IE when making favorites available for offline use.
3. April 2008 at 09:40
Hello Johann,
I don’t know if you understand portuguese, but this post is about blocking bots on a website, by adding the lines that I mentioned in .htaccess. Well, I was not sure about MSIECrawler, I wasn’t sure if it was a legit browser string so I added it as an example on my post.
Thanks for your feedback.