Impedir acesso de bots com o .htaccess

Se os bots estão sempre a aparecer nos vossos logs de visitas do site e não vos interessa pois querem reduzir o tráfego inútil e evitar o spam, basta criarem o ficheiro .htaccess com o seguinte conteúdo:

SetEnvIfNoCase User-Agent "^abot" unwanted_bot
SetEnvIfNoCase User-Agent "^aipbot" unwanted_bot
SetEnvIfNoCase User-Agent "^EI" unwanted_bot
SetEnvIfNoCase User-Agent "^libwww-perl" unwanted_bot
SetEnvIfNoCase User-Agent "^LWP" unwanted_bot
SetEnvIfNoCase User-Agent "^nameprotect" unwanted_bot
SetEnvIfNoCase User-Agent "^PlantyNet_WebRobot" unwanted_bot

(...)

order allow,deny
allow from all
deny from env=unwanted_bot

Onde estão os parêntesis basta colocarem mais bots que vos interessa bloquear. Para terem uma ideias do que não interessam podem visitar esta lista.


 
 
 

2 Responses to “Impedir acesso de bots com o .htaccess”

  1. Johann
    3. April 2008 at 08:52

    Hello,

    not sure I understand your post, but MSIECrawler is a legit browser substring from what I know. It’s used by IE when making favorites available for offline use.

  2. rogeriopvl
    3. April 2008 at 09:40

    Hello Johann,

    I don’t know if you understand portuguese, but this post is about blocking bots on a website, by adding the lines that I mentioned in .htaccess. Well, I was not sure about MSIECrawler, I wasn’t sure if it was a legit browser string so I added it as an example on my post.

    Thanks for your feedback.

Leave a Reply