Forum Moderated by: open

Crawler, Spider, and User Agent ID


Forum to identify search engine spiders and user agents

 
Thread SubjectMessagesStarted byLast Message
MultiCrawler
Doesn't obey robots.txt
13 Mokita 3:02 pm Feb 6, 2008
Are there "bot networks"?
Controlled by the same person or company
10 Reno 5:49 pm Feb 5, 2008
Getting traffic from llnw.net
2 JohnKelly 3:58 am Feb 4, 2008
What could this be?
veoh-\xe2\xb8\xb3\xe2\xb8\xb40 service
4 madmatt69 12:25 pm Feb 2, 2008
Gator still a threat?
7 keyplyr 8:23 pm Jan 30, 2008
voyager-hc/1.0
5 JAB_Creations 6:30 am Jan 30, 2008
Another newbie question
11 nickgl 7:50 pm Jan 28, 2008
Newbie Questions
not understanding entries
8 Ken_Smith 7:42 pm Jan 27, 2008
Amazon being sneaky?
2 keyplyr 1:02 am Jan 25, 2008
facebookexternalhit
Face book image spider
6 urbanadventureorg 9:53 am Jan 22, 2008
Scraper currently operating from 84.130.73.***
2 Mokita 9:56 pm Jan 17, 2008
Grub powers Wikia Search
4 incrediBILL 6:17 am Jan 12, 2008
Blackspider
is a Websense company
2 Mokita 11:40 am Jan 10, 2008
Gigamega bot is now known as LiteFinder bot
The same mail harvesters did it again
2 malaiac 9:57 am Jan 7, 2008
Question about banned IP
2 nickgl 11:25 pm Jan 6, 2008
Domain Tools Exposed
11 incrediBILL 7:42 pm Jan 4, 2008
GigaMega.bot CIDR's found
confirmed by looking at rwhois information
7 Jimx 11:49 pm Dec 28, 2007
Yahoo now lurking around as Firefox
5 balam 4:21 am Dec 24, 2007
Fangcrawl
4 wilderness 6:59 am Dec 21, 2007
Spambot or linkchecker
spambot, linkchecker, new bot, jazztel .es
4 marodhum 6:51 pm Dec 16, 2007
Another badbot
2 Achernar 12:14 pm Dec 16, 2007
Hot Jobs spoofing as.
6 keyplyr 4:31 pm Dec 11, 2007
Identifying spider traffic
Moving all spider traffic to a particular server
4 arieng 6:06 pm Dec 10, 2007
how to gently spider
2 alexmg 12:17 am Dec 8, 2007
ActiveTourist gone mad
2 Hobbs 1:18 pm Dec 6, 2007