Forum Moderated by: open

Crawler, Spider, and User Agent ID


Forum to identify search engine spiders and user agents

 
Thread Subject | Messages | Started by | Last Message
WordPress/MU Bot
3 dstiles 9:42 pm May 30, 2010
Facebook spidering my site?
been going on for about 4 hours now
3 Bewenched 6:00 pm May 29, 2010
Gazopa Feast
Far too many hits
5 dstiles 9:44 pm May 28, 2010
Bing/1.1 CFNetwork/459
3 wilderness 6:38 pm May 24, 2010
Deluged with unknown bots
19 montclairguy 5:56 am May 23, 2010
Unknown bot scrape or probe?
6 enigma1 8:05 am May 18, 2010
Googlebot now hiding its UA
4 enigma1 3:18 pm May 17, 2010
New archive.org UA
includes heritrix :)
13 caribguy 2:52 am May 17, 2010
.b2b3a
7 wilderness 8:59 pm May 14, 2010
Missing HTTP HOST
No value at all in HTTP_HOST
9 dstiles 8:30 pm May 14, 2010
80legs
61 GaryK 7:18 am May 12, 2010
Proxy IT
2 keyplyr 5:41 pm May 11, 2010
Twitter Chasing Bots
How many bots are chasing your tweets?
10 incrediBILL 5:52 am May 9, 2010
SERPAnalytics
yet another SEO scraper
4 incrediBILL 10:25 pm May 8, 2010
SmartViper
another SEO scraper
5 incrediBILL 12:43 am May 8, 2010
TweetmemeBot
2 incrediBILL 10:28 pm May 7, 2010
80legs on the crawl
5 incrediBILL 6:15 am May 6, 2010
Strange Requests from Googlebot
9 aristotle 8:23 pm May 5, 2010
downforeveryoneorjustme Revisited
(incl. AppEngine-Google)
4 Pfui 12:24 am May 5, 2010
Blocking Cuil bots
8 Asia_Expat 4:40 am May 4, 2010
Apple iPad UA ID
3 Pfui 5:09 am Apr 28, 2010
MSN's many cloaked bots.
Mass undocumented activity in search.msn.com ranges
42 Pfui 8:28 am Apr 27, 2010
Prototype
Cloaked somethingorother from .us.ibm.com
5 Pfui 3:09 pm Apr 26, 2010
Interesting Google-bot Image Encounter
3 caribguy 3:08 pm Apr 25, 2010
Blank User agent string
Blank User agent string query
4 baiwan 6:50 pm Apr 24, 2010