Forum Moderated by: open

Crawler, Spider, and User Agent ID


Forum to identify search engine spiders and user agents

 
Thread Subject | Messages | Started by | Last Message
Snoop
2 wilderness 11:45 am Jul 26, 2008
ODP/dmoz.org link checker
pmoz bot does not read robots.txt
17 Umbra 8:50 pm Jul 25, 2008
No IP
IP of bot just a dot
8 farflappin 3:21 pm Jul 24, 2008
betaBot
2 keyplyr 1:00 am Jul 23, 2008
extrabot dot com
4 keyplyr 7:15 pm Jul 21, 2008
Visbot returns
With a proper UA and webmaster info page
2 jdMorgan 7:40 pm Jul 19, 2008
Web Crawlers basics
beginner need help!
2 PhoShzzle 4:13 pm Jul 17, 2008
Oozbot/0.17
8 incrediBILL 11:26 pm Jul 15, 2008
Munax bots cloaking themselves and causing high server load
16 Asia_Expat 7:49 pm Jul 15, 2008
Webbot/0.1
Russian bot
3 keyplyr 11:29 pm Jul 14, 2008
MSN bot finds php robots.txt
4 phred 8:26 am Jul 14, 2008
CatchBot
4 wilderness 11:59 pm Jul 13, 2008
WebSense Hiding Behind Rotating User Agents
Internet Security Company Tricks Site Security
20 Not_academic 4:11 pm Jul 11, 2008
Google
66.249.84.
2 wilderness 5:33 am Jul 11, 2008
Googlebot / image bot causing probs?
sometimes hitting 404 pg, sometimes skips
14 Megaclinium 12:19 am Jul 11, 2008
Wanadoo some scraping?
3 incrediBILL 12:18 am Jul 11, 2008
is bot from 207.200.116 netscape?
just 'AOL 9.0' listed in UA
4 Megaclinium 10:31 pm Jul 10, 2008
Google Web Accelerator
... it's back!
6 Umbra 5:04 pm Jul 8, 2008
Spider testing UAs?
92.48.126.nnn
3 Mokita 7:06 am Jul 8, 2008
Mr.Carlito
4 idiotgirl 1:36 pm Jul 7, 2008
Register Scolds AVG For Generating Fake Traffic As Link Malware
Webmasters Complain AVG Debilitating Traffic Analytics
219 Samizdata 7:49 am Jul 7, 2008
CyberPatrol SiteCat Webbot
It crawls the internet looking for harmful sites
5 thetrasher 4:31 am Jul 7, 2008
CFNetwork/330
3 Receptional_Andy 11:23 pm Jul 6, 2008
get 'head' is bots?
noticed new gets with this notation
5 Megaclinium 5:59 am Jul 6, 2008
Edgy from Columbia?
getting hit by new spider
10 Megaclinium 3:49 am Jul 5, 2008