Forum Moderated by: open

Crawler, Spider, and User Agent ID


Forum to identify search engine spiders and user agents

 
Thread SubjectMessagesStarted byLast Message
Hits from unidentified blog trackers
where are these spiders from?
5 abates 4:04 pm Sep 27, 2006
ShopWiki
Very active little bot .. no referrals
5 Bewenched 6:45 pm Sep 23, 2006
Definitive way to validate GoogleBot authenticity
official word from Google
8 incrediBILL 4:53 pm Sep 23, 2006
Bot banning consultants and/or bad bot banning services
Do they exist? Help for those of us who haven't a clue
16 Webwork 3:00 am Sep 23, 2006
panscient data services
block them NOW or pay later - 200 megs a day!
7 amznVibe 5:11 am Sep 22, 2006
seo bot from AOL?
who is this new bot showing up
3 bhartzer 10:57 pm Sep 21, 2006
Bot or no Bot
no user agent odd requests
2 Bewenched 4:25 pm Sep 20, 2006
Google Mobile Indexing Agent?
no request for robots.txt
3 keyplyr 10:49 pm Sep 18, 2006
Adwords - adbot
HELP - can't find it!
2 howiejs 3:14 am Sep 15, 2006
IEAutoDiscovery
Pre-fetcher, feed reader, e-mail harvester, or what?
2 GaryK 8:03 pm Sep 13, 2006
Who is TridentSpider/3.1?
spider from Slovenia?
3 Bewenched 4:25 pm Sep 12, 2006
Why would ebay visit my site?
7 smokeybarnable 6:28 am Sep 11, 2006
Microsoft with no UA
Microsoft hitting site with no UA
2 ssgumby 9:15 am Sep 7, 2006
ActiveTouristBot
16 Mokita 12:08 am Sep 7, 2006
Fatlens
3 smokeybarnable 5:37 pm Sep 6, 2006
MSN IP list
3 Brett_Tabke 11:07 pm Sep 4, 2006
Googlebot visit pages that not exist
googlebot is indexing pages that we dont have links
3 ivanff 11:23 am Sep 2, 2006
What is Net::Trackback/1.01? -Good/Bad?
5 fusion5 1:57 am Sep 1, 2006
Filtering Internet noise: That is counting only human visitors
I want to improve my own web logging tools
10 vite_rts 12:58 pm Aug 28, 2006
Is Inktomisearch spider ADHD?
Yahoo! Spider requests only 1 or 2 pages per visit.
2 geoffyp 4:12 am Aug 28, 2006
problem with JSESSIONIDs and robot crawlers
JSESSIONIDs combined with the use of Struts with Hibernate
5 KBee 6:31 pm Aug 25, 2006
Getty Images spider?
5 Stu_Rogers 7:02 am Aug 22, 2006
"libwww-perl/5.65" from an IBM address block
scanning for blog feeds?
2 zCat 9:42 am Aug 21, 2006
blogsearchbot-pumpkin
No Robots.txt
2 Ocean10000 4:42 am Aug 21, 2006
New Russian Spider
webalta.net
9 GaryK 10:10 pm Aug 20, 2006