(Crawl-delay) Did you try that method?
User-agent: *
Crawl-delay: 90
Bing sometimes uses a corrupt UA (trailing underscore). Could that have been the source?
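As a rough sanity check on the Crawl-delay record quoted above, here is a minimal Python sketch that feeds those two lines to the standard library's robots.txt parser and reads back the delay it would report. The helper name crawl_delay_for is made up for illustration, and this only shows how Python's parser reads the record; it says nothing definite about how bingbot itself interprets it.

```python
from urllib.robotparser import RobotFileParser

# The two-line record quoted in the thread.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 90
"""


def crawl_delay_for(robots_txt, user_agent):
    """Return the Crawl-delay (in seconds) that applies to user_agent, or None.

    Hypothetical helper for illustration only.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.crawl_delay(user_agent)


if __name__ == "__main__":
    # Both the normal token and the "trailing underscore" variant fall
    # through to the wildcard record in this parser.
    print(crawl_delay_for(ROBOTS_TXT, "bingbot"))
    print(crawl_delay_for(ROBOTS_TXT, "bingbot_"))
```

In this parser the wildcard record applies whatever the UA string looks like, so a corrupt UA would not by itself escape a `User-agent: *` rule. Whether Bing's own parser behaves the same way is not something the thread establishes.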
207.46.13.98 - - [22/Jun/2011:04:50:45 +0100] "GET /faq.php HTTP/1.1" 200 13386 "-" "Mozilla/5.0 (compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm)" In:- Out:-:-pct. "-"
(13 x 200 OK accesses)
207.46.13.98 - - [22/Jun/2011:04:50:52 +0100] "GET /profile.php?mode=viewprofile&u=3 HTTP/1.1" 403 132 "-" "Mozilla/5.0 (compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm)" In:137 Out:114:83pct. "-"
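To compare what the bot actually did against the requested delay, one option is to pull the timestamps out of log lines like the ones above and look at the gaps between hits. The following is a sketch only: it assumes the log starts with the usual combined-format fields (the trailing In:/Out: fields are ignored), and the names LOG_RE, parse_hit and gaps_in_seconds are invented for this example.

```python
import re
from datetime import datetime

# Leading fields of a combined-format access log line; anything after the
# user-agent string (such as the In:/Out: compression fields) is ignored.
LOG_RE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<when>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" '
    r'(?P<status>\d{3}) \S+ "[^"]*" "(?P<agent>[^"]*)"'
)


def parse_hit(line):
    """Return (ip, timestamp, status, user_agent) or None if the line doesn't match."""
    m = LOG_RE.match(line)
    if m is None:
        return None
    when = datetime.strptime(m.group("when"), "%d/%b/%Y:%H:%M:%S %z")
    return m.group("ip"), when, int(m.group("status")), m.group("agent")


def gaps_in_seconds(lines, agent_token="bingbot"):
    """Seconds between consecutive hits whose user agent contains agent_token."""
    hits = [parse_hit(line) for line in lines]
    times = sorted(h[1] for h in hits if h and agent_token in h[3].lower())
    return [(later - earlier).total_seconds()
            for earlier, later in zip(times, times[1:])]


# The two log lines quoted in the thread.
SAMPLE = [
    '207.46.13.98 - - [22/Jun/2011:04:50:45 +0100] "GET /faq.php HTTP/1.1" '
    '200 13386 "-" "Mozilla/5.0 (compatible; bingbot/2.0; '
    '+http://www.bing.com/bingbot.htm)" In:- Out:-:-pct. "-"',
    '207.46.13.98 - - [22/Jun/2011:04:50:52 +0100] '
    '"GET /profile.php?mode=viewprofile&u=3 HTTP/1.1" 403 132 "-" '
    '"Mozilla/5.0 (compatible; bingbot/2.0; '
    '+http://www.bing.com/bingbot.htm)" In:137 Out:114:83pct. "-"',
]

if __name__ == "__main__":
    print(gaps_in_seconds(SAMPLE))
```

For the two quoted lines the gap is 7 seconds, well short of the 90 seconds asked for in robots.txt.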
I wonder if the problem lies in the high delay factor.
I assume the crawl rates you give are for pages, not pages + images.
if there was no robots.txt on your site
two or more sites on the same server. Would this be possible on the forum site?
I assume no crawl rate was specified for the site in the MSN/Bing Control Panel
Methinks no / means nothing's disallowed a.k.a. everything's allowed
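The "no / means nothing's disallowed" reading can be checked against Python's standard robots.txt parser, which treats an empty Disallow value as allow-everything. This is a small sketch, with a hypothetical helper name (allowed) and /faq.php borrowed from the log excerpt as the test path; other parsers may differ in edge cases.

```python
from urllib.robotparser import RobotFileParser

# Empty Disallow value versus an explicit slash.
ALLOW_ALL = "User-agent: *\nDisallow:\n"
BLOCK_ALL = "User-agent: *\nDisallow: /\n"


def allowed(robots_txt, user_agent, path):
    """Return True if this robots.txt lets user_agent fetch path.

    Hypothetical helper for illustration only.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, path)


if __name__ == "__main__":
    print(allowed(ALLOW_ALL, "bingbot", "/faq.php"))
    print(allowed(BLOCK_ALL, "bingbot", "/faq.php"))
```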
Is there a definitive list of IPs that identify as bingbot/MSN/whatever?
RewriteCond %{REMOTE_HOST} !\.(bing|live|msn)\.com$
RewriteCond %{REMOTE_HOST} !\.phx\.gbl$
RewriteCond %{REMOTE_ADDR} !^65\.54\.
RewriteCond %{REMOTE_ADDR} !^65\.55\.
RewriteCond %{REMOTE_ADDR} !^157\.55\.
RewriteCond %{REMOTE_ADDR} !^207\.46\.
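All six conditions above are negated and, by mod_rewrite's default, ANDed together, so whatever rule follows them (it is not shown in the thread) would fire only for requests that match none of the listed hostnames or address prefixes. A rough Python equivalent of that test is below; the function names are made up, and the hostnames and the 192.0.2.10 address used in the usage lines are placeholders, not real crawler data. The prefixes are simply the ones quoted above, not a definitive list of Microsoft ranges.

```python
import re

# Hostname suffixes and address prefixes taken from the RewriteCond lines.
HOST_RE = re.compile(r"\.(bing|live|msn)\.com$|\.phx\.gbl$")
ADDR_PREFIXES = ("65.54.", "65.55.", "157.55.", "207.46.")


def matches_listed_ranges(remote_host, remote_addr):
    """True if the host or address matches any pattern in the conditions."""
    return bool(HOST_RE.search(remote_host)) or remote_addr.startswith(ADDR_PREFIXES)


def conditions_all_pass(remote_host, remote_addr):
    """True when every negated RewriteCond holds, i.e. nothing matched."""
    return not matches_listed_ranges(remote_host, remote_addr)


if __name__ == "__main__":
    # Address from the quoted log lines: matches the 207.46. prefix.
    print(conditions_all_pass("", "207.46.13.98"))
    # Placeholder host and documentation-range address: matches nothing.
    print(conditions_all_pass("crawler.example.net", "192.0.2.10"))
```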
I would seriously get in touch with MS about your problem
So how about the simplest and quickest route -- a firewall rule?
"if they behave themselves they are ignored; if they are abusive they are stopped/blocked/reported. Whoever they are."
I doubt you'd find many in this forum who don't agree with that :)