Forum Moderators: phranque
ie: www.example.com/images/some-picture instead of www.example.com/images/some-picture.jpg
The mistake was live for roughly 36 hours and the thumbnail appeared on the index page and a category page only. The image resolved, strangely, in both firefox and IE but search engines went mad (4x more attempts to load this file than any other page on the site in that time span) trying to find a page called "images/some-picture/", notice the ending backslash.
It's a standard wordpress site however it has a custom canonical fix in place that adds a trailing slash to category pages... which apparently added the backslash to the image file since it had no ending.
I'm a bit puzzled as to why:
1) the image resolved without a file type (.jpg)
2) search engines tried to load the page /some-picture/ over 3800 times in 36 hours, getting a 404 each time.
3) I can't think of a way to make this knowledge beneficial to the site somehow.
Any ideas ?
edit: analytics and WMT dodn't report the surge or any 404 errors, server logs and another tracking service do.
[edited by: JS_Harris at 2:13 am (utc) on May 16, 2009]
So, to try to answer your questions;
1. The browsers are probably displaying the image because they are guessing that you intended to put a .jpg on the uri. Browsers guess that when you do an <img src=""> you are not sending them to get a page, but an image.
2. Since the search engines don't do code-correction like browsers, they are seeing the trailing slash and must think that you are intending to send them to a directory to look for an default page.
3. It'd be interesting to see if the SEs tried requesting page names from your images directory in an attempt to fish out a default page. Also, it's a good reminder to do browser testing.
Very interesting that WMT and Analytics don't report the problem. Keep your eye on WMT and it may show up soon.
100+ requests per hour indicates a lot of inbound links to those pages.