http://example.com/forum/download/file.php?id=123456&mode=view
GET /forum/download/file.php?id\\u003d123456\\u0026mode\\u003dview
That will not return a 404
Well, that's the point. If it did return a 404 it wouldn't be listed as a “soft 404”. (Editorial comment: Is g### really being that stupid? If a URL that doesn't require parameters is redirected to the parameterless version, that's a correct and appropriate response; the alternative is Duplicate Content. What the ### do they expect you to do?)
id\\u003d123456\\u0026mode\\u003dview
That looks like an encoding of non-ASCII, or non-alphanumeric, characters. A quick lookup confirms that u003d is the code point for “=” and u0026 is the code point for “&”, so it translates to id=123456&mode=view.
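If you want to check the translation yourself, here is a minimal Python sketch. The function name decode_unicode_escapes is made up for illustration, and it assumes the escapes appear as literal text (one or two backslashes, then u and four hex digits), the way they show up in the log lines in this thread.

```python
import re

def decode_unicode_escapes(text: str) -> str:
    """Replace literal \\uXXXX sequences with the characters they encode.

    Hypothetical helper: accepts one or more leading backslashes so it also
    handles the doubled form seen in the log lines quoted in this thread.
    """
    return re.sub(
        r"\\+u([0-9a-fA-F]{4})",             # backslash(es), 'u', four hex digits
        lambda m: chr(int(m.group(1), 16)),  # hex code point -> character
        text,
    )

# Query string copied from the logged request (raw string keeps the backslashes).
escaped = r"id\\u003d123456\\u0026mode\\u003dview"
print(decode_unicode_escapes(escaped))  # id=123456&mode=view
```

So the escaped request is the normal query string with “=” and “&” written as Unicode escapes.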
//Same thing before for other sites.
<li>
<div class="ktw_img" itemprop="associatedMedia" itemscope itemtype="http://schema.org/ImageObject" itemid="https://example.com/forum/download/file.php?id\u003d123456\u0026mode\u003dview">
Some relevant text
<a href="https://example.com/forum/download/file.php?id\u003d123456\u0026mode\u003dview" rel="nofollow" class="fancybox">
<img src="https://example.com/forum/download/file.php?id\u003d42565\u0026mode\u003dview" alt="file.php?id=123456&mode=view text relevant to image at offendingsite.co" title="Text relative to the image" height="798" itemprop="contentURL" onError="this.onerror=null;this.src='https://encrypted-tbn0.gstatic.com/images?q=random_string_for_thumbnail_on_google';" />
</a>
Relevant text
</div>
</li>
//Same thing after for other sites.
which you are presumably either blocking or serving some other content
example.com/forum/download/file.php?id\\u003d123456\\u0026mode\\u003dview
http://example.com/forum/download/file.php?id=123456&mode=view
GET /forum/download/file.php?id\\u003d123456\\u0026mode\\u003dview
If you don't want them asking for the files you can add a few lines to robots.txt:
Disallow: /forum/download/file.php?*\\u003d
Disallow: /forum/download/file.php?*\\u0026
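To sanity-check which requests those two lines would catch, here is a rough Python sketch of wildcard matching the way the major search engines document it (“*” matches any run of characters, a trailing “$” anchors the end). It is not a real robots.txt parser; the helper names are hypothetical, and the rule and path strings are copied exactly as they appear in this thread, so the match only holds if the real request carries the escape in the same form.

```python
import re

def robots_pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Turn one Disallow value into a regex anchored at the start of the path."""
    anchored = pattern.endswith("$")  # trailing '$' means "must end here"
    if anchored:
        pattern = pattern[:-1]
    # Escape everything literally, then let each '*' match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

def is_disallowed(path: str, patterns) -> bool:
    """True if any Disallow pattern matches the requested path plus query."""
    return any(robots_pattern_to_regex(p).search(path) for p in patterns)

# The two rules from the post, copied as written (raw strings keep backslashes).
RULES = [
    r"/forum/download/file.php?*\\u003d",
    r"/forum/download/file.php?*\\u0026",
]

bad = r"/forum/download/file.php?id\\u003d123456\\u0026mode\\u003dview"
good = "/forum/download/file.php?id=123456&mode=view"

print(is_disallowed(bad, RULES))   # True
print(is_disallowed(good, RULES))  # False
```

The normal URL stays crawlable because it contains neither escape sequence; only the mangled form is matched.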