
Should I be worried about these links in GWT?

"Via this intermediate link"


rajesh8759

11:40 am on May 30, 2016 (gmt 0)

10+ Year Member



I have a PDF file on my website (mysite.com). Today, while reviewing backlinks in GWT, I noticed sites listed as backlinks to my PDF file. The pattern is:

Site1.com/page1.html Via this intermediate link SiteB.com/download/file1.pdf
Site2.com/page1.html Via this intermediate link SiteB.com/download/file1.pdf
Site3.com/page1.html Via this intermediate link SiteB.com/download/file1.pdf

file1.pdf is on mysite.com, but under a different name.

SiteB is in my niche; it's a forum/blog site. But Site1, Site2, Site3... these are irrelevant sites.

Should I be worried about this, or can I just ignore these links in GWT?

robzilla

12:11 pm on May 30, 2016 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



It means Site1.com/page1.html links to SiteB.com/download/file1.pdf which redirects to file1.pdf on mysite.com. It's a little odd, coming from irrelevant sites, so it could be web spam, but even then I wouldn't worry about it. My backlink reports in GWT/SC are full of spammy sites, but I have no control over who links to me so I just ignore them.

rajesh8759

12:23 pm on May 30, 2016 (gmt 0)

10+ Year Member



There is no redirection to my site. That's the strange thing. My guess is that Google identifies the PDF on SiteB as a duplicate of the one on my site, but there is nothing to verify this.
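One part of this can be checked locally: whether the two PDFs are byte-for-byte identical. Below is a minimal Python sketch, assuming both files have been downloaded to disk first; the function names are made up for illustration. It only detects exact copies, and says nothing about how Google actually decides that two files are duplicates.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 65536) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_content(path_a: str, path_b: str) -> bool:
    """True only if both files contain exactly the same bytes."""
    return sha256_of_file(path_a) == sha256_of_file(path_b)
```

If `same_content` returns True for my copy and the copy saved from SiteB, the files are exact duplicates. A False result does not rule out near-duplicates (for example, a re-saved PDF with different metadata).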

robzilla

12:48 pm on May 30, 2016 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Oh, I see. It's possible Google recognizes the file on mysite.com as the original (canonical) version and the others as duplicates, and then attributes the link value to the original document rather than (or on top of) the file on SiteB.com. Interesting! I can find other reports of this, but no word from Google. I do wonder if this is related to the PDF spam I'd been seeing a lot of earlier this year, and whether the same could apply to other web content (zip, exe, doc, xls, images, perhaps even web text). Attribution is a bit of an issue online; it's certainly not always given, and finding the source for any given piece of (redistributed) content may be very important to Google.
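To make the idea concrete, here is a toy Python model of that kind of attribution. It is purely a guess at the logic, not anything Google has documented: the function name, the content fingerprints, and the URL mysite.com/my-file.pdf are all hypothetical.

```python
from collections import defaultdict


def attribute_links(content_hash, canonical_for_hash, links):
    """Credit each link to the canonical copy of the file it points at.

    content_hash:       maps a file URL to a fingerprint of its content
    canonical_for_hash: maps a fingerprint to the URL chosen as the original
    links:              list of (source_page, target_url) pairs
    """
    report = defaultdict(list)
    for source, target in links:
        fingerprint = content_hash.get(target)
        # Targets with no known duplicate are simply their own canonical.
        canonical = canonical_for_hash.get(fingerprint, target)
        # Record the duplicate the link really pointed at, if any.
        via = target if canonical != target else None
        report[canonical].append((source, via))
    return dict(report)


# Hypothetical data mirroring the pattern in this thread.
content_hash = {
    "mysite.com/my-file.pdf": "abc123",
    "SiteB.com/download/file1.pdf": "abc123",
}
canonical_for_hash = {"abc123": "mysite.com/my-file.pdf"}
links = [
    ("Site1.com/page1.html", "SiteB.com/download/file1.pdf"),
    ("Site2.com/page1.html", "SiteB.com/download/file1.pdf"),
]
report = attribute_links(content_hash, canonical_for_hash, links)
```

In this model, `report` lists both Site1 and Site2 under mysite.com/my-file.pdf, each with SiteB.com/download/file1.pdf recorded as the intermediate link, which is the same shape as the "Via this intermediate link" rows in the report.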

rajesh8759

1:00 pm on May 30, 2016 (gmt 0)

10+ Year Member



I'm not aware of the PDF spam, but if Google does this for a PDF file without a rel="canonical" tag, that is smart of Google. I was worried because the links are from low-quality sites (Site1, Site2, ...) to the duplicate-content site (SiteB), yet they show up as my backlinks. So, indirectly, I am getting low-quality backlinks. Bizarre logic, though!