From the
manual [support.google.com], Google gives us a way to set canonical tags on PDF documents via HTTP headers, useful if you use multiple versions to track:
you can use a rel="canonical" HTTP header to specify to Google the canonical URL for the PDF file, as follows:
Link: <http://www.example.com/downloads/white-paper.pdf>; rel="canonical"
Google currently supports these link header elements for Web Search only.
Anyone ever tried this pointing at an HTML document instead?
So let's say I have a PDF of my HTML content. And the PDF is a mirror image of the valid HTML content, just in an easily viewable form for offline consumption. Both the HTML and PDF file are indexed and rank.
What happens if I serve the canonical link header on the PDF file pointing back to the HTML file? Will Google treat the HTML version as the canonical version?