Forum Moderators: Robert Charlton & goodroi
So if you don't want "juice" to be passed then you are better off making sure all links are marked with rel="nofollow" rather than using META.
One more question for everyone. We have the following tag on all the "Print" pages (they have many stores and people bring in the printed products):
<meta name = "robots" content="noindex, nofollow, noarchive">
I wanted to make sure that the search engines didn't spider these page as they are almost identical to product pages. Is this enough and the best way to do it? I also have a no follow tag on the links pointing to the "Print" page.
One more question for everyone. We have the following tag on all the "Print" pages (they have many stores and people bring in the printed products):<meta name = "robots" content="noindex, nofollow, noarchive">
Best to use robots.txt to disallow crawling of print pages explicitly.
The most explicit way to disallow all references to the print pages is to use the robots noindex meta tag, but not also to use robots.txt. If you use robots.txt to block a print page, it is true that Google won't spider the content of the page, but that prevents Google from reading the robots meta nofollow on the page. In that event, Google will index the url of the print page if the link is publicly available, albeit it won't show any content from the page. This url may or may not end up appearing in a Google search.
By itself, the noindex attribute in the link to the page won't prevent the page from being indexed by Google, because it's possible for Google to follow another link to this page, or to spider it via urls it finds in publicly accessible log files.
And... if you had the noindex meta on the print page, but you had a noindex attribute in the link to the print page, and that was the only link to the print page, it's not clear how that would affect Google's indexing the url. ;) Google wouldn't spider the print page and therefore wouldn't see the noindex meta. Google would index the link anchor text as part of the source page content, but opinions and logic varies on whether Google would index that text as a link anchor.
See discussions about links with the noindex attribute showing up in some backlinks to a page....
Nofollow links showing up in backlinks
[webmasterworld.com...]
At least for Google, we have taken a very clear stance that those links are not even used for discovery; they are not used for PageRank; they are not used for anchor text in any way.
(the full text of that q&a is almost halfway in the transcript)
I said...
...then you've got to wonder how the links appear in the index at all. Not "even... for discovery" is a pretty strong statement... and, as I note in the previous discussion, "the nofollow meta tag is what you use when you want to prevent Google from indexing the link itself."
My dilemma is I can't reconcile this with the original post, and a similar observation on another thread I cite, that people are seeing nofollow links appearing in Google backlinks.
One thought that comes to mind is that the links may have been indexed before they were nofollowed, and perhaps hadn't been purged from the backlinks index at the time they were seen.