Forum Moderators: Robert Charlton & goodroi
Google Will Eventually Stop Following Links on Noindex Pages
...is it better to "index,follow" low-value pages to make sure the link juice from those pages is not lost OR it is better to "noindex" low-value pages and lose the link juice?
Pagination is now best done via canonical tags, not noindex.
So what is the consensus on that: is it better to "index,follow" low-value pages to make sure the link juice from those pages is not lost OR it is better to "noindex" low-value pages and lose the link juice?
Pagination is now best done via canonical tags, not noindex....
It is possible, though, to misread your comment, suggesting canonical tags in pagination, as a recommendation that all pages in a paginated series should be canonicalized to a root or central page. This would be a misuse of the canonical tag, as individual pages in a series aren't usually similar enough to a root page for the canonical to be applicable, etc etc. I don't think this is what you meant.
I'm thinking you might mean that self-canonicals are often used in the individual pages in a series in conjunction with rel="next" and rel="prev", but I won't presume to interpret your comment further than that.
I see WW doesn't block them via noindexIn the specific case of the present site, this happens to be true. But a page may also be noindexed via the X-Robots-Tag header, which Google recognizes (don't know about other SEs). That wouldn't be visible in the page.
Pagination is now best done via canonical tags, not noindex....
...That's an old quote, and I can't remember what I was thinking. It wouldn't have been self-canonical though. // But you're right, rel="prev" and rel="next" is one correct way of doing it (we use this for results pages), or alternatively canonical to an unpaginated version (I see this used for long articles). Either way, noindex on pagination is not ideal- and canonical to page1 is plain wrong
There is an outstanding explanation from Google's John Mueller about the differences around the noindex and rel=canonical signals and why they should not be mixed. In short, Google wants clear signals that are consistent and straightforward....
The general rule of thumb is that signals get forwarded & combined with canonicalization. When Google sees two URLs from your site, they look the same, and you tell us your preference clearly, we'll try to combine them and treat them as one (usually stronger) URL instead of separate ones....
On the other hand, noindex (alone) & robots.txt disallow (in general) are not clear signs for canonicalization. Just having a noindex on a page doesn't tell us that you want to have it combined with something else, and that signals should be forwarded. A robots.txt disallow is even trickier, we don't even know if the page matches anything else on your site, so we couldn't even use it for canonicalization if we wanted to.
This is also where the guide that you shouldn't mix noindex & rel=canonical comes from: they're very contradictory pieces of information for us. We'll generally pick the rel=canonical and use that over the noindex, but any time you rely on interpretation by a computer script, you reduce the weight of your input :) (and SEO is to a large part all about telling computer scripts your preferences).