Forum Moderators: goodroi
My preference would be for no search engines to cache any of my site's pages at all. Whenever someone visits any of the pages on my site, I want to be sure that they see them in their most-current form.
How many people (other than SEOs) are likely to prefer the cached copy to the live page?
As a user, I like the Google Cache because it lets me see what's on a page if the site's server or Web connection is down.
As a publisher, I like the Google cache because, in the unlikely event that a user can't get to my live page, the cache shows the user what my site has to offer. It's a win-win situation for me and for the user.
With the Google Toolbar you get highlighted search terms without cache
yeah, but who wants to install the google toolbar.
i always search through caches because of keyword highlighting and i honestly can't believe how you guys work without if you want to do some serious online research. sometimes keywords are buried in a 10000 words or 200 KB article.
although my websites have caching turned off. but that's ok for the user, if you have a built-in search keyword highlighting script. i recommend to do it server-side, not with javascript.
yeah, but who wants to install the google toolbar.
Quite a few people do, apparently. I'm one of them. The Google toolbar is a great timesaver. Try it--you might actually like it. :-)
Google's cache saved me a lot of work when I lost data form my data base.
That's what backup is for.
I still see my pages cached on google.be
The courts in their wisdom ordered GOOG only to not cache content from the French and German speaking press in Belgium (the plaintiffs).
Facts remains that the search engines (not just Google) and archives alike do violate copyright, but it's only Google that's been prohibited not to violate the rights of a section of the press in one country. A more global approach is needed.
There might be more permissive laws in other countries, but it's clear that the caching as done by a.o. Google isn't considered fair use out here. Still the cached pages and probably even worse the archives continue to violate basic copyright law.
yeah, but who wants to install the google toolbar
It's about your only way to have even an idea of the PR of a page. Unfortunately it does not include an adsense preview.
I'd like to thank all the folks who are defending my copyrights, but fact is, I didn't ask any of you to step in for me. Thank God I'm not from Belgium. If I have an issue with Google archiving my pages, I'll take it directly to them. I don't need people talking about protecting my copyrights when I genuinely want Y!, MSN and G to archive my sites for me. Read this, I WANT THEM TO ARCHIVE MY SITES.
I want them to refer to my news sites, they provide me many new visitors and revenue - not from Adsense, but from people who would have never stumbled upon me and discover what I have to offer.
This is not a black and white issue and I'd like all the copyrights freaks to stop assuming that they represent every Webmaster. They don't.
With scrappers, you lose out completely. But with the Big Three, it's a give and take and you can benefit from this. And it's quite obvious that they are archiving an entire site and not just scrapping for Adsense. They never act as if the contents is theirs.
I usually care very much about copyrights, but we all know that the only way to fully protect one's contents online is not to post it at all anywhere on the Web or offline.
The Belgian decision reminds me of an old story where a baker wanted a beggar to pay for smelling the scent of his freshly baked breads. The beggar said, "no problem." He took his only coin and threw it on the ground a couple of time so the the baker could here the sound of the pennies. Then he said, I think you're all paid up now.
Here, the baker actually wants real money instead of noise...
If a webmaster likes pages to be cached by a search engine he/she should be able to indicate that in a robot.txt-like file.
Caching pages is different from "indexing" pages and this decision in Belgium could be the start of more to come.
A CNN reporter was talking about the latest scandal in Washington, and said something to effect that, "The government's official Web page (that is most directly related to the victim in the scandal) has been taken down, but we were able to find cached versions going back to 2004, and this is what was being said."
By displaying the cached versions of the pages to viewers, CNN showed that years ago there was knowledge of a problem that some politicians (and the attorney for the rehab-bound man at the center of the scandal) were today continuing to deny.
It was pretty neat.