How does Google cache web pages that require a subscription?

DarrenS · November 6, 2003, 6:12am

Sometimes in the results of a Google search, I’ll see pages that require you to subscribe to see the info. When I click, I get the prompt to register (I think the New York Times is a classic example). Yet, if I click the “cached” button, Google has a copy of the content. How does Google do it? Does it have a subscription to every web site on the planet, or do the websites allow “robots” in by default?

ravage2 · November 6, 2003, 6:48am

When certain webpages are uploaded (eg New York Times pieces) they let people read them for free for a certain period of time. In the case of the NY Times, its one week. After this they put it into archive and you have to pay to see it. Google caches the pages before they are archived. Dunno about the legality of it though.

II_Gyan_II · November 6, 2003, 6:53am

ravage2

IIRC, you can also append “&partner=Google” to the end of a NYTimes URL to view pages reserved for registered users.

Bob55 · November 6, 2003, 7:29am

I always guessed sites allowed Google’s spiders to browse/archive them so that the pages will turn up in search engines and people will be inclined to subscribe. But if they had their choice they probably wouldn’t want their pages cached.

rsa · November 6, 2003, 12:06pm

Sites do have the option to not have their pages cached. I believe they can just use the NOARCHIVE option.

Topic		Replies	Views
Google cache and copyright law Factual Questions	3	1098	October 17, 2002
Legallity of the Google cache Factual Questions	22	1653	June 20, 2003
Are Google's archives a copyright violation Factual Questions	10	858	August 4, 2003
Google vs. Copyright Law Factual Questions	25	1690	March 14, 2004
How does google get around site restrictions? Factual Questions	4	1050	February 3, 2004

How does Google cache web pages that require a subscription?

Related topics