I'd like to save the contents of a large website. Recommendations?

www.bartleby.com

as im sure most of you know, this is a website that features:
Columbia Encyclopedia, 6th edition
The Harvard Classics
The World Factbook
and much more, all for free.

I’d like to save this information to my harddrive in case one day, its not free anymore,

or,

for when i travel and am without internet i have some reading material to keep me occupied.

Does anyone know a fast way to save these books beside going though and manually copying and pasting all of it into text docs?
Maybe by saving directories of the website, instead of individual pages?
Any software?

Thanks…

There’s this http://www.httrack.com/
Not sure how smart it is when it comes to the differing methods of linking pages within a site, though.

From the TOS “The Service and its Contents are protected by copyright pursuant to U.S. and international copyright laws. You may not modify, publish, transmit, participate in the transfer or sale of, reproduce, create new works from, distribute, perform, display, or in any way exploit, any of the Content or the Service (including software) in whole or in part”

I’d check with them first that your intended archiving isn’t seen as exploiting the service.

The CIA World Factobook is available as an easy, packaged download to be run offline, available at the CIA website. It’s fairly obvious how to get to it.

Some of the Harvard Classics are available via Project Gutenberg as well, and you can download those for free too in easy to read form.

The Sixth Edition Columbia Encyclopedia, however, appears to be copyrighted, so it’s of questionable legality to download the whole thing for offline use.

For archiving a site legally, programs like Webzip exist for just such a reason.

http://www.interlog.com/~tcharron/wgetwin.html is an implementation of the wget program for Windows machines. If you don’t run Windows, you should already have wget on your machine (it’s a standard Unix tool and therefore comes with MacOS X, Linux, and the BSDs).

It can be extremely rude to slurp large amounts of data off someone’s servers, though. If you want to mirror Bartleby.com, ask them and they might give you an easier alternative.