wildcard downloads?

cornflakes · April 4, 2000, 3:05am

A web-based discussion group I hang out at has started to recycle its post addresses. I’d like to copy the old posts before they disappear. To do this, I need to download a bunch of files with the following format: http://www.qwerty.com/####.htm, where #### is a sequential number.

Can this be done with early Win95 and possibly Netscape 4.5 or Internet Explorer using wildcard commands? If all else fails, I can get a friend to write a Perl script to download the data (shouldn’t a Unix program be able to do this w/o a script?), but I’d rather do it myself.
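For what it’s worth, the numbered-file download described above can be sketched in a few lines of Python instead of Perl. This is only a rough sketch under assumptions: the host is a placeholder (www.example.com), the numbers are assumed to be zero-padded to four digits because of the #### pattern, and the function names build_urls and download_all are made up for illustration.

```python
import time
import urllib.error
import urllib.request
from pathlib import Path


def build_urls(base, first, last, width=4, suffix=".htm"):
    """Return base/NNNN.htm for first..last inclusive (zero-padding is an assumption)."""
    root = base.rstrip("/")
    return [f"{root}/{n:0{width}d}{suffix}" for n in range(first, last + 1)]


def download_all(urls, dest_dir, delay=1.0):
    """Save each URL into dest_dir, skipping any that fail; return the saved paths."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    saved = []
    for url in urls:
        target = dest / url.rsplit("/", 1)[-1]
        try:
            with urllib.request.urlopen(url, timeout=30) as response:
                target.write_bytes(response.read())
            saved.append(target)
        except (urllib.error.URLError, OSError) as exc:
            # Missing or recycled post numbers are reported and skipped.
            print(f"skipped {url}: {exc}")
        time.sleep(delay)  # pause between requests to go easy on the server
    return saved
```

Something like download_all(build_urls("http://www.example.com", 1, 500), "posts") would then try numbers 0001 through 0500 and save whatever exists into a "posts" folder.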

cornflakes · April 4, 2000, 3:08am

Well, you make up a dummy URL and it’s already taken. My apologies to qwerty.com for not checking first.

ubermensch · April 4, 2000, 3:18am

i’d recommend a program called offline explorer…or web snake. both of them can mirror sites. so set it to go one night, grab the whole site, and then maybe manually grab each thread as it’s closed.

nice thing about those programs is that you can set them to download *.jpg or *.gif or whatever (a boon when you download porn, er, clip art) or .txt or whatever format you want, or the whole site. and you can tell them to follow however many ‘clicks’ you want, and, at least with offline explorer, you can tell it to not download anything off of the main site, so it won’t ‘click’ on any banners.
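The ‘clicks’ limit and the stay-on-the-main-site rule described above amount to a breadth-first crawl with a depth cap and a host check. Here is a rough Python sketch of that idea. It is not how Offline Explorer or Web Snake actually work inside; the names crawl, fetch_text and max_clicks are invented for illustration, and failed downloads are not handled.

```python
import urllib.request
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class _Hrefs(HTMLParser):
    """Collect the href value of every <a> tag on a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.hrefs.extend(v for k, v in attrs if k == "href" and v)


def fetch_text(url):
    """Download one page and decode it leniently."""
    with urllib.request.urlopen(url, timeout=30) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl(start_url, max_clicks, fetch=fetch_text):
    """Return {url: html} for pages within max_clicks links of start_url, same host only."""
    host = urlparse(start_url).netloc
    seen = {start_url}
    queue = deque([(start_url, 0)])
    pages = {}
    while queue:
        url, depth = queue.popleft()
        pages[url] = fetch(url)
        if depth == max_clicks:
            continue  # deep enough: keep the page but do not follow its links
        parser = _Hrefs()
        parser.feed(pages[url])
        for href in parser.hrefs:
            link = urljoin(url, href).split("#", 1)[0]  # resolve and drop fragments
            # Links to other hosts (banner ads, for instance) are never followed.
            if urlparse(link).netloc == host and link not in seen:
                seen.add(link)
                queue.append((link, depth + 1))
    return pages
```

The fetch argument is there so the crawl logic can be tried against a fake site (a dictionary of pages) before pointing it at a real one.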

cornflakes · April 4, 2000, 6:07am

That’ll work, ubermensch! I downloaded the program, and while I haven’t figured out how to restrict levels (what I need is several levels down from the top, but not all the way down), it looks like this is what I need. I guess another hard drive to store all this is next. Thanks!

cornflakes · April 4, 2000, 6:52pm

Well, I spoke too soon. The files are archived, and it looks like I’ll have to download each one by name. Perl script, here I come, I guess…

Arjuna34 · April 4, 2000, 11:19pm

You might try asking them if they can help you archive them; they may even let you ftp the whole thing.

Arjuna34

billehunt · April 6, 2000, 7:37am

If you have a copy of MS Office, it includes FrontPage, which has an import ability. You give it a top page, the depth you want to go and a size limit, and it’ll suck in a web site. You can then pare to your needs.

Zor · April 6, 2000, 8:16am

If the website you’re going to download these archived files from has a simple index page, or allows directory listing, a simple batch download utility will do the trick. Offline browsers are great, but they may get unnecessarily complicated. I’d recommend something like Go!Zilla (you want the link leecher specifically), which can work better than offline browsers in some instances. It’s also free, if that matters…
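The link-leecher idea above (read one index page, pull out every link of the kind you want, then feed that list to a downloader) can be sketched in Python as follows. This is not how Go!Zilla itself does it; the names LinkCollector and extract_links and the example.com addresses are placeholders for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect absolute URLs of <a href> links that end with a wanted suffix."""

    def __init__(self, base_url, suffixes):
        super().__init__()
        self.base_url = base_url
        self.suffixes = tuple(suffixes)
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value and value.lower().endswith(self.suffixes):
                # Resolve relative links against the index page's own address.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html_text, base_url, suffixes=(".htm", ".shtml")):
    """Return the matching links found in html_text, in page order."""
    collector = LinkCollector(base_url, suffixes)
    collector.feed(html_text)
    return collector.links
```

Given the text of an index page, extract_links returns a plain list of addresses that any batch downloader can then work through.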

AWB · April 6, 2000, 12:48pm

If you can FTP the site, start ftp with the -i switch (it turns off the per-file prompting) and use the command “mget *.*”.
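The same wildcard fetch over FTP can also be scripted. Below is a rough Python sketch using the standard ftplib module; it assumes the server allows anonymous login and a plain name listing, and the host, directory and function names (select_names, fetch_matching) are placeholders rather than anything from the site in question.

```python
import fnmatch
from ftplib import FTP
from pathlib import Path


def select_names(names, pattern="*"):
    """Keep only the names matching a shell-style wildcard pattern."""
    return [n for n in names if fnmatch.fnmatchcase(n, pattern)]


def fetch_matching(host, remote_dir, pattern, dest_dir, user="anonymous", password=""):
    """Download every file in remote_dir whose name matches pattern."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with FTP(host, timeout=30) as ftp:
        ftp.login(user, password)
        ftp.cwd(remote_dir)
        # nlst() may list subdirectories too; a narrow pattern helps avoid them.
        names = select_names(ftp.nlst(), pattern)
        for name in names:
            with open(dest / name, "wb") as out:
                ftp.retrbinary(f"RETR {name}", out.write)
    return names
```

A call such as fetch_matching("ftp.example.com", "/posts", "*.htm", "posts") would play the role of “mget *.htm” with prompting off.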

Wrong thinking is punished, right thinking is just as swiftly rewarded. You’ll find it an effective combination.

cornflakes · April 6, 2000, 2:26pm

Everyone, thanks for the suggestions. As of yesterday, they went back to giving each post a new number, so the need for a second archive site is gone.

The strange thing is that I can’t access the archived posts. Running a browser, you just click on an archive page which has a tar###.shtml filename, then click on whatever post (.shtml file) you want. With Offline Explorer, I’ve left the archive settings open, changed the starting URL, &c, &c, and I still can’t get any of the posts from the archives to download. The archive directories load just fine.

As far as I can tell, the guys who own the site have lost interest in it and pay the bills more as a courtesy to the participants. They rarely return e-mail, so I don’t think I could get an ftp address from them.

Anyway, it’s a moot point now. Just for the hell of it, I’ll probably play around with this. Go!Zilla, here I come!

Hey, thanks again.
