I work for a web-based software company, managing their website. We are currently in the process of moving the files from an NT-based host to a UNIX host (yay!). Anyway, I’ve run across a really, really annoying problem: filenames. You see, when the sites original creators built the pages, they gave files names with alternating case (you know, “titleGraPHIC.GIF”, that kind of thing). Why they did this, I do not know, but its causing all sorts of problems.
The NT webserver, IIS, apparently makes no distinction between cases when dealing with files. UNIX, (and in this case, Apache), does. When the original site designers were doing the pages, they didn’t pay strict attention to the cases of the images, and such, not realizing that it would matter if we switched the site over to UNIX.
Therefore, I’m in a bit of a predicament. I figure the best course of action is to rename all files in completely lower case, and then change all applicable links. I’ve already instructed everyone that from now on, all filenames are to be lowercase only, so this is a standard we will be following in the future.
So, finally, here’s the question: is there any tool out there that can open up a whole bunch (probably five, six hundred html files), grab every link, and then change all links to be lowercase only? I can’t do this through something like Word, since I need to preserve the case of the non-link text. I’ve considered (and actually started) writing a program to do this, but I realize that this will probably take many, many hours.
I’ve tried several linkcheckers, but many only point out broken links (and typically ignore case), rather than doing a mass replacement.
Any thoughts?