I have a PDF file that, when I try to open it using Acrobat 6.0, gives me the unhelpful error message: “There was an error opening this document. The file is damaged and could not be repaired.” It’s not a critical document, but it would be really nice if I could recover at least the document’s text, if not all the content. I did a little searching, but I wasn’t able to come up with any utilities that were able to help me out.
Are there any free utilities out there that can do what I’m looking for? Or should I be trying something else? Or is this file likely lost for good?
Ghostscript and GhostView may help (try Google).
Of course, you could always try wordpad.exe. A PDF file is a form of Postscript file, which is a plain text format. It may take you a while to get past the headers and font definitions to find the text, and if the text has been kerned (space adjusted) each letter or word may have its own chunk of postscript code around it. If your PDF was created from a scanned file, you will not have any chance at all, as the PDF will just be a bunch of images.