I’ve received a fairly long document in html, with all the <codes> and such. Is there some easy way to convert this into a normal Word document? That is, replacing all the &146; with ’ etc? It’s long, so going through and fixing by hand seems like a long and tedious process (which is liable to miss stuff anyhow). Is there some simpler way?
I guess I could Find/Replace all the &151; with – and so forth for each ASCII character, but I’m still stuck with how to get <BR> to a line break, how to convert text after [noparse]<i>[/noparse] to italic, etc.
(Yes, I know the ASCII code actually has a # after the &, but if I type that here, it gets parsed and the no-parse tags don’t seem to avoid it.)
From your browser, do a Select All and Copy. And then go to Word, and Paste.
I’ve done this lots of times. It’s not perfect, but it is a decent approximation. And it can take quite a long time, especially of it has loads of nested tables and such. But it usually does work, eventually.
Wait a minute. You have HTML code. Just insert that code in an HTML document and then open as HTML, highlight, copy etc.
Same concept explained another way. Does the code have tha header and footer? If not, just add it and save it as text, change extension to .htm and you’re done.
<HTML>
<HEAD></HEAD>
<BODY>
Your code goes here
</BODY></HTML>
Save as TXT, change extension to HTM. Done!
If you have Outlook Express you can use that too. There’s a hundred ways to skin this cat, all very easy.