I’ve cut and pasted about 200 pages of text into Word. Unfortunately each line of the original text has a CRLF at the end, which results in broken sentences. Is there a way to get rid of the CRLFs to keep the sentences intact, but at the same time not get rid of paragraph separations?
ETA: I’m using Word for Mac, so AFAIK I can’t use ALT+xxx codes.
This is how you’d do it in Windows: open the Find and Replace dialog and click on the “More” button, then click on “Special.” You’ll see there various invisible and other special characters like the Paragraph character.
If that doesn’t work, you can try this: highlight the offending CRLF mark and copy it, then open the Find and Replace dialog and paste it into the Find field. Put whatever you want to replace it with (a space, or whatever) in the Replace with field and replace all.
KneadToKnow: I didn’t know there was a ‘special’ option in F&R. I found it and tried to find Manual Line Break. No Joy. I tried changing to Outline View, and I saw little squares before the broken line. Like this (not actually the text I’m copying):
This is the story of the great war that Rikki-tikki-tavi fought single-handed, through the bath-rooms of the big bungalow in Segowlee
[] cantonment.
I can’t highlight the square (represented in the code box by brackets).
More specifically:
Type the characters exactly as written below. In Word, ^p means new paragraph; the ^ is the upper case 6 and the “p” is a lower case “p”.
- Are there extra lines between paragraphs. If so, go to three. If not, go to 2.
- Do a search and replace. Replace .^p (period and line space) with .^p^p and replace ."^p with ."^p^p This is not perfect, but will put a new paragraph on any line ending with a . or " There will be mistakes, of course, but it should catch most things.
- Do a search and replace to change ^p^p to ~~ (converts the extra line between paragraphs. The ~~ could be any character combination that’s not likely to appear in the text.
- Do a search and replace converting ^p (all paragraph markers). Leave nothing in the “replace with” field. This will remove all paragraph markers from the text.
- Finally, do a search and replace converting ~~ to ^p (puts paragraph marks where the tildes were put in step three).
- Adjust as necessary.
RealityChuck: After I posted the OP, I realised I could repeat the CRLFs for the paragraphs and change them to a unique string so I could change the end-of-line indicators and then change the paragraphs back later. Only I can’t see how to find and change them in Word for Macs.
For clarification: is the problem that your Find/Replace isn’t actually finding the end-of-line characters? (That is, when you do a search for ^p, nothing is found?)
Out of curiosity, what happens when you open the Find window, select the “Use wildcards” box, and do a search for “^13” (without the quotation marks)?
(I haven’t worked with a Mac in a while, so forgive me for just shooting in the dark.)
LilShieste
Aha! I was looking for end-of-line, which I assumed Word calls Manual Line Break. That’s a ^|. The Paragraph Mark is ^p and I was able to use it.
I replaced ^p^p with PARAGRAPH BREAK, then replaced ^p with a space, and then replaced PARAGRAPH BREAK with ^p^p. Now I just have to insert page breaks at the Chapters.
Thanks for the help!
Next question, pagination:
I’m starting the page numbers at zero, with the page number not appearing on the title page. Following the title page are an introduction and a table of contents. I’d like to do this: No number on the title page (as it is now), Introduction page ‘i’, Table of Contents page ‘ii’, text pages 1 through 144 (147 pages total, less the three leading pages).
IIRC, page numbering is done on a per-section basis, which means you’ll just need to specify where each section starts (e.g., Section 1 starts with the title page, Section 2 starts after the TOC).
Here’s how I would do it:
[ol]
[li] Insert the initial page numbers; style = “i”, skipping the title page[/li][li] Insert a new section directly after the TOC (Insert --> Break… --> One of the options under “Section break types”[/li][li] On the first page following the TOC, insert the new page number format; style = “1”[/li][/ol]
Does that help?
LilShieste
OK, I put a Section Break at the end of the text on the title page. Then I put a Section Break at the end of the TOC. The text is numbered correctly, starting with Page 1. The Introduction, however, is page ii and the TOC is page iii. I can’t seem to get them to i and ii.
Hmm… make sure there isn’t a page break, or something, appearing between the Title and Intro pages. The easiest way to check this is by switching to “Normal View” (if you aren’t there already), and seeing if there’s a page break directly next to a section break. If so, remove it (the section break acts as a page break).
LilShieste
That did it. Thanks!
I removed the (continuous) section breaks (yes, multiple – I kept trying it) and replaced them with a Next Page Section Break. Then I went back to Insert Page Numbers and told it not to show the page number of the first page (because there’s not an option that I can see to remove page numbering).
So it looks good now.