E-Books are Full of Errors!

Pepper Mill just decided to drag us Luddites in the CalMeacham household kicking and screaming into the 21st Century (Her words, mostly) – she bought a Nook.
The first book we bought was – well – mine. I knew it was available, because I could bring it up on the nooks at our local Barnes and Noble (and i did, every chance I got, leaving it there in the hopes of promoting sales).
So i was perusing my magnum opus when I noticed that it had spelling errors in it. Sirius was rendered Sinus, which will come as a surprise to people encountering the name of this star for the first time. I started looking closely at the text, and found numerous spelling errors – some of them pretty impressive. I hasten to point out that none of the errors i found were in the printed editions. They seem to be the result of putting the work into e-book format.
I wondered how widespread this phenomenon was, since I hadn’t heard about it before. Evidently it’s fairly common:

http://www.amazon.com/Spelling-errors-in-Kindle-ebooks/forum/Fx8K5S3V834TCX/Tx1KP4WTT41001B/1?_encoding=UTF8&asin=B001P81618

http://www.amazon.com/kindle-missing-words-and-misspellings/forum/Fx1AB9PYUFUHA2E/Tx160O7UKD6GBA0/1?_encoding=UTF8&asin=0812971183

http://www.kindleboards.com/index.php?topic=93298.0

http://www.teleread.com/ebooks/kindle-books-riddled-with-typographical-and-formatting-errors-says-bloomberg-columnist/

All of the above are about Kindle, but it seems Nook has the same issues. My book’s on Kindkle, as well, and in two e-book formats used by libraries. I haven’t checked any of those, but maybe I should.

Has anyone else heard about this?

I agree that Sirius and Sinus, Halder and Haider, Dorne and Dome all look like scanning errors. I guessing some books get scanned and sold without anyone going through and proof-reading the text.

One of the first books I purchased to read on the Kindle app was a volume of Connie Willis short stories. The profusion of scanning errors is outrageous. In particular, the letter combination “rn” is constantly rendered as “m,” so “turn” becomes “tum” and “burn” becomes “bum”–the latter error being rather pervasive in a story that takes place during the firebombing of London! Open quote marks are frequently rendered as “Cc.” It’s a mess.

So what are you going to do CalMeacham? I hope contact the publisher and get it fixed?

As noted, this is usually a scanning problem, and is something we have to live with until enough unpaid interns are on the job proof-reading every book published in e-format that was written before 1990.

Metadata is easily fixable with several different programs. I use Mobi2mobi personally.

A book I just read had some random spaces within words. The only problem with that is my poor brain can take a while. “She went to the cupboard to get out some ban dages.” When you’re reading a fantasy novel, it’s not necessarily immediately obvious that it’s a random space and not some made up word or phrase!

I’m not sure yet. According to those pages I’ve cited, Amazon has a mechanism for correcting its Kindle problems – but some readers couldn’t access the relevant pages. I don’t even know yet what Kindle says. I’ll probably contact my publisher first, and also see wht B&N says about errors on the Nook editions.

Goodness knows actual *print *publishers–even the big ones–do not “edit” anymore. They spell-check, and sometimes they have freelancers or an intern do an “edit,” which leaves numerous spelling, grammatical, punctuation and factual errors scattered through even best-sellers. Which is why I pay an editor out of my own pocket to go over my books with a microscope before it even goes to my publisher.

In the old days, publishing houses had “house editors,” lifers, who took pride in “*their *authors.” No more. I assume cramming it into e-format even further fucks things up, and no one except the author (and the poor reader) gives a damn.

I have never bought an e-book. However, I have several hundred ebooks on my iPad (all of them of books I already own in hard copy), and I do notice frequent spelling errors, spacing problems, and odd pagination. Until today, I assumed the spelling errors were caused by OCR errors when the book was scanned, and others to sloppy processing by the booklegger. I am distressed to hear that commercial ebooks are no better.

Yep. The first ebooks I bought were some of Barbara Hambly’s, and I don’t always know if she’s referring to something I’ve never come across or if this is just yet another typo, especially if it could really go either way.

There have always been typos in books. But it seems to get worse and worse.

I have just experienced exactly this problem.

Purchased The Great Gatsby from Amazon two or three days ago to read on my Android via the Kindle app. It’s like a bad OCR job. Random capitalization, spacing and some punctuation errors too.

Very disappointing.

Most books I’ve read on my ereader are fine, but one of them was clearly very badly scanned - I had to figure out what some words were by imagining the shape of the words, and figuring out what words might have similar shapes. It was still jarring to read that one character wrapped his ‘anus’ around another character - what was meant was “arms” of course (that was the most annoying error, but having a typo in a chapter title was also bad). I wrote a note to the author, who of course has no control over the situation (but she did appreciate knowing about it).

My guess is that the publishers have made a rational if irritating decision that they are best off just scanning, using OCR and then publishing. They can crowdsource the corrections later.

I don’t think the general public, and that seems to include publishers, know the difference between original data in a text file and a scanned copy. While OCR is much improved now, I was constantly bombarded with “but can’t you just scan it?” years ago when OCRing was in its infancy. The concept of a graphics vs. text file still isn’t a well-known technical difference.

It would be nice if they actually had a clear and easily-used method for accepting corrections from the masses, because that would quickly get a lot of books fixed. But so far, I haven’t found one. I’ve been limited to e-mailing lists of corrections to authors, and only the authors who self-publish have been able to get things fixed.

Maybe there’s a market here for a good third-party editing site here…

I’ve bought a number of books from the Baen Ebooks site and they are far worse than the average books I get from, say, Kobo. I’ve found sections that were so meaningless that I checked a print edition and found multiple lines missing in the middle of paragraphs.

In general, I think, new release ebooks are typically reasonably good because they’re prepared from the same electronic source as the print version. But re-releases of older books and generally done from scans and the quality is VERY variable.

You’d think e-readers themselves should have some way for you to mark a spelling or markup error, include your suggested correction (if applicable), instantly send it back to the e-publisher via your digital connection, and once approved instantly correct all editions as soon as you log on, or whatever it is you do with e-readers.

A friend of mine is a fiction author and has almost all her books in e-book format, across multiple versions, and encounters errors all the time, and she sends all her manuscripts in as PDFs, I think.

I shudder to think what it must be like for non-English alphabet editions.

I saw one recently where “smiling through his tears” became “smiling through his teats.” Which is good for a laugh but did rather take me out of the story.

Agreed. Copy editing in general has gone to hell in a handcart–if it happens at all–and seems to be getting worse… Both print and ebook editions of things that I’ve read in the past year or so have been riddled with errors, which pains me to my persnickety soul. The best-proofed books I’ve read lately have been novels by P.C. Hodgell…who enlists fan volunteers to help proofread them.

But I don’t understand the reason they would have to scan a print copy. All, or most books are available in electronic format anyway. That’s the way they are submitted to the publisher isn’t it? Even Jessica Fletcher gave up her old manual typewriter for a computer.

Bob