WebAIM - Web Accessibility In Mind

E-mail List Archives

Re: Fixing PDF OCR Errors

for

From: Karlen Communications
Date: Aug 15, 2018 1:12PM


This is why I have ABBYY PDF Transformer...aside from its cool name.

It is only $79 USD and I can open a PDF in it and it gives me better OCR
than Adobe acrobat. Since Acrobat X, I've never been able to get the "find
suspects" or find mistakes to work with the Adobe Text Recognition tool. It
even says that there are no errors but when I Tag the document, there are no
spaces between words.

I took the same document to PDF Transformer, did the OCR, saved it as PDF
again, opened it in Acrobat and was able to give it correct Tags with no OCR
errors and there were spaces between the words.

I can also open a PDF document in PDF Transformer and send it to Word for
easier reading when I have untagged PDF or poorly tagged PDF.

Cheers, Karen

-----Original Message-----
From: WebAIM-Forum < <EMAIL REMOVED> > On Behalf Of
Joseph Sherman
Sent: Wednesday, August 15, 2018 1:09 PM
To: 'WebAIM Discussion List' < <EMAIL REMOVED> >
Subject: [WebAIM] Fixing PDF OCR Errors

What's the easiest was to fix PDF OCR Errors? For example, I have a signed
one page legal memo that was scanned in after signed. I ran OCR and Acrobat
says there are no OCR suspects. But a couple of places the letter O was
recognized as the number 0.

I tried going into the tag tree and changing the actual and alt text for
that content, which seemed to work when I read it with JAWS. It there a
different way I should be doing this?


Joseph

http://webaim.org/discussion/archives