E-mail List Archives
Fixing OCR issues in PDF with Adobe Acrobat Pro
From: Jonathan Avila
Date: May 14, 2021 5:26PM
- Next message: Philip Kiff: "Re: Fixing OCR issues in PDF with Adobe Acrobat Pro"
- Previous message: allyssa jessicon: "Re: Can someone list out some differences between JAWS and NVDA screen reader when it comes to Accessibility?"
- Next message in Thread: Philip Kiff: "Re: Fixing OCR issues in PDF with Adobe Acrobat Pro"
- Previous message in Thread: None
- View all messages in this Thread
Hi all, I still have not found a great way within Acrobat to address optical character recognition (OCR) errors. The situation is that the text was incorrectly recognized but Acrobat does not perceive the issues as suspect and thus the tools typically in Acrobat to fix OCR suspects are not available. I'm not sure if there is a way to flag the content as suspect somehow - but it seems silly to not allow you to edit any of the OCR text unless it's a suspect.
OCR'd content appears to have hidden objects that represent the text for the tags structure but this text is not editable itself. While Acrobat does have an edit text option in the last couple versions that does a good job in allowing you to edit the visual content in a type face that looks like OCR'd text - I am dealing with a document that can't be edited in that way for legal reasons. I need to edit the hidden text.
In addition, hacks like use of actual text don't work with mobile devices so using that approach is not an option. The only way I have found is to artifact the object and create a new text box - but the text in that and hide it behind the image. That does work across desktop and mobile assistive technology.
I also played with the preflight option to make OCR text into layers. It does a good job converting the OCR text into a different layer that can be edited. The challenge is then merging or flattening the layers back into one. When I try that I either lose the content in all the tags or I end up with duplicated text on screen even though I have chosen to not display the layer and mark the layer as a reference layer. Has anyone had luck with this method?
Does anyone have any thoughts on how best to edit OCR text in Acrobat when you cannot edit the visual text and OCR suspects are not detected? I've tried Axes Quick for PDF but it doesn't seem to have any options for this either. I believe some programs like Abbyy Fine Reader could be used but my license for that is very old.
Best Regards,
Jonathan
- Next message: Philip Kiff: "Re: Fixing OCR issues in PDF with Adobe Acrobat Pro"
- Previous message: allyssa jessicon: "Re: Can someone list out some differences between JAWS and NVDA screen reader when it comes to Accessibility?"
- Next message in Thread: Philip Kiff: "Re: Fixing OCR issues in PDF with Adobe Acrobat Pro"
- Previous message in Thread: None
- View all messages in this Thread