WebAIM - Web Accessibility In Mind

E-mail List Archives

Re: pdf to html

From: Chagnon | PubCom
Date: Nov 9, 2017 7:16PM


Can you clarify?

"HTML" means HTML tags plus the HTML head and body (aka, structure). It can be a rudimentary, standalone webpage.



If you mean you only want tags on the text, such as H1, table, OL, etc., that's something different: it's a tagged text file.



I don't know of any software that will generate just the tags without the HTML structure or encoding. And I don't know of any software that will read this type of file, because without the HTML structure or encoding, a program really doesn't know what to do with the tags.



Hopefully, someone on the list might know of such software.
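
To illustrate the difference between a full page and "just the tags," here is a minimal Python sketch that keeps only what sits between <body> and </body> in an already exported HTML page, and does so for every .html file in a folder. The function names (body_fragment, convert_folder), the folder layout, and the UTF-8 encoding are assumptions made for the example, not anything Acrobat provides. The regular expression is deliberately crude and would need checking against real exported files.

```python
import re
from pathlib import Path

# Crude match for the body element; assumes no ">" inside the body tag's attributes.
BODY_RE = re.compile(r"<body\b[^>]*>(.*?)</body\s*>", re.IGNORECASE | re.DOTALL)


def body_fragment(page: str) -> str:
    """Return the markup inside <body>...</body>.

    If no body element is found, the input is returned (trimmed) unchanged,
    on the assumption that it is already a fragment.
    """
    match = BODY_RE.search(page)
    return match.group(1).strip() if match else page.strip()


def convert_folder(src: Path, dest: Path) -> int:
    """Write a body-only copy of every .html file in src into dest.

    Returns the number of files written. Assumes the files are UTF-8.
    """
    dest.mkdir(parents=True, exist_ok=True)
    count = 0
    for path in sorted(src.glob("*.html")):
        fragment = body_fragment(path.read_text(encoding="utf-8"))
        (dest / path.name).write_text(fragment + "\n", encoding="utf-8")
        count += 1
    return count


if __name__ == "__main__":
    sample = (
        "<!DOCTYPE html><html><head><title>T</title></head>"
        "<body><h1>Report</h1><p>Text</p></body></html>"
    )
    print(body_fragment(sample))  # prints: <h1>Report</h1><p>Text</p>
```

The result still carries whatever inline styles and class names the export added, so some cleanup would likely remain before pasting into a CMS.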



Keep in mind that HTML tags are slightly different from PDF accessibility tags, so you might not be able to convert a PDF's <L> list tags to the correct <OL> or <UL> HTML ordered and unordered list tags. Converting PDF tags to HTML tags is not a smooth, one-to-one translation.
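
A small Python sketch of why the list case is awkward: many PDF structure tags have an obvious HTML counterpart, but <L> does not, so a converter has to guess. The guess below looks at the text of each item's label (the <Lbl> tag) and picks OL only when every label looks like a number or letter. The names DIRECT_MAP and list_tag and the label pattern are hypothetical, chosen for illustration; real converters may decide differently.

```python
import re

# PDF structure tags with a straightforward HTML counterpart.
DIRECT_MAP = {
    "P": "p",
    "H1": "h1",
    "H2": "h2",
    "H3": "h3",
    "Table": "table",
    "TR": "tr",
    "TH": "th",
    "TD": "td",
    "LI": "li",
}

# Labels such as "1.", "12)", "a.", "(iv)"; an assumption, not a standard.
ORDERED_LABEL = re.compile(r"^\(?(\d+|[A-Za-z]|[ivxlcdm]+)[.)]$", re.IGNORECASE)


def list_tag(labels):
    """Guess "ol" or "ul" for a PDF <L> from the text of its item labels.

    Falls back to "ul" when there are no labels or any label is not
    number-like or letter-like (for example, a bullet character).
    """
    cleaned = [label.strip() for label in labels if label.strip()]
    if cleaned and all(ORDERED_LABEL.match(label) for label in cleaned):
        return "ol"
    return "ul"


if __name__ == "__main__":
    print(list_tag(["1.", "2.", "3."]))  # prints: ol
    print(list_tag(["•", "•"]))          # prints: ul
```

A list whose labels were left out of the tag tree, or typed as plain text inside the list body, would fall through to UL here, which is exactly the kind of mismatch to watch for.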



RE: batch processing, that's exactly what an Acrobat Action can do. Please read the first paragraph of: https://helpx.adobe.com/acrobat/using/action-wizard-acrobat-pro.html



--Bevi Chagnon



From: Lucy Greco [mailto: <EMAIL REMOVED> ]
Sent: Thursday, November 9, 2017 8:03 PM
To: <EMAIL REMOVED> ; WebAIM Discussion List < <EMAIL REMOVED> >
Subject: Re: [WebAIM] pdf to html



We don't want a page; we want just the code we can put in a CMS. The other problem is that there are so many files that we want to batch convert them.



Lucia Greco
Web Accessibility Evangelist
IST - Architecture, Platforms, and Integration
University of California, Berkeley
(510) 289-6008 skype: lucia1-greco
http://webaccess.berkeley.edu
Follow me on twitter @accessaces



On Thu, Nov 9, 2017 at 4:56 PM, Chagnon | PubCom < <EMAIL REMOVED> > wrote:

You can do that directly in Acrobat Pro DC (latest version of Acrobat makes better conversions).

File / Export To / HTML Webpage.

It does create a webpage from the PDF file, complete with all the "stuff" you might not want. If that's the case, export to Word instead and from there convert it to HTML.

To automate this process for a large number of PDFs, create an Acrobat Action.

Open the Actions Tool, and choose New Action or look for an Acrobat Action on the web.

--Bevi Chagnon

— — —
Bevi Chagnon | www.PubCom.com <http://www.PubCom.com>
Technologists, Consultants, Trainers, Designers, and Developers
for publishing & communication
| Acrobat PDF | Print | EPUBS | Sec. 508 Accessibility |
— — —

-----Original Message-----
From: WebAIM-Forum [mailto: <EMAIL REMOVED> ] On Behalf Of Lucy Greco
Sent: Thursday, November 9, 2017 4:48 PM
To: WebAIM Discussion List < <EMAIL REMOVED> >
Subject: [WebAIM] pdf to html

Hello:
I have a department that wants to convert a large number of PDF documents to HTML and wants to automate the job in some way. Does anyone have a recommendation for a tool that can do this very easily? I think they will be posting the files on a CMS when they are done, so they will only need basic HTML or, as I like to call it, CMS-ready files. Any help on this would be welcome.

Thanks,
Lucy

Lucia Greco
Web Accessibility Evangelist
IST - Architecture, Platforms, and Integration
University of California, Berkeley