WebAIM - Web Accessibility In Mind

E-mail List Archives

Thread: pdf to html

Number of posts in this thread: 5 (In chronological order)

From: Lucy Greco
Date: Thu, Nov 09 2017 2:47PM
Subject: pdf to html

Hello,
I have a department that wants to convert a large number of PDF
documents to HTML and wants to automate the job in some way. Does
anyone have a recommendation for a tool that can do this very easily? I
think they will be posting the files on a CMS when they are done, so they
will only need basic HTML or, as I like to call it, CMS-ready files. Any
help on this would be welcome. Thanks, Lucy
Lucia Greco
Web Accessibility Evangelist
IST - Architecture, Platforms, and Integration
University of California, Berkeley
(510) 289-6008 skype: lucia1-greco
http://webaccess.berkeley.edu
Follow me on twitter @accessaces

From: Tim Harshbarger
Date: Thu, Nov 09 2017 3:14PM
Subject: Re: pdf to html

Maybe pandoc would help? It is an open-source tool that can be used for converting files. I don't recall if PDF to HTML is one of the options, though.

From: Chagnon | PubCom
Date: Thu, Nov 09 2017 5:56PM
Subject: Re: pdf to html

You can do that directly in Acrobat Pro DC (latest version of Acrobat makes better conversions).

File / Export To / HTML Webpage.

It does create a webpage from the PDF file, complete with all the "stuff" you might not want. If that's the case, export to Word instead and from there convert it to HTML.
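
For the Word-to-HTML step, one possible route is pandoc, which reads .docx files and, when run without its --standalone option, writes an HTML fragment rather than a full page. The following is a minimal sketch, assuming pandoc is installed and on the PATH and that the Word exports all sit in one folder. The function names and folder names are hypothetical, not part of any tool mentioned in this thread.

```python
import subprocess
from pathlib import Path
from typing import List


def build_pandoc_command(source: Path, target: Path) -> List[str]:
    """Return the pandoc command line that converts one .docx file to HTML."""
    # Without --standalone, pandoc emits a fragment (no <html>, <head>, <body>).
    return ["pandoc", "--from", "docx", "--to", "html",
            str(source), "--output", str(target)]


def convert_folder(source_dir: Path, target_dir: Path) -> List[Path]:
    """Convert every .docx file in source_dir (hypothetical layout) to HTML."""
    target_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for source in sorted(source_dir.glob("*.docx")):
        target = target_dir / (source.stem + ".html")
        # check=True raises an error if pandoc fails on a file.
        subprocess.run(build_pandoc_command(source, target), check=True)
        written.append(target)
    return written
```

Calling convert_folder(Path("word-exports"), Path("html-fragments")) would then process the whole folder in one pass. The output would still need a manual accessibility check, since heading levels and table headers do not always survive the PDF-to-Word export.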

To automate this process for a large number of PDFs, create an Acrobat Action.

Open the Actions Tool, and choose New Action or look for an Acrobat Action on the web.

--Bevi Chagnon

— — —
Bevi Chagnon | www.PubCom.com
Technologists, Consultants, Trainers, Designers, and Developers
for publishing & communication
| Acrobat PDF | Print | EPUBS | Sec. 508 Accessibility |
— — —

From: Lucy Greco
Date: Thu, Nov 09 2017 6:03PM
Subject: Re: pdf to html

We don't want a page; we want just the code we can put in a CMS. The
other problem is that there are so many files that we want to batch
convert them.
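
One way to get from an exported webpage to just the markup a CMS wants is to keep only what sits inside the body element. The following is a minimal sketch, assuming the pages have already been exported to a folder of .html files. The function names and folder layout are hypothetical, and the regular expression is only a rough stand-in for a real HTML parser.

```python
import re
from pathlib import Path

# Matches everything between <body ...> and </body>, across line breaks.
BODY_PATTERN = re.compile(r"<body\b[^>]*>(.*?)</body\s*>",
                          re.IGNORECASE | re.DOTALL)


def extract_body(html: str) -> str:
    """Return the markup inside <body>, or the input itself if no body is found."""
    match = BODY_PATTERN.search(html)
    return match.group(1).strip() if match else html.strip()


def strip_pages(source_dir: Path, target_dir: Path) -> int:
    """Write a body-only copy of every .html file; return how many were written."""
    target_dir.mkdir(parents=True, exist_ok=True)
    count = 0
    for page in sorted(source_dir.glob("*.html")):
        text = page.read_text(encoding="utf-8", errors="replace")
        (target_dir / page.name).write_text(extract_body(text), encoding="utf-8")
        count += 1
    return count
```

Inline styles and class attributes left by the export tool are not removed here, so the fragments may still need cleanup before they are pasted into a CMS.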

From: Chagnon | PubCom
Date: Thu, Nov 09 2017 7:16PM
Subject: Re: pdf to html

Can you clarify?

"HTML" means HTML tags plus the HTML head and body (a.k.a. the structure). At its most rudimentary, it can be a standalone webpage.

If you mean you only want tags on the text, such as H1, table, OL, etc., that's something different. It's a tagged text file.

I don't know of any software that will generate just the tags without the HTML structure or encoding. And I don't know of any software that will read this type of file because, without the HTML structure or encoding, it really wouldn't know what to do with the tags.

Hopefully, someone on the list might know of such software.

Keep in mind that HTML tags are slightly different than PDF accessibility tags, so you might not be able to convert a PDF's <L> list tags to the correct <OL> or <UL> HTML ordered and unordered list tags. PDF tags to HTML tags is not a one-to-one smooth translation/conversion.
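
To illustrate why the list case is not one-to-one, here is a minimal sketch of a tag lookup in which most PDF structure tags map directly to an HTML element, while <L> needs a guess. The mapping table is deliberately incomplete, and the rule for choosing between <ol> and <ul> (looking at the text of each item's <Lbl> label) is an assumed heuristic for illustration, not something Acrobat or any converter is known to do.

```python
import re
from typing import List, Optional

# Illustrative, incomplete mapping from PDF structure tags to HTML elements.
PDF_TO_HTML = {
    "H1": "h1", "H2": "h2", "H3": "h3", "H4": "h4", "H5": "h5", "H6": "h6",
    "P": "p", "LI": "li", "Span": "span",
    "Table": "table", "TR": "tr", "TH": "th", "TD": "td",
}

# Labels such as "1.", "(2)", "a)" or "iv." suggest an ordered list.
ORDERED_LABEL = re.compile(r"^\(?(\d+|[A-Za-z]|[ivxlcdm]+)[.)]$", re.IGNORECASE)


def html_list_tag(labels: List[str]) -> str:
    """Guess 'ol' or 'ul' for a PDF <L> from its item labels (assumed heuristic)."""
    cleaned = [label.strip() for label in labels if label.strip()]
    if cleaned and all(ORDERED_LABEL.match(label) for label in cleaned):
        return "ol"
    return "ul"


def html_tag(pdf_tag: str, labels: Optional[List[str]] = None) -> str:
    """Map one PDF structure tag to an HTML element name; unknown tags become 'div'."""
    if pdf_tag == "L":
        return html_list_tag(labels or [])
    return PDF_TO_HTML.get(pdf_tag, "div")
```

A list whose labels are bullets or are missing altogether falls back to <ul>, which is exactly the kind of case that would still need a human check.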



RE: batch processing, that's exactly what an Acrobat Action can do. Please read the first paragraph of: https://helpx.adobe.com/acrobat/using/action-wizard-acrobat-pro.html

--Bevi Chagnon


