E-mail List Archives
Thread: pdf to html
Number of posts in this thread: 5 (In chronological order)
From: Lucy Greco
Date: Thu, Nov 09 2017 2:47PM
Subject: pdf to html
No previous message | Next message →
hello:
i have a department that wants to convert a large number of pdf
documents to html and wants to try and automate the job in some way does
any one have a recmendation of a tool that can do this vary easily. i
think they will be posting the files on a cms when they are done so will
only need to have basic html or as i like to call it cms ready files. any
help on this would be wellcome thanks lucy
Lucia Greco
Web Accessibility Evangelist
IST - Architecture, Platforms, and Integration
University of California, Berkeley
(510) 289-6008 skype: lucia1-greco
http://webaccess.berkeley.edu
Follow me on twitter @accessaces
From: Tim Harshbarger
Date: Thu, Nov 09 2017 3:14PM
Subject: Re: pdf to html
← Previous message | Next message →
Maybe pandoc would help? It is an OS tool that can be used for converting files. I don't recall if PDF to HTML is one of the options though.
From: Chagnon | PubCom
Date: Thu, Nov 09 2017 5:56PM
Subject: Re: pdf to html
← Previous message | Next message →
You can do that directly in Acrobat Pro DC (latest version of Acrobat makes better conversions).
File / Export To / HTML Webpage.
It does create a webpage from the PDF file, complete with all the "stuff" you might not want. If that's the case, export to Word instead and from there convert it to HTML.
To automate this process for a large number of PDFs, create an Acrobat Action.
Open the Actions Tool, and choose New Action or look for an Acrobat Action on the web.
--Bevi Chagnon
â â â
Bevi Chagnon | www.PubCom.com
Technologists, Consultants, Trainers, Designers, and Developers
for publishing & communication
| Acrobat PDF | Print | EPUBS | Sec. 508 Accessibility |
â â â
From: Lucy Greco
Date: Thu, Nov 09 2017 6:03PM
Subject: Re: pdf to html
← Previous message | Next message →
we don't want a page we want just the code we can put in a cms but the
other problem is there are so many files we want to batch convert them
Lucia Greco
Web Accessibility Evangelist
IST - Architecture, Platforms, and Integration
University of California, Berkeley
(510) 289-6008 skype: lucia1-greco
http://webaccess.berkeley.edu
Follow me on twitter @accessaces
On Thu, Nov 9, 2017 at 4:56 PM, Chagnon | PubCom < = EMAIL ADDRESS REMOVED = > wrote:
> You can do that directly in Acrobat Pro DC (latest version of Acrobat
> makes better conversions).
>
> File / Export To / HTML Webpage.
>
> It does create a webpage from the PDF file, complete with all the "stuff"
> you might not want. If that's the case, export to Word instead and from
> there convert it to HTML.
>
> To automate this process for a large number of PDFs, create an Acrobat
> Action.
>
> Open the Actions Tool, and choose New Action or look for an Acrobat Action
> on the web.
>
> --Bevi Chagnon
>
> â â â
> Bevi Chagnon | www.PubCom.com
> Technologists, Consultants, Trainers, Designers, and Developers
> for publishing & communication
> | Acrobat PDF | Print | EPUBS | Sec. 508 Accessibility |
> â â â
>
>
From: Chagnon | PubCom
Date: Thu, Nov 09 2017 7:16PM
Subject: Re: pdf to html
← Previous message | No next message
Can you clarify?
"HTML" means HTML tags plus the HTML head and body (aka, structure). It can be at a rudimentary, standalone webpage.
If you mean you only want tags on the text, such as H1, table, OL, etc. that's something different. It's a tagged text file.
I don't know of any software that will generate just the tags without the HTML structure or encoding. And I don't know of any software that will read this type of file because without the HTML structure or encoding, they really don't know what to do with the tags.
Hopefully, someone on the list might know of such software.
Keep in mind that HTML tags are slightly different than PDF accessibility tags, so you might not be able to convert a PDF's <L> list tags to the correct <OL> or <UL> HTML ordered and unordered list tags. PDF tags to HTML tags is not a one-to-one smooth translation/conversion.
RE: batch processing, that's exactly what an Acrobat Action can do. Please read the first paragraph of: <https://helpx.adobe.com/acrobat/using/action-wizard-acrobat-pro.html> https://helpx.adobe.com/acrobat/using/action-wizard-acrobat-pro.html
--Bevi Chagnon
From: Lucy Greco [mailto: = EMAIL ADDRESS REMOVED = ]
Sent: Thursday, November 9, 2017 8:03 PM
To: = EMAIL ADDRESS REMOVED = ; WebAIM Discussion List < = EMAIL ADDRESS REMOVED = >
Subject: Re: [WebAIM] pdf to html
we don't want a page we want just the code we can put in a cms but the other problem is there are so many files we want to batch convert them
Lucia Greco
Web Accessibility Evangelist
IST - Architecture, Platforms, and Integration
University of California, Berkeley
(510) 289-6008 skype: lucia1-greco
http://webaccess.berkeley.edu
Follow me on twitter @accessaces
On Thu, Nov 9, 2017 at 4:56 PM, Chagnon | PubCom < = EMAIL ADDRESS REMOVED = <mailto: = EMAIL ADDRESS REMOVED = > > wrote:
You can do that directly in Acrobat Pro DC (latest version of Acrobat makes better conversions).
File / Export To / HTML Webpage.
It does create a webpage from the PDF file, complete with all the "stuff" you might not want. If that's the case, export to Word instead and from there convert it to HTML.
To automate this process for a large number of PDFs, create an Acrobat Action.
Open the Actions Tool, and choose New Action or look for an Acrobat Action on the web.
--Bevi Chagnon
â â â
Bevi Chagnon | www.PubCom.com <http://www.PubCom.com>
Technologists, Consultants, Trainers, Designers, and Developers
for publishing & communication
| Acrobat PDF | Print | EPUBS | Sec. 508 Accessibility |
â â â