WebAIM - Web Accessibility In Mind

E-mail List Archives

Re: Html from a .pdf file, what is the best way?

From: Ron Stewart
Date: Apr 9, 2010 8:36AM


Morning,

The XHTML file would have to be extracted from the DTB folder in order to
do it directly, and the process for this depends on what type of DTB you are
creating. In a DAISY3 format you will need to extract it from the package
file. This can be done with the DAISY Pipeline or with EasyConverter, a
commercial package from Dolphin. There are probably other tools as well, but
these are the two that I use. A crude way to do this, if you want just the
straight text, is to change the file extension from .XML to .HTML and then
open the file in a browser. This will typically give you the linear text but
none of the navigability.
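
The crude rename approach could be scripted roughly as follows. This is only
a sketch: the function name copy_as_html and the file name book.xml are made
up for illustration, and it makes a copy with an .html extension rather than
renaming, so the original file in the DTB folder stays intact.

```python
import shutil
from pathlib import Path


def copy_as_html(xml_path):
    """Copy a DTB text file (.xml) next to itself with an .html extension.

    Hypothetical helper: the original .xml is left untouched, and the path
    of the new .html copy is returned.
    """
    source = Path(xml_path)
    target = source.with_suffix(".html")  # e.g. book.xml -> book.html
    shutil.copyfile(source, target)
    return target
```

The copy can then be opened by hand, or from Python with something like
webbrowser.open(copy_as_html("book.xml").resolve().as_uri()). How well the
text renders will depend on the browser and on the file itself.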

If it is in a DAISY2 format you can actually copy and paste the source from
the DTB folder. You will find two HTML docs: the NCC and another that has the
document title. It is this second file that you would open in the web
browser of your choice.
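
If there are many DTB folders to go through, picking out that second file
could be sketched as below. The assumptions here are mine: that the NCC is
named ncc.html (or ncc.htm, in any letter case) and that every other HTML
file in the folder is a content document. The function name
find_content_documents is hypothetical.

```python
from pathlib import Path


def find_content_documents(dtb_folder):
    """Return the HTML files in a DAISY2 folder, leaving out the NCC.

    Assumes the NCC is named ncc.html or ncc.htm (case-insensitive) and
    that any other .html/.htm file is a content document.
    """
    folder = Path(dtb_folder)
    return sorted(
        p for p in folder.iterdir()
        if p.is_file()
        and p.suffix.lower() in {".html", ".htm"}
        and p.stem.lower() != "ncc"
    )
```

For a typical book this should return a single path, the file named after
the document title, which can then be opened in a browser or copied
elsewhere.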

Ron Stewart