WebAIM - Web Accessibility In Mind

E-mail List Archives

Thread: PDF to HTML or XML or something useful

Number of posts in this thread: 5 (In chronological order)

From: Wayne Dick
Date: Thu, May 19 2005 3:59PM
Subject: PDF to HTML or XML or something useful

Hi, I am looking for a product that translates PDF to HTML or XML,
hopefully in an accessible format. The PDF does not work with my
WebAdapt2Me or anything that produces accessible alternative text
formats. I have a product called, imaginatively, PDFtoHTML, but its
output is very odd. I've made style sheets that civilize it, but you
can only do so much with strange markup.

So, any ideas? I'd really like to start reading the ACM Digital Library
in a little comfort.

Sincerely, Wayne Dick, Chair CECS
CSU Long Beach


From: Robinson, Norman B - Washington, DC
Date: Thu, May 19 2005 4:22PM
Subject: RE: PDF to HTML or XML or something useful

Wayne,

1. Have you ever used the online conversion tools for Adobe
PDF documents
(http://www.adobe.com/products/acrobat/access_onlinetools.html) ? I
don't know if that will meet your immediate needs.

2. Are you using the latest version of PDFtoHTML
(http://pdftohtml.sourceforge.net/)?

3. I also have previously used Ghostscript to extract the plain
text. While looking for the latest version I came across "PSTOTEXT"
(http://research.compaq.com/SRC/virtualpaper/pstotext.html), which may actually be of
more use to you.

4. Finally, I just came across PDF995
(http://www.pdf995.com/faq.html) which seems to indicate it will output
PDF to HTML - I'm guessing that would be the best option yet. Seems like
it is only 19.00 or so.

Hope that helps.

Regards,


Norman Robinson



From: Electronic Publishing Service
Date: Fri, May 20 2005 4:55AM
Subject: Re: PDF to HTML or XML or something useful

Try Adobe Acrobat Pro 7.0. It seems Adobe has some hidden technology that
will not allow PDFs made with other applications to run on all systems.

Eugene Williams
Electronic Publishing Service
Laughlin, Nevada



From: Robinson, Norman B - Washington, DC
Date: Fri, May 20 2005 1:16PM
Subject: RE: PDF to HTML or XML or something useful

Most frequently I encounter this error when the PDF is actually invalid
- but that requires reviewing the content at a source level.

I would welcome a real-world example, if you could please forward it, of a
PDF made with another application that doesn't work with Adobe 7. Let me
know the application that produced it if you know. If it opens in other
PDF tools and is standard, I'll take Adobe to task and assume it is a
defect in their product.

Regards,

Norman Robinson


From: Hoffman, Allen
Date: Fri, May 20 2005 1:57PM
Subject: Pdf to HTML or XML or something useful

See http://www.deque.com and the UnDoc products. Also look at
http://www.scansoft.com for their PDF manipulation products.


Allen Hoffman