WebAIM - Web Accessibility In Mind

E-mail List Archives

Thread: HTML to tagged PDF conversion, with alt text and other attributes?

for

Number of posts in this thread: 5 (In chronological order)

From: Paul Bohman
Date: Wed, Oct 23 2013 6:17PM
Subject: HTML to tagged PDF conversion, with alt text and other attributes?
No previous message | Next message →

I've been searching all over for a tool that convert HTML to tagged PDF in
a way that retains all tags and attributes. So far I haven't found any.
Adobe's tool creates the tags accurately, but it does not retain any of the
attributes. No alt text, no table header scopes or ids, and so on. Is there
a tool that can convert an accessible HTML file into an accessible PDF file?

Paul Bohman, PhD
Director of Training
Deque Systems, Inc
www.deque.com
703-225-0380, ext.121

From: Olaf Drümmer
Date: Thu, Oct 24 2013 1:53AM
Subject: Re: HTML to tagged PDF conversion, with alt text and other attributes?
← Previous message | Next message →

Hi Paul,

out of curiosity - what would you need this for?

And what would you expect, given that the typical HTML page comes as part of a page structure, pages contain navigation, etc. whereas your typical PDF document has all the content in one go, needs less navigation, because some of the navigation is inherent to the file format and/or you have different mechanism, like bookmarks. Would you expect that several HTMLp ages are converted at the same time, and that based on linking you would go from PDF to the other by clicking on an entry in some kind of navigation bar that existed in the HTML but also is present in the PDF converted from the HTML?

And what about dynamic features, like accordions or sliders or …

And last but not least: would expect you result to match styling in the HTML as much as possible, or are you looming at a styling more suitable for a static, page based medium like PDF?


Olaf


Am 24 Oct 2013 um 02:17 schrieb Paul Bohman < = EMAIL ADDRESS REMOVED = >:

> I've been searching all over for a tool that convert HTML to tagged PDF in
> a way that retains all tags and attributes. So far I haven't found any.
> Adobe's tool creates the tags accurately, but it does not retain any of the
> attributes. No alt text, no table header scopes or ids, and so on. Is there
> a tool that can convert an accessible HTML file into an accessible PDF file?
>
> Paul Bohman, PhD
> Director of Training
> Deque Systems, Inc
> www.deque.com
> 703-225-0380, ext.121
> > >

From: Paul Bohman
Date: Thu, Oct 24 2013 6:40AM
Subject: Re: HTML to tagged PDF conversion, with alt text and other attributes?
← Previous message | Next message →

Those are all good questions, but in my case they have already been
resolved, or at least I should say I have taken that into account in the
planning process. I have one set of styles for screen, and another for
print and PDF. The print styles strip the web navigation. Wherever there
are interactive elements, I use conditional scripting to insert screenshots
and/or text substitutes.

Basically, I want to use a single document source in HTML format that can
be accessed for various purposes, including printing or accessible ebook
via PDF.


Paul Bohman, PhD
Director of Training
Deque Systems, Inc
www.deque.com
703-225-0380, ext.121


On Thu, Oct 24, 2013 at 3:53 AM, Olaf Drümmer < = EMAIL ADDRESS REMOVED = > wrote:

> Hi Paul,
>
> out of curiosity - what would you need this for?
>
> And what would you expect, given that the typical HTML page comes as part
> of a page structure, pages contain navigation, etc. whereas your typical
> PDF document has all the content in one go, needs less navigation, because
> some of the navigation is inherent to the file format and/or you have
> different mechanism, like bookmarks. Would you expect that several HTMLp
> ages are converted at the same time, and that based on linking you would go
> from PDF to the other by clicking on an entry in some kind of navigation
> bar that existed in the HTML but also is present in the PDF converted from
> the HTML?
>
> And what about dynamic features, like accordions or sliders or …
>
> And last but not least: would expect you result to match styling in the
> HTML as much as possible, or are you looming at a styling more suitable for
> a static, page based medium like PDF?
>
>
> Olaf
>
>
> Am 24 Oct 2013 um 02:17 schrieb Paul Bohman < = EMAIL ADDRESS REMOVED = >:
>
> > I've been searching all over for a tool that convert HTML to tagged PDF
> in
> > a way that retains all tags and attributes. So far I haven't found any.
> > Adobe's tool creates the tags accurately, but it does not retain any of
> the
> > attributes. No alt text, no table header scopes or ids, and so on. Is
> there
> > a tool that can convert an accessible HTML file into an accessible PDF
> file?
> >
> > Paul Bohman, PhD
> > Director of Training
> > Deque Systems, Inc
> > www.deque.com
> > 703-225-0380, ext.121
> > > > > > >
> > > >

From: Olaf Drümmer
Date: Thu, Oct 24 2013 9:22AM
Subject: Re: HTML to tagged PDF conversion, with alt text and other attributes?
← Previous message | Next message →

Hi Paul,

you may want to have a look at PDFreactor 6 from RealObjects:
http://www.realobjects.com/products/pdfreactor/
I do know that they turn HTML tags and other information into tags in PDF (to support accessibility) but I do now know exactly the extent to which they do it, e.g. whether they cover all the relevant attributes etc. I suggest to simply get in touch with them and tell them what you are looking for. Then if they claim they have what you need, you can download samples or try their software.

Olaf



Am 24 Oct 2013 um 14:40 schrieb Paul Bohman < = EMAIL ADDRESS REMOVED = >:

> Those are all good questions, but in my case they have already been
> resolved, or at least I should say I have taken that into account in the
> planning process. I have one set of styles for screen, and another for
> print and PDF. The print styles strip the web navigation. Wherever there
> are interactive elements, I use conditional scripting to insert screenshots
> and/or text substitutes.
>
> Basically, I want to use a single document source in HTML format that can
> be accessed for various purposes, including printing or accessible ebook
> via PDF.
>
>
> Paul Bohman, PhD
> Director of Training
> Deque Systems, Inc
> www.deque.com
> 703-225-0380, ext.121
>
>
> On Thu, Oct 24, 2013 at 3:53 AM, Olaf Drümmer < = EMAIL ADDRESS REMOVED = > wrote:
>
>> Hi Paul,
>>
>> out of curiosity - what would you need this for?
>>
>> And what would you expect, given that the typical HTML page comes as part
>> of a page structure, pages contain navigation, etc. whereas your typical
>> PDF document has all the content in one go, needs less navigation, because
>> some of the navigation is inherent to the file format and/or you have
>> different mechanism, like bookmarks. Would you expect that several HTMLp
>> ages are converted at the same time, and that based on linking you would go
>> from PDF to the other by clicking on an entry in some kind of navigation
>> bar that existed in the HTML but also is present in the PDF converted from
>> the HTML?
>>
>> And what about dynamic features, like accordions or sliders or …
>>
>> And last but not least: would expect you result to match styling in the
>> HTML as much as possible, or are you looming at a styling more suitable for
>> a static, page based medium like PDF?
>>
>>
>> Olaf
>>
>>
>> Am 24 Oct 2013 um 02:17 schrieb Paul Bohman < = EMAIL ADDRESS REMOVED = >:
>>
>>> I've been searching all over for a tool that convert HTML to tagged PDF
>> in
>>> a way that retains all tags and attributes. So far I haven't found any.
>>> Adobe's tool creates the tags accurately, but it does not retain any of
>> the
>>> attributes. No alt text, no table header scopes or ids, and so on. Is
>> there
>>> a tool that can convert an accessible HTML file into an accessible PDF
>> file?
>>>
>>> Paul Bohman, PhD
>>> Director of Training
>>> Deque Systems, Inc
>>> www.deque.com
>>> 703-225-0380, ext.121
>>> >>> >>> >>
>> >> >> >>
> > >

From: Paul Bohman
Date: Thu, Oct 24 2013 11:42AM
Subject: Re: HTML to tagged PDF conversion, with alt text and other attributes?
← Previous message | No next message

Thank you! This does look promising. I did a quick test and at least it
saved the alt attribute, so that's already a huge step in the right
direction. For anyone else who may be curious, this tool doesn't seem to
have any kind of graphic user interface. It's a web developer tool, meant
to be installed on a web server, or I suppose you can run it locally too,
which is what I did, but I had to use the command line. It also integrates
with Java, PHP, Ruby, Perl, and other languages. This may in fact be
exactly what I want. I'll have to do some more testing before I know for
sure. But yes, thank you. This seems to be much better than everything else
I was finding.


Paul Bohman, PhD
Director of Training
Deque Systems, Inc
www.deque.com
703-225-0380, ext.121


On Thu, Oct 24, 2013 at 11:22 AM, Olaf Drümmer < = EMAIL ADDRESS REMOVED = > wrote:

> Hi Paul,
>
> you may want to have a look at PDFreactor 6 from RealObjects:
> http://www.realobjects.com/products/pdfreactor/
> I do know that they turn HTML tags and other information into tags in PDF
> (to support accessibility) but I do now know exactly the extent to which
> they do it, e.g. whether they cover all the relevant attributes etc. I
> suggest to simply get in touch with them and tell them what you are looking
> for. Then if they claim they have what you need, you can download samples
> or try their software.
>
> Olaf
>
>
>
> Am 24 Oct 2013 um 14:40 schrieb Paul Bohman < = EMAIL ADDRESS REMOVED = >:
>
> > Those are all good questions, but in my case they have already been
> > resolved, or at least I should say I have taken that into account in the
> > planning process. I have one set of styles for screen, and another for
> > print and PDF. The print styles strip the web navigation. Wherever there
> > are interactive elements, I use conditional scripting to insert
> screenshots
> > and/or text substitutes.
> >
> > Basically, I want to use a single document source in HTML format that can
> > be accessed for various purposes, including printing or accessible ebook
> > via PDF.
> >
> >
> > Paul Bohman, PhD
> > Director of Training
> > Deque Systems, Inc
> > www.deque.com
> > 703-225-0380, ext.121
> >
> >
> > On Thu, Oct 24, 2013 at 3:53 AM, Olaf Drümmer < = EMAIL ADDRESS REMOVED = > wrote:
> >
> >> Hi Paul,
> >>
> >> out of curiosity - what would you need this for?
> >>
> >> And what would you expect, given that the typical HTML page comes as
> part
> >> of a page structure, pages contain navigation, etc. whereas your typical
> >> PDF document has all the content in one go, needs less navigation,
> because
> >> some of the navigation is inherent to the file format and/or you have
> >> different mechanism, like bookmarks. Would you expect that several HTMLp
> >> ages are converted at the same time, and that based on linking you
> would go
> >> from PDF to the other by clicking on an entry in some kind of navigation
> >> bar that existed in the HTML but also is present in the PDF converted
> from
> >> the HTML?
> >>
> >> And what about dynamic features, like accordions or sliders or …
> >>
> >> And last but not least: would expect you result to match styling in the
> >> HTML as much as possible, or are you looming at a styling more suitable
> for
> >> a static, page based medium like PDF?
> >>
> >>
> >> Olaf
> >>
> >>
> >> Am 24 Oct 2013 um 02:17 schrieb Paul Bohman < = EMAIL ADDRESS REMOVED = >:
> >>
> >>> I've been searching all over for a tool that convert HTML to tagged PDF
> >> in
> >>> a way that retains all tags and attributes. So far I haven't found any.
> >>> Adobe's tool creates the tags accurately, but it does not retain any of
> >> the
> >>> attributes. No alt text, no table header scopes or ids, and so on. Is
> >> there
> >>> a tool that can convert an accessible HTML file into an accessible PDF
> >> file?
> >>>
> >>> Paul Bohman, PhD
> >>> Director of Training
> >>> Deque Systems, Inc
> >>> www.deque.com
> >>> 703-225-0380, ext.121
> >>> > >>> > >>> > >>
> >> > >> > >> > >>
> > > > > > >
> > > >