WebAIM - Web Accessibility In Mind

E-mail List Archives

Re: HTML/CSS-to-PDF-engines that Produce Tagged PDFs

for

From: chagnon@pubcom.com
Date: Apr 1, 2019 10:53AM


Brandon wrote:
"PDF tags and HTML tags are almost identical."

There are significant differences between HTML and PDF tags. Items like lists, tables, footnotes, indexes, tables of content, and many others either don't exist in HTML at all or are vastly different.

Bottom line: a PDF isn't anything like an HTML webpage, and neither are PDF tags.

—Bevi
— — —
Bevi Chagnon, founder/CEO | <EMAIL REMOVED>
— — —
PubCom: Technologists for Accessible Design + Publishing
consulting ' training ' development ' design ' sec. 508 services
Upcoming classes at www.PubCom.com/classes
— — —
Latest blog-newsletter – Accessibility Tips at www.PubCom.com/blog

-----Original Message-----
From: WebAIM-Forum < <EMAIL REMOVED> > On Behalf Of Brandon Keith Biggs
Sent: Monday, April 1, 2019 11:19 AM
To: WebAIM Discussion List < <EMAIL REMOVED> >
Subject: Re: [WebAIM] HTML/CSS-to-PDF-engines that Produce Tagged PDFs

Hello,
If the HTML is properly tagged, why is there any question about the PDF?
Isn't it almost a direct conversion? I've never programmed PDF, but from what I've seen with my screen reader, PDF tags and HTML tags are almost identical.
Thanks,

Brandon Keith Biggs <http://brandonkeithbiggs.com/>;


On Mon, Apr 1, 2019 at 8:16 AM < <EMAIL REMOVED> > wrote:

> First, PDFLib makes excellent PDF tools.
> Second, no automated tool can make a perfectly accessible PDF. Still
> needs a trained human to review and test the PDF.
>
> Any automated tool can tag content in a PDF.
> The problems arise when the tags need to be assessed: are they the
> correct tags for the content (such as all P tags without any heading
> tags)? Are they in a logical reading order? Only a human can verify these items.
>
> Artificial intelligence isn't very developed or intelligent at this
> stage of the industry, and every automated tool uses A I to autotag a file.
>
> --Bevi Chagnon
> — — —
> Bevi Chagnon, founder/CEO | <EMAIL REMOVED> — — —
> PubCom: Technologists for Accessible Design + Publishing consulting '
> training ' development ' design ' sec. 508 services Upcoming classes
> at www.PubCom.com/classes — — — Latest blog-newsletter – Accessibility
> Tips at www.PubCom.com/blog
>
> -----Original Message-----
> From: WebAIM-Forum < <EMAIL REMOVED> > On Behalf Of
> Detlev Fischer
> Sent: Monday, April 1, 2019 10:45 AM
> To: <EMAIL REMOVED>
> Subject: Re: [WebAIM] HTML/CSS-to-PDF-engines that Produce Tagged PDFs
>
> We use PDF Lib https://www.pdflib.com/ - not sure how good it is though.
> Detlev
>
> Am 31.03.2019 um 16:49 schrieb Brandon Keith Biggs:
> > Hello,
> > Does anyone know of an HTML/CSS-to-PDF-engine that produces a
> > properly tagged PDF?
> > One would think all the engines would do this, but the demo pages for:
> > weasyprint <https://weasyprint.org/samples/> and prince
> > <https://www.princexml.com/samples/>
> > are not tagged.
> > Thanks,
> >
> > Brandon Keith Biggs <http://brandonkeithbiggs.com/>;
> > > > > > archives at http://webaim.org/discussion/archives
> > >
> --
> Detlev Fischer
> Testkreis
> Werderstr. 34, 20144 Hamburg
>
> Mobil +49 (0)157 57 57 57 45
>
> http://www.testkreis.de
> Beratung, Tests und Schulungen für barrierefreie Websites
>
> > > archives at http://webaim.org/discussion/archives
> >
> > > archives at http://webaim.org/discussion/archives
> >