WebAIM - Web Accessibility In Mind

E-mail List Archives

Thread: PDF Accessibility

for

Number of posts in this thread: 3 (In chronological order)

From: Duff Johnson
Date: Wed, Jun 26 2013 5:36PM
Subject: PDF Accessibility
No previous message | Next message →

Anya,

These are good questions; I'll try to offer specific answers accordingly.

> What I don't really understand is what my tags should actually be

Broadly speaking, it's the same question as you have in HTML - do the tags reflect the document's structure? Many of the most important PDF tags are similar to HTML tags.

> and how to fix them.

If using Adobe Acrobat XI, change the tag-type in the Tags panel (select the tag, edit the tag's name). You can, in principle, also use the Touch Up Reading Order Tool, but I'm (deliberately) avoiding that advice in this context.

> For example, when I run the Accessibilty check on the document,
> it passes, but when I view the tags, they don't make sense to me.

Yes; Acrobat's Accessibility Checker, in its current incarnation, does not offer a means of reviewing or validating the appropriateness of tags. :-(

I suggest you check out callas's pdfGoHTML (it's free) as one way to visualize the document's semantics. The VIP reader (mentioned below) is another, but neither of these is a validation tool.

> I am usually not lucky enough to get simple <H#s> and <Ps>, instead I get
> <Normal> <1stpara><P> with one line of the paragraph instead of the
> entire paragraph or even just a carriage return.

This sounds entirely normal. Unfortunate, but normal. We await smarter software; we cheer on those brave developers who dive into the fray. From a software development point of view, attempting to add tags to existing PDF files using automation is a very challenging problem.

> Usually my bulleted
> lists turn out as some kind of paragraph style. Do I need to go through
> every tag and change the properties on it to make it perfect, like HTML,
> or if it passes the Accessbility check and the reading order is OK, is
> that sufficient?

It's a fair question, and it illustrates my point above; today's Acrobat Accessibility Checker ignores certain features that are critical to accessibility.

Both logical reading order (tag order) *and* tag semantics (H#, P, TH etc) must be correct. It is *not* OK to ignore incorrect tags. Checking for the appropriateness of tags is a critical aspect of accessibility remediation in PDF just as it is in HTML.

I've seen plenty of tables in PDF tagged as a "Figure," list-items tagged as paragraphs and headings tagged as Notes. Would that be OK in HTML? Hell no! So it's not OK in PDF either.

That said, some tags matter much more than others. <Sect>, <Art>, <Div>, <Part> tags don't add value in current-generation AT, for example. There are others (<Note> comes to mind), but you should still tag correctly because over time AT *does* slowly deliver a better, richer job of reporting PDF document semantics to the reader.

> and I don't want to get into a political discussion about whether PDFs can be accessible.

It's an antiquated argument in any event. :-) The fact is that more and more software supports well-tagged PDF everyday. Who really wants to "defend" software that doesn't support the accessibility features in the world's chosen electronic document file-format??

Indeed, an all-new, totally free, PDF viewer for visually-impaired users on Mac, Windows and Linux was just released on Monday of this week! Here's the link for those who are interested:

http://www.szb.ch/en/press/press-releases/the-first-pdf-reader-for-visually-impaired-people.html

Of course, it works beautifully with *well tagged* PDF files and works no better than a 10 year-old cell-phone on untagged PDF.

I'll be writing more about the exciting new VIP Reader down the road.

but I digress...

> I want to provide what is both legally required and what is desirable to the users.

That's as concise and reasonable a statement as one might wish for.

Duff.

From: Chagnon | PubCom
Date: Wed, Jun 26 2013 10:28PM
Subject: Re: PDF Accessibility
← Previous message | Next message →

> I am usually not lucky enough to get simple <H#s> and <Ps>, instead I
> get <Normal> <1stpara><P> with one line of the paragraph instead of
> the entire paragraph or even just a carriage return.

This sounds like your PDF was made from InDesign. The PDF exporter from
InDesign usually takes the first half-dozen characters of the Paragraph
Style's name and creates a tag from it. Then, to correct the error, it uses
Acrobat's Role Map utility to designate <Normal> to be read as <P>.

So, check Role Map and edit any of the mappings that it didn't get right.
You can go and change every individual <Normal> to <P>, for example, but
that's not worth the time when Role Map gets the same result but quicker.

Also in InDesign, use the Export Tags feature in CS 5.5, CS6, and CC (CS7).
This way you can tell each paragraph style what it should map to when
exported to an accessible PDF.

> Usually my bulleted
> lists turn out as some kind of paragraph style.

Looks like you might not be formatting your bullets with the correct styles.

In Word, rather than using the bullet icon at the top, instead use the
paragraph style for List Bullets. I find it translates accurately every
time.
In InDesign, you must create a paragraph style and define the Bullet
settings in it. Hand formatted bullets don't export to PDF correctly, but
those created through a paragraph style create the correct <L> and <LI>
tags.

--Bevi Chagnon
- - - -
www.PubCom.com - Trainers, Consultants, Designers, Developers.
Print, Web, Acrobat, XML, eBooks, and U.S. Federal Section 508
Accessibility.
New Sec. 508 Workshop & EPUBs Tour in 2013 - www.Workshop.Pubcom.com

From: Jonathan Metz
Date: Thu, Jun 27 2013 6:51AM
Subject: Re: PDF Accessibility
← Previous message | No next message

On 6/26/13 4:59 PM, "Anya Levysmith" < = EMAIL ADDRESS REMOVED = > wrote:

>What I don¹t really understand is what my tags should actually be and how
>to fix
>them.

A List Apart has a very good, yet old, article on PDF Tags:
http://alistapart.com/d/pdf_accessibility/PDFtags.html

There are explanations of what the tags specifically mean at the bottom of
the page. Remember in PDF, unlike in HTML, there are no closing tags. Then
take all of Duff and Bevi¹s recommendations (though dealing with the Role
Map dialog can be incredibly confusing, IMO).


You will (unfortunately) need to use the Reading Order panel to mark up
tables. There is a very good PDF from Accessde.info
(http://www.pdf-accessible.com/en/accede-manuals/) that has details on
correctly tagging tables and what the difference is between a complex
table and a simple table. The PDF focuses on Acrobat 9, but the formula is
still the same. It starts (roughly) on page 59 or so.

One thing I¹ll say about using <Sect>, <Art>, <Div>, <Part> is that it can
be helpful for you to use them for organizing really long documents. It
breaks up the PDF tags into sections and makes it a little easier to find
your place later in the document.

HTH
Jonathan