WebAIM - Web Accessibility In Mind

E-mail List Archives

Thread: PDF files have user default language set for some tags (<P>, <LBody>, <Link>)

for

Number of posts in this thread: 7 (In chronological order)

From: Dona Patrick
Date: Thu, Apr 06 2017 10:00AM
Subject: PDF files have user default language set for some tags (<P>, <LBody>, <Link>)
No previous message | Next message →

I've just finished remediating a 370 page PDF Spanish language file. I ran
it through JAWS and realized that some of the text was being read in
Spanish but some were being read in English. When I checked the content of
the tags that were being read in English I noticed that the tag properties
had English - US listed as the language.

I'd not noticed this before and wondered if I'd accidently enabled a
setting in either Acrobat Pro or Word to make this happen. I tested it on
both my home and work computers and found that when a Word file was
converted to PDF using the Acrobat ribbon or the Save As PDF option on the
File menu, English -- US was applied to many tags. When I used the Save
option on the File menu and chose PDF from the Save As dialog box, this did
not happen -- the language was clear in the tags properties.

Is there any way to remove the language from all of the tag properties in
one go or am I doomed to do it for each and every tag? I promised this file
to the client by next Thursday and have already spent far too long on it.

Are there settings I have enabled that I should disable to stop this from
happening in the future?

I did see a thread that discussed this, but it only discussed this issue in
regards to InDesign: http://webaim.org/discussion/mail_thread?threadV87

Thank you,

Dona

From: JP Jamous
Date: Thu, Apr 06 2017 10:10AM
Subject: Re: PDF files have user default language set for some tags(<P>, <LBody>, <Link>)
← Previous message | Next message →

I am afraid you have to do it manually. Word does that in case you use different languages. When you convert to PDF, some settings get lost during the conversion. I know it is a royal pain.

-----Original Message-----
From: WebAIM-Forum [mailto: = EMAIL ADDRESS REMOVED = ] On Behalf Of Dona Patrick
Sent: Thursday, April 6, 2017 11:01 AM
To: WebAIM Discussion List < = EMAIL ADDRESS REMOVED = >
Subject: [WebAIM] PDF files have user default language set for some tags (<P>, <LBody>, <Link>)

I've just finished remediating a 370 page PDF Spanish language file. I ran it through JAWS and realized that some of the text was being read in Spanish but some were being read in English. When I checked the content of the tags that were being read in English I noticed that the tag properties had English - US listed as the language.

I'd not noticed this before and wondered if I'd accidently enabled a setting in either Acrobat Pro or Word to make this happen. I tested it on both my home and work computers and found that when a Word file was converted to PDF using the Acrobat ribbon or the Save As PDF option on the File menu, English -- US was applied to many tags. When I used the Save option on the File menu and chose PDF from the Save As dialog box, this did not happen -- the language was clear in the tags properties.

Is there any way to remove the language from all of the tag properties in one go or am I doomed to do it for each and every tag? I promised this file to the client by next Thursday and have already spent far too long on it.

Are there settings I have enabled that I should disable to stop this from happening in the future?

I did see a thread that discussed this, but it only discussed this issue in regards to InDesign: http://webaim.org/discussion/mail_thread?threadV87

Thank you,

Dona

From: Trafford, Logan
Date: Thu, Apr 06 2017 11:14AM
Subject: Re: PDF files have user default language set for sometags(<P>, <LBody>, <Link>)
← Previous message | Next message →

CommonLook Global Access for PDF has a feature that can solve that problem in one fell swoop. It essentially applies the same fix to all similar issues within the document. We run into the same problem quite often as we create bilingual French/English documents. Unfortunately you need to have the CommonLook add-on for Acrobat.

Logan

-----Original Message-----
From: WebAIM-Forum [mailto: = EMAIL ADDRESS REMOVED = ] On Behalf Of JP Jamous
Sent: Thursday, April 06, 2017 12:11 PM
To: 'WebAIM Discussion List' < = EMAIL ADDRESS REMOVED = >
Subject: Re: [WebAIM] PDF files have user default language set for some tags (<P>, <LBody>, <Link>)

I am afraid you have to do it manually. Word does that in case you use different languages. When you convert to PDF, some settings get lost during the conversion. I know it is a royal pain.

-----Original Message-----
From: WebAIM-Forum [mailto: = EMAIL ADDRESS REMOVED = ] On Behalf Of Dona Patrick
Sent: Thursday, April 6, 2017 11:01 AM
To: WebAIM Discussion List < = EMAIL ADDRESS REMOVED = >
Subject: [WebAIM] PDF files have user default language set for some tags (<P>, <LBody>, <Link>)

I've just finished remediating a 370 page PDF Spanish language file. I ran it through JAWS and realized that some of the text was being read in Spanish but some were being read in English. When I checked the content of the tags that were being read in English I noticed that the tag properties had English - US listed as the language.

I'd not noticed this before and wondered if I'd accidently enabled a setting in either Acrobat Pro or Word to make this happen. I tested it on both my home and work computers and found that when a Word file was converted to PDF using the Acrobat ribbon or the Save As PDF option on the File menu, English -- US was applied to many tags. When I used the Save option on the File menu and chose PDF from the Save As dialog box, this did not happen -- the language was clear in the tags properties.

Is there any way to remove the language from all of the tag properties in one go or am I doomed to do it for each and every tag? I promised this file to the client by next Thursday and have already spent far too long on it.

Are there settings I have enabled that I should disable to stop this from happening in the future?

I did see a thread that discussed this, but it only discussed this issue in regards to InDesign: http://webaim.org/discussion/mail_thread?threadV87

Thank you,

Dona
This e-mail originates from the City of Ottawa e-mail system. Any distribution, use or copying of this e-mail or the information it contains by other than the intended recipient(s) is unauthorized. Thank you.

Le présent courriel a été expédié par le système de courriels de la Ville d'Ottawa. Toute distribution, utilisation ou reproduction du courriel ou des renseignements qui s'y trouvent par une personne autre que son destinataire prévu est interdite. Je vous remercie de votre collaboration.

From: Karlen Communications
Date: Thu, Apr 06 2017 11:30AM
Subject: Re: PDF files have user default language set for some tags(<P>, <LBody>, <Link>)
← Previous message | Next message →

I've had a problem with Word over the 2013/2016 cycle where the language of some parts of my documents get switched to FR instead of EN even though they are EN words. I know this because my screen reader starts reading EN words as FR and the dictionary used to spell check suddenly changes to FR. If you go to the Review Ribbon, Language and choose Set Proofing Language, there is a check box to automatically detect languages which I've unchecked and this seems to help.

To be on the safe side, I usually select my document and go through the Review Ribbon to set the default language for the entire document to EN just before I convert it. Doesn't take long, a few keyboard commands, but can save a lot of time in Acrobat.

Cheers, Karen

-----Original Message-----
From: WebAIM-Forum [mailto: = EMAIL ADDRESS REMOVED = ] On Behalf Of Dona Patrick
Sent: April 6, 2017 12:01 PM
To: WebAIM Discussion List < = EMAIL ADDRESS REMOVED = >
Subject: [WebAIM] PDF files have user default language set for some tags (<P>, <LBody>, <Link>)

I've just finished remediating a 370 page PDF Spanish language file. I ran it through JAWS and realized that some of the text was being read in Spanish but some were being read in English. When I checked the content of the tags that were being read in English I noticed that the tag properties had English - US listed as the language.

I'd not noticed this before and wondered if I'd accidently enabled a setting in either Acrobat Pro or Word to make this happen. I tested it on both my home and work computers and found that when a Word file was converted to PDF using the Acrobat ribbon or the Save As PDF option on the File menu, English -- US was applied to many tags. When I used the Save option on the File menu and chose PDF from the Save As dialog box, this did not happen -- the language was clear in the tags properties.

Is there any way to remove the language from all of the tag properties in one go or am I doomed to do it for each and every tag? I promised this file to the client by next Thursday and have already spent far too long on it.

Are there settings I have enabled that I should disable to stop this from happening in the future?

I did see a thread that discussed this, but it only discussed this issue in regards to InDesign: http://webaim.org/discussion/mail_thread?threadV87

Thank you,

Dona

From: Philip Kiff
Date: Thu, Apr 06 2017 2:47PM
Subject: Re: PDF files have user default language set for some tags(<P>, <LBody>, <Link>)
← Previous message | Next message →

Dona Patrick wrote:
> Is there any way to remove the language from all of the tag properties
> in one go or am I doomed to do it for each and every tag?

Like CommonLook Global Access mentioned by Logan, axesPDF provides a
quick method to make such global changes. In axesPDF, you can mass edit
any property on all selected tags in a PDF at once. But these are both
very, very expensive pieces of software.


> Are there settings I have enabled that I should disable to stop this
> from happening in the future?

You should be able to eliminate this problem by making sure that none of
your styles in Word include language settings. The default language for
the document should be set, plus the Normal style in your template
should either have no language or should have the language that matches
your global preference. All other styles should have no language
attribute applied. I work in French and English simultaneously and I
find it is easy to end up with a style with a language attribute applied
accidentally - especially if you use the "Update [style] to match
selection" feature on a file that has multiple language codes present.

Also, still working in Word, you can use advanced find-and-replace to
replace all instances of one language with another. Language appears as
an option under the "Format" drop-down in the advanced search options in
the Find and Replace dialog. If using this strategy, you may need to
repeat the process in your headers, footers, and any text boxes or
figures with text.

Finally, always use the paste as "unformatted text" option so you don't
bring language codes from other files or programs.

Phil.

From: Dona Patrick
Date: Fri, Apr 07 2017 9:13AM
Subject: Re: PDF files have user default language set for some tags (<P>, <LBody>, <Link>)
← Previous message | Next message →

Thank you to everyone who responded. Very helpful information.

Dona

On Thu, Apr 6, 2017 at 4:47 PM, Philip Kiff < = EMAIL ADDRESS REMOVED = > wrote:

> Dona Patrick wrote:
>
>> Is there any way to remove the language from all of the tag properties
>> in one go or am I doomed to do it for each and every tag?
>>
>
> Like CommonLook Global Access mentioned by Logan, axesPDF provides a quick
> method to make such global changes. In axesPDF, you can mass edit any
> property on all selected tags in a PDF at once. But these are both very,
> very expensive pieces of software.
>
>
> Are there settings I have enabled that I should disable to stop this
>> from happening in the future?
>>
>
> You should be able to eliminate this problem by making sure that none of
> your styles in Word include language settings. The default language for the
> document should be set, plus the Normal style in your template should
> either have no language or should have the language that matches your
> global preference. All other styles should have no language attribute
> applied. I work in French and English simultaneously and I find it is easy
> to end up with a style with a language attribute applied accidentally -
> especially if you use the "Update [style] to match selection" feature on a
> file that has multiple language codes present.
>
> Also, still working in Word, you can use advanced find-and-replace to
> replace all instances of one language with another. Language appears as an
> option under the "Format" drop-down in the advanced search options in the
> Find and Replace dialog. If using this strategy, you may need to repeat the
> process in your headers, footers, and any text boxes or figures with text.
>
> Finally, always use the paste as "unformatted text" option so you don't
> bring language codes from other files or programs.
>
> Phil.
>
> > > > >

From: Jonathan Cohn
Date: Sun, Apr 09 2017 10:38AM
Subject: Re: PDF files have user default language set for some tags (<P>, <LBody>, <Link>)
← Previous message | No next message

So does this also imply that if I create a style for a right to left language and very specifically mark it as such, that all I would have to do to enter text in that language (which off course has its own keyboard layout) would be to switch to that style?

Thanks,
,

Jonathan



> On Apr 6, 2017, at 4:47 PM, Philip Kiff < = EMAIL ADDRESS REMOVED = > wrote:
>
> Dona Patrick wrote:
>> Is there any way to remove the language from all of the tag properties
>> in one go or am I doomed to do it for each and every tag?
>
> Like CommonLook Global Access mentioned by Logan, axesPDF provides a quick method to make such global changes. In axesPDF, you can mass edit any property on all selected tags in a PDF at once. But these are both very, very expensive pieces of software.
>
>
>> Are there settings I have enabled that I should disable to stop this
>> from happening in the future?
>
> You should be able to eliminate this problem by making sure that none of your styles in Word include language settings. The default language for the document should be set, plus the Normal style in your template should either have no language or should have the language that matches your global preference. All other styles should have no language attribute applied. I work in French and English simultaneously and I find it is easy to end up with a style with a language attribute applied accidentally - especially if you use the "Update [style] to match selection" feature on a file that has multiple language codes present.
>
> Also, still working in Word, you can use advanced find-and-replace to replace all instances of one language with another. Language appears as an option under the "Format" drop-down in the advanced search options in the Find and Replace dialog. If using this strategy, you may need to repeat the process in your headers, footers, and any text boxes or figures with text.
>
> Finally, always use the paste as "unformatted text" option so you don't bring language codes from other files or programs.
>
> Phil.
> > > >