E-mail List Archives
Re: Does pdfGoHTML not recognize Actual Text?
From: Olaf Drümmer
Date: Jul 18, 2013 9:39AM
- Next message: McMorland, Gabriel: "PDF table editor disrupting reading order?"
- Previous message: Duff Johnson: "Re: Does pdfGoHTML not recognize Actual Text?"
- Next message in Thread: None
- Previous message in Thread: Duff Johnson: "Re: Does pdfGoHTML not recognize Actual Text?"
- View all messages in this Thread
Hi Jonathan,
the easiest way to let callas software know about issues or feature requests - whether for our commercial products or the free callas pdfGoHTML - would be an email to
<EMAIL REMOVED>
[we usually have a turn around time for the first substantial answer of less than 24 hours].
If you can - can you send directly to me the file you are struggling with (or a sample file that shows the ActualText issue)?
In principle ActualText should work, but maybe pdfGHoHTML is missing some aspect.
Thanks,
Olaf
Am 18 Jul 2013 um 16:46 schrieb Jonathan Metz < <EMAIL REMOVED> >:
> I feel like an idiot for not having done pdfGoHTML sooner. At least that
> way Id know that the first page was somehow getting hidden from
> everything! I ran it and it wasnt picking up the two culprit paragraphs.
> Redoing the OCR proved successful, to a point.
>
> It appears as though the pdfGoHTML isnt picking up the Actual Text of
> the tag. I tried multiple approaches. If I use Actual Text at all, the
> content is completely hidden. Ive gone so far as to test what would
> happen if I just made some of the text an image and applied the Actual
> Text. However, if I use Alternate text, it shows up in the conversion.
>
> At this point Im just trying to deduce if pdfGoHTML is having this
> problem, or if the file is still screwy. I tried to see if there was a
> feature request option on Callas, but I couldnt figure out where to look
> for that regarding free software. It would be cool if it replaced the
> error content that is tagged with Actual Text as the actual text thats
> supposed to be read. Of course, it might still do that but Ive still got
> a bad file regardless.
>
> Any thoughts?
>
> Jonathan
>
>
>
>
> On 7/17/13 9:56 PM, "Jonathan Metz" < <EMAIL REMOVED> > wrote:
>
>> Thanks for the response, Olaf.
>>
>>
>> Yes, I forgot to mention that Acrobat crashes too. I haven¹t installed
>> pdfGoHTML on this computer yet, but a good idea none the less.
>>
>> Whats the name of that other PDF reader that works with NVDA? I just can¹t
>> remember the name and want to give that a whirl.
>>
>> Should I just try OCRing that page that¹s causing me trouble and see if
>> that helps any?
>>
>> Thanks,
>> Jonathan
>>
>> On 7/17/13 6:10 PM, "Olaf Drümmer" < <EMAIL REMOVED> > wrote:
>>
>>> Hi Jonathan,
>>>
>>> Am 17 Jul 2013 um 20:35 schrieb Jonathan Metz
>>> < <EMAIL REMOVED> >:
>>>
>>>> When I use Adobe¹s Read Out Loud feature¹, Acrobat force closes.
>>>
>>> that looks like the PDF might have syntactical problems.
>>>
>>> Could you also try to
>>> - use Acrobat Pro and save as accessible text - what do you get?
>>> - use callas pdfGoHTML - what do you get?
>>>
>>> Background info: it's more or less the same engine that is working inside
>>> Adobe Reader and Adobe Acrobat for any of the above, and also for how
>>> NVDA gets access to the PDF file's content. If NVDA doesn't give you
>>> much, it might be because Adobe Reader is struggling and does not give
>>> much to NVDA to begin with.
>>>
>>> Olaf
>>>
>>> >>> >>> >>
>> >> >> >
> > >
- Next message: McMorland, Gabriel: "PDF table editor disrupting reading order?"
- Previous message: Duff Johnson: "Re: Does pdfGoHTML not recognize Actual Text?"
- Next message in Thread: None
- Previous message in Thread: Duff Johnson: "Re: Does pdfGoHTML not recognize Actual Text?"
- View all messages in this Thread