[jawsscripts] Re: OCR scripting project

  • From: Soronel Haetir <soronel.haetir@xxxxxxxxx>
  • To: jawsscripts@xxxxxxxxxxxxx
  • Date: Fri, 29 Jan 2010 12:10:23 -0900

Reading the couple text files included with that I suspect you are
running into the limitation that it can only deal with 400 and 600 dpi
images.  I'll mess around with it some, have some math heavy djvu
files I've been trying to run through tesseract with little success.

On 1/29/10, Donald Marang <donald.marang@xxxxxxxxx> wrote:
> Thanks for filling us in on this promising investigation.  It is a shame
> that there does not appear to be a common image file format!  I will look
> into this free software after posting the next version of OCRSnapshot.
>
> For now, I am still doing a crash course on creating objects in JAWS.
> Thanks Jamal as well for the last message which provided great resources to
> start learning.  I really needed this information on how JAWS retrieves
> unregistered objects.  That might be the missing piece in my strawman code!
> I also installed Visual Basic Express and Visual Studio C++ Express in an
> attempt to understand how objects work in Windows.  My intention is to
> perform the OCR using the Microsoft Office Document Imaging (MODI) COM
> object.
>
> Don Marang
>
> --------------------------------------------------
> From: "Jamal Mazrui" <empower@xxxxxxxxx>
> Sent: Friday, January 29, 2010 3:01 PM
> To: <JAWSScripts@xxxxxxxxxxxxx>
> Subject: [jawsscripts] OCR scripting project
>
>> Don and others,
>> I found another free OCR utility besides Tesseract.  The commercial
>> version of InftyReader is available at
>>
>> http://InftyReader.org
>>
>> The program specializes in performing OCR on scientific texts, including
>> formulas, etc.  The commercial version is an expensive GUI application.
>> The publisher made a free command-line version available a few years ago.
>> Although there is a link for this version on the web site, I found that
>> link to be broken, and Google searches did not find an alternative
>> download.
>>
>> Someone from the BlindMath list, however, has the free version, and he
>> emailed it to me.  I am making it available at
>>
>> http://EmpowermentZone.com/InftyReaderE244e.zip
>>
>> The documentation says it can use .tif, .gif, or .png as input.  I tried
>> doing so with output from the NirCmd utility that Don told us about.
>> Unfortunately, the .tif and .gif files were not recognized at all, and the
>> .png file eventually produced a lockup.   I also tried converting files to
>> a more acceptable input format using the free ImageMagick program.  No
>> luck there either.
>>
>> Still, I may have missed something that others can figure out, so I hope
>> this information is helpful for this or other projects that involve OCR.
>>
>> Jamal
>>
>>
>> __________
>> Visit and contribute to The JAWS Script Repository http://jawsscripts.com
>>
>> View the list's information and change your settings at
>> http://www.freelists.org/list/jawsscripts
>>
> __________
> Visit and contribute to The JAWS Script Repository http://jawsscripts.com
>
> View the list's information and change your settings at
> http://www.freelists.org/list/jawsscripts
>
>


-- 
Soronel Haetir
soronel.haetir@xxxxxxxxx
__________ 
Visit and contribute to The JAWS Script Repository http://jawsscripts.com

View the list's information and change your settings at 
http://www.freelists.org/list/jawsscripts

Other related posts: