OCRFeeder version 0.7.1a released

This version includes some work done by Emergya as part of the GuadaLinfo Accessible project, such as:
* Importing from a scanner device.
* Copying text from the content boxes to the clipboard.
* Correcting mistakes in the text recognized by the OCR engines using the typical spell-checker dialog.

Other highlights include:

* Rewritten ocrfeeder-cli (which also introduces a help method now)
* Added automatic detection of the Cuneiform OCR engine
* Moved the OCRFeeder modules to their own folder (so they are better organized and don’t conflict with other modules when installed)

15 thoughts on “OCRFeeder version 0.7.1a released”

I think automated tools will mistake 0.7.1a for an earlier release than 0.7.1. A better version number would be something like 0.7.2 or 0.7.1.1.

For instance, I’m using Mozilla Firefox 4.0b8 as my primary web browser, which is the 8th beta of what will eventually be released as Firefox 4.0, which will later receive an update as 4.0.1 or whatever.
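To illustrate the ordering concern, here is a minimal Python sketch. It assumes a tool that treats a trailing letter suffix as a pre-release marker; the `parse_version` and `sort_key` helpers are hypothetical and not taken from any particular packaging tool.

```python
import re


def parse_version(text):
    """Split '0.7.1a' into ((0, 7, 1), 'a'); a plain '0.7.1' gets an empty suffix."""
    match = re.fullmatch(r"(\d+(?:\.\d+)*)([a-z]+\d*)?", text)
    if match is None:
        raise ValueError(f"unsupported version string: {text!r}")
    numbers = tuple(int(part) for part in match.group(1).split("."))
    suffix = match.group(2) or ""
    return numbers, suffix


def sort_key(text):
    """Sort key under the assumption that a letter suffix means 'pre-release'."""
    numbers, suffix = parse_version(text)
    if suffix:
        # Pre-release: rank 0, so it sorts before the bare version number.
        return (numbers, (0, suffix))
    # Final release: rank 1.
    return (numbers, (1, ""))


ordered = sorted(["0.7.1", "0.7.1a", "0.7.2", "0.7.1.1"], key=sort_key)
print(ordered)
```

Under that assumption, 0.7.1a lands before 0.7.1, while 0.7.1.1 and 0.7.2 both land after it, which is why those two were suggested.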

One last thing: please consider using your real name and, optionally, your email, because staying anonymous while making such a wrong accusation would only be worth it if I couldn’t figure out who you are from 1) the bug you mention and 2) your company, which I could easily get from your IP address…

The scrollbars are needed for now. I have been focusing more on features and bug fixing and less on the UI. Still, I don’t think it is difficult to use. Also notice that the window was not maximized (so it would look better in the screenshot).

The current version of OCRFeeder (0.7.1) does not allow choosing a recognition language. I needed to add some options to the command line in the OCR engines settings window to recognize non-English text.
It would be great to have some GUI to select the main recognition language. It would be even better to be able to set a language for every area (most OCR engines, like Tesseract and Cuneiform, currently cannot recognize multilingual documents).
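As a rough sketch of what such an option amounts to for Tesseract: its command line accepts `-l` followed by a language code. The `build_tesseract_command` helper and the per-area mapping below are hypothetical illustrations, not part of OCRFeeder, and the file names are made up.

```python
def build_tesseract_command(image_path, output_base, language=None):
    """Return the argument list for a Tesseract run, optionally with a language."""
    command = ["tesseract", image_path, output_base]
    if language:
        # -l selects the recognition language, e.g. "deu" for German.
        command += ["-l", language]
    return command


# Hypothetical per-area setup: each cropped area image gets its own language.
area_languages = {"area1.png": "eng", "area2.png": "rus"}
commands = [
    build_tesseract_command(image, image.rsplit(".", 1)[0], language)
    for image, language in area_languages.items()
]
for command in commands:
    print(" ".join(command))
```

Each list could be passed to `subprocess.run` on a system where Tesseract and the corresponding language data are installed.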

I try to make it easy to use OCR engines without losing the flexibility of running them from the command line. Still, maybe I could create a more high-level way of configuring the best-known OCR engines, such as Tesseract and Cuneiform. Let’s see if I can do it in the future.