Google Docs OCR Demonstration

One cannot really deny the fact that Google is constantly working on new features for their popular online services like Gmail or Google Docs. The latest feature is currently available as a demonstration only and not yet integrated into Google Docs. The Google Docs OCR demonstration can OCR the three image formats jpg, png and gif. Google lists the following limitations that are currently in place:

Files must be fairly high-resolution -- rule of thumb is 10 pixel character height.

Maximum file size: 10MB, maximum resolution: 25 mega pixel

The larger the file, the longer the OCR operation will take (500K: ~15s, 2MB: ~40s, 10MB: forever)

Supported image formats that are uploaded on the demonstration page will be turned into text documents and displayed in Google Docs once the process has been completed. The quality depends largely on the quality of the image. It is usually necessary to look over the text and correct errors that have been made during character recognition. Google Docs helps in the error correction by underlining unknown words in red in its interface. It still takes some time to correct the errors.

The OCR demonstration is linked to a Google Docs account but not integrated into Google Docs yet. It is very likely that Google will integrated OCR capabilities to Google Docs in the near future. You can use the demonstration page for now to test the OCR service.

Update: Google has shut down the test server, so that the demonstration page is no longer available. The OCR feature has however been implemented into Google Docs, at least when it comes to pdf or image file formats that you upload to Google Docs.

The support of image file formats basically lets you use the service with any type of document format, as you could make a screenshot and upload the image file to use Google Docs' OCR feature.

Advertisement

We need your help

Advertising revenue is falling fast across the Internet, and independently-run sites like Ghacks are hit hardest by it. The advertising model in its current form is coming to an end, and we have to find other ways to continue operating this site.

We are committed to keeping our content free and independent, which means no paywalls, no sponsored posts, no annoying ad formats or subscription fees.

If you like our content, and would like to help, please consider making a contribution:

About Martin Brinkmann

Martin Brinkmann is a journalist from Germany who founded Ghacks Technology News Back in 2005. He is passionate about all things tech and knows the Internet and computers like the back of his hand.You can follow Martin on Facebook, Twitter or Google+

About gHacks

Ghacks is a technology news blog that was founded in 2005 by Martin Brinkmann. It has since then become one of the most popular tech news sites on the Internet with five authors and regular contributions from freelance writers.