Google reveals OCR tech for scanned docs

"Small but important step"


31 October 2008 16:57 GMT / By Amy-Mae Elliott

Google has worked out a way to perform OCR (Optical Character Recognition) on any scanned documents that are stored in Adobe's PDF format.

The OCR technology allows for the picture of text (as you get with a scan) to be converted into words that can be searched and indexed.

In the past, scanned documents were rarely included in search results as Google says it could not be sure of their content.

Google says that every day, people all over the world post scanned documents online - everything from official government reports to obscure academic papers.

The company says: "this is a small but important step forward in our mission of making all the world's information accessible and useful".
Related
Full tags
Software, Websites, Google, OCR, Search engines

share print story pdf email story

Recommended articles

Recommended articles from around the web

Loading

Best iPad 2 apps

We detail the best iPad 2 and iPad apps in the app store Which iPad app should you download?

Best new iPad apps

We detail the best iPad apps in the app store for your new Retina Display Which iPad app should you download?

Windows 8

First Look: Windows 8 Consumer Preview reviewed

The new iPad

The new iPad: Everything you need to know

Pocket-lint poll

Q. Does the Samsung Galaxy S III deliver what you hoped for?

Vote YES Vote NO

» LAST TIME
When asked Would you switch from iOS to Android? 54% said yes and 46% said no