10 OCR Softwares Open Source & Commercial with download URLs

List of OCR Softwares Open Source & Commercial

Open Source OCR Software:


Tesseract
Category: Open source
Claims to be the most accurate open source OCR engine available and was one of the top 3 engines in the 1995 UNLV Accuracy test..
Supports a wide variety of image formats to be read and convert to texts in over 60 languages.
Combined with the Leptonica Image Processing Library.  since 2006 it has been improved extensively by Google.
Platforms: Linux, Windows, MAC OSX.


JMagick
Category: Open source
Open Source Java Interface of ImageMagick.
Implemented in the form of Java Native Interface(JNI) into the ImageMagick API which is built as a thin layer into the ImageMagick API
Lot faster than image processing libraries written completely in Java
Licence: LPGL
Platforms : as a java library.


Cognitive OpenOCR (Cuneiform)
CuneiForm is a software tool for optical character recognition. It was originally developed at Cognitive Technologies and, after a few years with no development, released as freeware on December 12, 2007.
Algorithms used in CuneiForm come from the rules for writing letters, from their topology, and do not require pattern recognition learning. CuneiForm recognizes any print font (scanned from booksnewspapersmagazineslaser printer output, dot-matrix printer output, typewriter text, etc.)
Licence: BSD Licence


OCRopus
OCRopus™ is an OCR system written in Python, NumPy, and SciPy focusing on the use of large scale machine learning for addressing problems in document analysis.
The latest release features a new text line recognizer based on recurrent neural networks (and does not require language modeling), models for both Latin script and Fraktur, and some new tools for ground truth labeling
Platform: Linux


VietOCR
A Java/.NET GUI frontend for Tesseract OCR engine. Supports optical character recognition for Vietnamese and other languages supported by Tesseract.
Supports PDF, TIFF, JPEG, GIF, PNG, BMP image formats, Multi-page TIFF images, Screenshots, Selection box, File drag-and-drop, Paste image from clipboard, Postprocessing for Vietnamese to boost accuracy rate
Localized user interface, Integrated scanning support, Watch folder monitor for support of batch processing, Custom text replacement in postprocessing, Spellcheck with Hunspell,
Support for downloading and installing language data packs and appropriate spell dictionaries
Licence: Apache Licence, 2.0
Platform: All, as a java library.


GOCR
Developed under the GNU Public License
Converts scanned images of text back to text files
Can be used with different front-ends, which makes it very easy to port to different OSes and architectures. It can open many different image formats, and its quality have been improving in a daily basis.
Licence : GNU Public Licence
Platform : All


YAGF
 Is a graphical interface for cuneiform and tesseract text recognition tools
Can scan images via XSane, import pages from PDF documents, perform images preprocessing and recognize texts using cuneiform from a single command centre
YAGF also makes it easy to scan and recognize several images sequentially.
Platform : Linux
Licence : GNU GPL v3

WatchOCR
Free OCR server for PDFs
Uses cuneiform, and exactimage to create text searchable PDFs from image only PDFs and Tiffs. 
Also includes a barcode feature that allows files to be renamed and placed in a directory structure based on information included in a barcode.
Using the web interface, WatchOCR can be remotely configured to monitor a watched folder for newly scanned PDFs for OCR conversion.
Platform : Linux
Licence : GNU GPL


SimpleOCR
Freeware
Royalty free OCR SDK for developers.
Licence : Free

Commercial OCR Softwares

TypeReader
Category: Commercial
High Speed: According to third party tests, the speed of ExperVision®’s OpenRTK® is four (4) to eight (8) times faster than competition.
Converts scanned documents into electronic files at speed of 8,000 pages per hour with maximum reliability
 Desktop 7.0 offers added flexibility to handle color and grayscale images, with duplex scanning support to process documents in English, French, German, Italian, Portuguese, Spanish, Dutch, Danish, Swedish, Norwegian, Finnish, Polish, Hungarian and Polynesian.
Unparalleled recognition technology to support 2618 fonts.
Users can choose to output various formats
Licence: Commercial

Edocfile
Category: Commercial
Uses Optical Character Recognition on the entire document and then parses the data contents, allowing the user to easily capture data from multi-page documents and documents of various lengths such as sales receipts.
The parsing engine can extract information based on its location to other items in the file and it also supports Regular Expressions and EasyPatterns.
capability to monitor an unlimited number of file folders that contain different document types to be processed, making it ideal for use with a copier that has a scan to file option.
Licence: Commercial
Platform : Windows

ABBYY FineReader
Converts Scans and PDFs to editable text.
Extract text from document photos
Create Searchable PDFs for archiving
Converts mulitilingual document images to texts
Retain the original structure of multipage documents.
Supports wide range of formats.
No retyping and reformatting
Digitizing Historic Texts with Fraktur OCR.
Export to Google Docs, Evernote and Dropbo
Platform: Windows, Mac


Nuance Omnipage
Turn high volumes of paper and digital documents into files you can edit, search and share in the format of your choice
Integrated PDF toolkit including searchable PDF and patented PDF-MRC
The most accurate conversion in 123 languages
Superior formatting control
Complete recognition of text, tables, graphics and images
Platform: Windows, Mac, Linux, Mobile etc
ReadIRIS
Powerful OCR software designed to convert all your paper documents, images or PDF into editable and searchable digital text (Word, Excel, PDF…) in just a click
Converts documents to PDF, Export your documents to the cloud
has acceptable performance, the software produces less accurate results and is less user-friendly than OmniPage Standard or Presto! OCR


Aspire
Embeded with a high performance OCR (optical character recognition) engine, Asprise OCR SDK library for Java, VB.NET, CSharp.NET, VC++, VB6.0, C, C++, Delphi on Windows, Mac, Linux and Solaris, enables you to equip your applications with OCR ability easily.
Highest Level of Accuracy - Asprise OCR can easily recognize difficult documents of poor image quality;
Excellent Format Retention - Text layouts on the input documents are preserved;
High Speed - Asprise OCR uses optimized OCR engine to perform excellent recognition in very short time;
Ease of Use - We strive to make the developer's life easier. Complex parameter configurations are removed from Asprise OCR SDK. You only have to supply the image document. Asprise OCR can intelligently determine the best setting internally.
Barcode Recognition - Beside characters (letters and numbers), Asprise OCR can recognize almost every kind of bar code. You can choose to recognize barcode or characters or both.
Flexible Licensing Scheme - You can purchase binary APIs and/or source code - the Lowest OCR library ownership cost!

Presto! OCR
Excellent OCR software package that produces more accurate optical character recognition results than Readiris Pro. However, it offers fewer output application options and understands fewer languages than OmniPage Standard.
Recognize Any Document, Article, or Letter with Ease
Send Recognized Text Directly to MS Word, Excel etc. Unparalleled Recognition Accuracy, 40 languages support, Re-Creates Digital Content from Paper Documents, Batch Processing, Color OCR, Training Engine, OCR on Dark Background, Built-in text editor, Automatically Detects Page Orientation
Platform: Windows

Share this

Related Posts

Previous
Next Post »

1 comments:

comments