List of OCR Softwares Open Source & Commercial
Open Source OCR Software:
Tesseract
Category: Open source
Claims to be the most
accurate open source OCR engine available and was one of the top 3 engines in
the 1995 UNLV Accuracy test..
Supports a wide variety of
image formats to be read and convert to texts in over 60 languages.
Platforms: Linux,
Windows, MAC OSX.
JMagick
Category: Open source
Open Source Java Interface of ImageMagick.
Implemented in the form of Java Native Interface(JNI) into
the ImageMagick API which is built as a thin layer into the ImageMagick API
Lot faster than image
processing libraries written completely in Java
Licence: LPGL
Platforms : as a java
library.
Cognitive OpenOCR
(Cuneiform)
Licence: BSD
Licence
OCRopus
OCRopus™ is an OCR system written in Python, NumPy, and
SciPy focusing on the use of large scale machine learning for addressing
problems in document analysis.
The latest release features a new text line recognizer based
on recurrent neural networks (and does not require language modeling), models
for both Latin script and Fraktur, and some new tools for ground truth labeling
Platform:
Linux
VietOCR
A Java/.NET GUI frontend for Tesseract OCR engine. Supports
optical character recognition for Vietnamese and other languages supported by
Tesseract.
Supports PDF, TIFF, JPEG, GIF, PNG, BMP image formats, Multi-page
TIFF images, Screenshots, Selection box, File drag-and-drop, Paste image from
clipboard, Postprocessing for Vietnamese to boost accuracy rate
Localized user interface, Integrated scanning support, Watch
folder monitor for support of batch processing, Custom text replacement in
postprocessing, Spellcheck with Hunspell,
Support for downloading and installing language data packs
and appropriate spell dictionaries
Licence: Apache Licence, 2.0
Platform: All, as a java library.
GOCR
Developed under the GNU Public License
Converts scanned images of text back to text files
Can be used with different front-ends, which makes it very
easy to port to different OSes and architectures. It can open many
different image formats, and its quality have been improving in a daily basis.
Licence : GNU
Public Licence
Platform : All
YAGF
Is a graphical interface for cuneiform and tesseract
text recognition tools
Can scan images via XSane, import pages from PDF documents,
perform images preprocessing and recognize texts using cuneiform from a single
command centre
YAGF also makes it easy to scan and recognize several images
sequentially.
Platform : Linux
Licence : GNU GPL v3
WatchOCR
Free OCR server for PDFs
Uses cuneiform, and exactimage to create text searchable
PDFs from image only PDFs and Tiffs.
Also includes a barcode feature that allows files to be
renamed and placed in a directory structure based on information included in a
barcode.
Using the web interface, WatchOCR can be remotely configured
to monitor a watched folder for newly scanned PDFs for OCR conversion.
Platform : Linux
Licence : GNU GPL
SimpleOCR
Freeware
Royalty free OCR SDK for developers.
Licence : Free
Commercial OCR Softwares
TypeReader
Category: Commercial
High Speed:
According to third party tests, the speed of ExperVision®’s OpenRTK® is four
(4) to eight (8) times faster than competition.
Converts scanned documents into electronic files at speed of
8,000 pages per hour with maximum reliability
Desktop 7.0 offers added flexibility to handle color
and grayscale images, with duplex scanning support to process documents in
English, French, German, Italian, Portuguese, Spanish, Dutch, Danish, Swedish,
Norwegian, Finnish, Polish, Hungarian and Polynesian.
Unparalleled recognition technology to support 2618 fonts.
Users can choose to output various formats
Licence: Commercial
Edocfile
Category: Commercial
Uses Optical Character Recognition on the entire document
and then parses the data contents, allowing the user to easily capture data
from multi-page documents and documents of various lengths such as sales
receipts.
The parsing engine can extract information based on its
location to other items in the file and it also supports Regular Expressions
and
EasyPatterns.
capability to monitor an unlimited number of file folders
that contain different document types to be processed, making it ideal for use
with a copier that has a scan to file option.
Licence:
Commercial
Platform :
Windows
ABBYY FineReader
Converts Scans and PDFs to editable text.
Extract text from document photos
Create Searchable PDFs for archiving
Converts mulitilingual document images to texts
Retain the original structure of multipage documents.
Supports wide range of formats.
No retyping and reformatting
Digitizing Historic Texts with Fraktur OCR.
Export to Google Docs, Evernote and Dropbo
Platform:
Windows, Mac
Nuance Omnipage
Turn high volumes of paper and digital documents into files
you can edit, search and share in the format of your choice
Integrated PDF toolkit including searchable PDF and patented
PDF-MRC
The most accurate conversion in 123 languages
Superior formatting control
Complete recognition of text, tables, graphics and images
Platform:
Windows, Mac, Linux, Mobile etc
ReadIRIS
Powerful OCR software designed to convert all your paper
documents, images or PDF into editable and searchable digital text (Word,
Excel, PDF…) in just a click
Converts documents to PDF, Export your documents to
the cloud
has acceptable performance, the software produces less
accurate results and is less user-friendly than OmniPage Standard or Presto! OCR
Aspire
Embeded with a high performance OCR (optical character
recognition) engine, Asprise OCR SDK library for Java, VB.NET, CSharp.NET,
VC++, VB6.0, C, C++, Delphi on Windows, Mac, Linux and Solaris, enables you to
equip your applications with OCR ability easily.
Highest Level of Accuracy - Asprise OCR can easily
recognize difficult documents of poor image quality;
Excellent Format Retention - Text layouts on the input
documents are preserved;
High Speed - Asprise OCR uses optimized OCR engine to
perform excellent recognition in very short time;
Ease of Use - We strive to make the developer's life
easier. Complex parameter configurations are removed from Asprise OCR SDK. You
only have to supply the image document. Asprise OCR can intelligently determine
the best setting internally.
Barcode Recognition - Beside characters (letters and
numbers), Asprise OCR can recognize almost every kind of bar code. You can
choose to recognize barcode or characters or both.
Flexible Licensing Scheme - You can purchase binary
APIs and/or source code - the Lowest OCR library ownership cost!
Presto! OCR
Excellent
OCR software package
that produces more accurate optical character recognition results than Readiris
Pro. However, it offers fewer output application options and understands fewer
languages than OmniPage Standard.
Recognize Any
Document, Article, or Letter with Ease
Send Recognized Text
Directly to MS Word, Excel etc. Unparalleled Recognition Accuracy, 40
languages support, Re-Creates Digital
Content from Paper Documents, Batch
Processing, Color OCR, Training Engine, OCR on Dark Background, Built-in text
editor, Automatically Detects Page Orientation
Platform:
Windows