Please use this identifier to cite or link to this item:
https://repository.sustech.edu/handle/123456789/7391
Title: | Optical Character Recognition Software Evaluation |
Other Titles: | تقييم الضوئي على الحروف الاعتراف البرمجيات |
Authors: | Supervisor - Amir Mohamed Talib Modawi, Omer Mohammed Ali |
Keywords: | Optical Character Recognition Software Evaluation Computer Science Pattern Recognition) |
Issue Date: | 1-May-2005 |
Publisher: | Sudan University of Science & Technology |
Citation: | Modawi,Omer Mohammed Ali. Optical Character Recognition Software Evaluation/Omer Mohammed Ali Modawi;Mohammed ElHafiz Mustfa.-Khartoum:Sudan University of Science and Technology,College of Computer Science and Information Technology,2005.-184p.:ill.;28cm.-M.sc |
Abstract: | This study aimed to measure the accuracy rate of Optical Character Recognition (OCR) packages for ensuring the best software for any scanned pages. Studying the overall concepts behind software evaluation, since OCR packages consider as one of the most useful tools for data entry, building archive and publishing e-article. The researcher tries to find answers for the following important questions: what are the best OCR software for specific printed page quality, which one can recognize tables, charts and other graphical items, which one can recognize colored pages. The thesis reviews Pattern Recognition (PR), PR application, OCR, OCR recognition methods and applications. The thesis then provides a description of evaluation method used for evaluating OCR Packages. The thesis concentrated on printed English and Arabic characters. Most of English packages are available for free or free trial version. Therefore, a lot of work was done to improve their accuracy rate. Most English packages included in this study achieves accuracy rate more than 97%. Among the best are Ms Docscan, OmniPage, Cuneiform, Finreader and SimpleOCR were performed well. For many reasons, the number of Arabic OCR packages is very small and most of them are expensive. The thesis evaluates one Arabic Package, which (Readiris8). All English packages achieved character accuracy rate more than 97% except TextBridge which achieves only 76.47% and DocScanPro 86.83%. MsDocScan is the best package for all document |
Description: | Thesis |
URI: | http://repository.sustech.edu/handle/123456789/7391 |
Appears in Collections: | Masters Dissertations : Computer Science and Information Technology |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Optical Charactr Recognition ....pdf Restricted Access | Research | 5.3 MB | Adobe PDF | View/Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.