Improving ocraccuracy for scanned historical newspapers

Naiker, Nithyananthan (2013) Improving ocraccuracy for scanned historical newspapers. [Final Year Project Report] (Unpublished)

[img] PDF (Please get the password by email to repository@unimas.my , or call ext: 3914 / 3942 / 3933)
Naiker Nithyananthan ft.pdf
Restricted to Registered users only

Download (12MB)

Abstract

OCR is part of the computer vision field and the use of it has grown rapidly in recent decade due to the incessant demand in document digitization. Different techniques are used by OCR tools to process varieties of input formats(.pdf, doc, .jpeg, etc.). However, from our point of view, no research has been done in applying the chain code technique on historical documents stored in image format. In this project, one variant of the chain code algorithm known as Compare images algorithm is presented when it has been tuned to process some samples of Sarawak Gazette. Experimental results show relatively high accuracy improvement (approximately 6.90%). Future works will focus on testing the algorithm to other historical documents.

Item Type: Final Year Project Report
Additional Information: Project Report (B.Sc.) -- Universiti Malaysia Sarawak, 2013
Uncontrolled Keywords: document digitization, image format, Sarawak Gazette
Subjects: N Fine Arts > N Visual arts (General) For photography, see TR
Q Science > Q Science (General)
Divisions: Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology
Faculties, Institutes, Centres > Faculty of Computer Science and Information Technology
Academic Faculties, Institutes and Centres > Faculty of Computer Science and Information Technology
Depositing User: Dan
Date Deposited: 02 Aug 2022 04:01
Last Modified: 14 Nov 2023 07:57
URI: http://ir.unimas.my/id/eprint/39020

Actions (For repository members only: login required)

View Item View Item