Optical character recognition (OCR) feature extraction and classification

Prajapati, Pratik Kamlesh

dc.contributor.advisor	Joshi, Manjunath V.
dc.contributor.author	Prajapati, Pratik Kamlesh
dc.date.accessioned	2020-09-14T05:57:49Z
dc.date.available	2020-09-14T05:57:49Z
dc.date.issued	2019
dc.identifier.citation	Prajapati, Pratik Kamlesh (2019). Optical character recognition (OCR) feature extraction and classification. Dhirubhai Ambani Institute of Information and Communication Technology, 47p. (Acc.No: T00763)
dc.identifier.uri	http://drsr.daiict.ac.in//handle/123456789/828
dc.description.abstract	Optical character recognition (OCR) [6] is a process of digitizing an image or document containing text. In the OCR system, we do the classification of optical patterns contained in a digital image corresponding to alphanumeric and special characters. The various important intermediate steps involved in character recognition are pre-processing, segmentation, feature extraction and classification/recognition. In the past, a lot of research has been performed to compare the performance of various OCR approaches such as Support Vector Machine (SVM) [2], Hidden Markov Model (HMM) [7], Feed Forward Neural Networks [8] and Convolutional Neural Networks [9] and even Transfer Learning [3]. We have proposed to use Capsule Network [5] to improve the Optical Character Recognition performance. For this thesis, we are taking up this problem to make it more robust for various type of documents and fonts. Also, we want to overcome erroneous predictions in case of incorrect segmentation of characters. This retains most of the important information in the document which can be used later for various pipeline processes. Our approach makes the manual correction of OCR-ed output as less as possible. The complete numeric value is of more importance and even a single error in the character (digit) will ask for the manual editors to type the complete numeric value again, so predicting the complete block of the numeric value ism very important for us. Keywords: Optical Character Recognition, Pre-processing, Segmentation, Feature Extraction
dc.publisher	Dhirubhai Ambani Institute of Information and Communication Technology
dc.subject	Optical character recognition
dc.subject	pre-processing
dc.subject	segmentation
dc.subject	feature extraction
dc.classification.ddc	006.424 PRA
dc.title	Optical character recognition (OCR) feature extraction and classification
dc.type	Dissertation
dc.degree	M.Tech
dc.student.id	201711004
dc.accession.number	T00763

Files in this item

Name:: 201711004.pdf
Size:: 13.69Mb
Format:: PDF
Description:: Dissertation

View/Open

This item appears in the following Collection(s)

M Tech Dissertations [923]

Show simple item record