Please use this identifier to cite or link to this item:
http://drsr.daiict.ac.in//handle/123456789/1018
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Khare, Manish | |
dc.contributor.advisor | Bhilare, Shruti | |
dc.contributor.author | Shah, Vidit | |
dc.date.accessioned | 2022-05-06T18:02:48Z | |
dc.date.available | 2023-02-24T18:02:48Z | |
dc.date.issued | 2021 | |
dc.identifier.citation | Shah, Vidit (2021). English Handwritten Word Recognition. Dhirubhai Ambani Institute of Information and Communication Technology. ix, 39 p. (Acc.No: T00953) | |
dc.identifier.uri | http://drsr.daiict.ac.in//handle/123456789/1018 | |
dc.description.abstract | Today, tons of data is being generated every day and this helps with the automation of several tasks. Automated recognition of handwritten words from images is one such challenging task. This can be done by extracting the important features out of an image. The major challenge for handwritten word recognition over optical word recognition is the inherent variation in the handwriting styles. To recognize such words there must be a model or a system. Thus, it is of utmost importance to build handwritten word recognition models with high accuracy. The model will face multiple challenges that need to be overcome to accurately predict the given word on its own. This model can be used in pharmaceuticals to convert the prescription or report images into scanned documents and store the relevant information from it. In this work, I will be building a deep-learningbased odel for the English Handwritten Dataset that can recognize the words from the images. Dataset used here is the IAM word dataset. This dataset is publicly available. CNN architecture helps to extract features from images. Features could be in the form of edges or blurred images. RNN helps to learn the model from the previous states and predict the output for the next state. This process is called sequential learning. Combining the strength of feature extraction from CNN and sequence learning from RNN i.e. C-RNN, I got 72.46% accuracy and 11.88% character error rate. Accuracy depends on the dataset used for training purposes. | |
dc.subject | Convolutional Neural Network | |
dc.subject | CTC loss function | |
dc.subject | Deep Learning | |
dc.subject | Neural Networks and Recurrent Neural Network | |
dc.classification.ddc | 006.32 SHA | |
dc.title | English Handwritten Word Recognition | |
dc.type | Dissertation | |
dc.degree | M. Tech | |
dc.student.id | 201911026 | |
dc.accession.number | T00953 | |
Appears in Collections: | M Tech Dissertations |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
201911026_Final Thesis - Manish Khare.pdf Restricted Access | 2.7 MB | Adobe PDF | View/Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.