Please use this identifier to cite or link to this item:
http://drsr.daiict.ac.in//handle/123456789/1111
Title: | SDINet Scheme for Generalized Text Detection in Scene and Document Images |
Authors: | Joshi, Manjunath V. Bhillare, Shruti Pal, Pravir |
Keywords: | Optical character recognition Scene or Document Image Network Weighted Loss |
Issue Date: | 2022 |
Publisher: | Dhirubhai Ambani Institute of Information and Communication Technology |
Citation: | Pal, Pravir (2022). SDINet Scheme for Generalized Text Detection in Scene and Document Images. Dhirubhai Ambani Institute of Information and Communication Technology. ix, 42 p. (Acc. # T01031). |
Abstract: | Text Detection is an essential intermediate step in optical character recognition (OCR). OCR applied in scene text images is helpful for applications such as traffic signs and vehicle number plate recognition. OCR applied in the document text images help digitise and analyse the documents. Hence, a robust text detection system is needed to detect the text exceptionally well given an arbitrary text image. In this work, we address text detection in images using our Scene or Document Image Network (SDINet). During the training of the model, a Weighted Loss (WL) is designed to better update the training parameters according to the input image type. A classification model is designed that helps us to find the WL By classifying an input image as a scene text type image or document text type image. The novelty of our approach is in the fact that the training parameters of the model are updated according to the input image type. Our approach shows comparative results in all the evaluation parameters for scene text and document text datasets. Specifically, when compared to PSENet, experimental results show that our SDINet approach improves the recall by more than 1%, and F-score is increased by approximately 1% for SCUT-CTW 1500 dataset. [30]. |
URI: | http://drsr.daiict.ac.in//handle/123456789/1111 |
Appears in Collections: | M Tech Dissertations |
Files in This Item:
File | Size | Format | |
---|---|---|---|
202011044.pdf | 2.98 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.