Please use this identifier to cite or link to this item: http://drsr.daiict.ac.in//handle/123456789/1111
Title: SDINet Scheme for Generalized Text Detection in Scene and Document Images
Authors: Joshi, Manjunath V.
Bhillare, Shruti
Pal, Pravir
Keywords: Optical character recognition
Scene or Document Image Network
Weighted Loss
Issue Date: 2022
Publisher: Dhirubhai Ambani Institute of Information and Communication Technology
Citation: Pal, Pravir (2022). SDINet Scheme for Generalized Text Detection in Scene and Document Images. Dhirubhai Ambani Institute of Information and Communication Technology. ix, 42 p. (Acc. # T01031).
Abstract: Text Detection is an essential intermediate step in optical character recognition (OCR). OCR applied in scene text images is helpful for applications such as traffic signs and vehicle number plate recognition. OCR applied in the document text images help digitise and analyse the documents. Hence, a robust text detection system is needed to detect the text exceptionally well given an arbitrary text image. In this work, we address text detection in images using our Scene or Document Image Network (SDINet). During the training of the model, a Weighted Loss (WL) is designed to better update the training parameters according to the input image type. A classification model is designed that helps us to find the WL By classifying an input image as a scene text type image or document text type image. The novelty of our approach is in the fact that the training parameters of the model are updated according to the input image type. Our approach shows comparative results in all the evaluation parameters for scene text and document text datasets. Specifically, when compared to PSENet, experimental results show that our SDINet approach improves the recall by more than 1%, and F-score is increased by approximately 1% for SCUT-CTW 1500 dataset. [30].
URI: http://drsr.daiict.ac.in//handle/123456789/1111
Appears in Collections:M Tech Dissertations

Files in This Item:
File SizeFormat 
202011044.pdf2.98 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.