Please use this identifier to cite or link to this item: http://drsr.daiict.ac.in//handle/123456789/1084
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorKhare, Manish-
dc.contributor.advisorKumar, Ahlad-
dc.contributor.authorShah, Mahir Manishbhai-
dc.date.accessioned2024-08-22T05:21:00Z-
dc.date.available2024-08-22T05:21:00Z-
dc.date.issued2022-
dc.identifier.citationShah, Mahir Manishbhai (2022). Video Object Detection and Identification in Dynamic Environment. Dhirubhai Ambani Institute of Information and Communication Technology. viii, 41 p. (Acc. # T01004).-
dc.identifier.urihttp://drsr.daiict.ac.in//handle/123456789/1084-
dc.description.abstractObject Detection and Identification in the field of computer vision is widely regarded as one of the most difficult problems in computer science. Yet it is one of the most rising topics in recent years due to the advancement of the computer hardware technologies like GPUs.The task of Object Detection and Identification can be further divided into two categories: 1. Object Detection and Identification in still images. 2. Object Detection and Identification in dynamic environments. Due to the advancements in computer hardware like GPUs, deep neural network based methods have shown great accuracy and most of the state-of-the-art methods for still images are based on deep neural networks. Extending these state-of-the-art object detectors for still images into dynamic environments is not easy as we see a drop in accuracy because of the deteriorated object appearances like rare poses, motion blurs, video defocus, and part or full occlusion. The reason for the decrease in accuracy is that still image detectors do not take into account the temporal information contained in videos when detecting the objects in dynamic environment like videos. To improve the accuracy of the state-of-the-art detectors in the dynamic environment like videos, different methods have been developed which takes into consideration temporal information present in videos.In this thesis work, we have tried to increase the accuracy of state-of-the-art object detectors by trying to use the knowledge of the previously trained model as a reference to another model. In our work, we have also tried to simplify the architecture when we combine two different models without incurring a loss in the accuracy. In our thesis work, the first model that we have trained is an pixel-level method and the second model that we have trained is an instancelevel method. We have tested our approach on the ImageNet VID dataset and YouTube-8M dataset. Results show that our approach has obtained improved results in instance-level object detection methods.-
dc.publisherDhirubhai Ambani Institute of Information and Communication Technology-
dc.subjectcomputer science-
dc.subjectdynamic environments-
dc.subjectImageNet VID dataset-
dc.classification.ddc006.31 SHA-
dc.titleVideo Object Detection and Identification in Dynamic Environment-
dc.typeDissertation-
dc.degreeM. Tech-
dc.student.id202011002-
dc.accession.numberT01004-
Appears in Collections:M Tech Dissertations

Files in This Item:
File SizeFormat 
202011002.pdf1.45 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.