Show simple item record

dc.contributor.advisorKhare, Manish
dc.contributor.advisorKumar, Ahlad
dc.contributor.authorShah, Mahir Manishbhai
dc.date.accessioned2024-08-22T05:21:00Z
dc.date.available2024-08-22T05:21:00Z
dc.date.issued2022
dc.identifier.citationShah, Mahir Manishbhai (2022). Video Object Detection and Identification in Dynamic Environment. Dhirubhai Ambani Institute of Information and Communication Technology. viii, 41 p. (Acc. # T01004).
dc.identifier.urihttp://drsr.daiict.ac.in//handle/123456789/1084
dc.description.abstractObject Detection and Identification in the field of computer vision is widely regarded as one of the most difficult problems in computer science. Yet it is one of the most rising topics in recent years due to the advancement of the computer hardware technologies like GPUs.The task of Object Detection and Identification can be further divided into two categories: 1. Object Detection and Identification in still images. 2. Object Detection and Identification in dynamic environments. Due to the advancements in computer hardware like GPUs, deep neural network based methods have shown great accuracy and most of the state-of-the-art methods for still images are based on deep neural networks. Extending these state-of-the-art object detectors for still images into dynamic environments is not easy as we see a drop in accuracy because of the deteriorated object appearances like rare poses, motion blurs, video defocus, and part or full occlusion. The reason for the decrease in accuracy is that still image detectors do not take into account the temporal information contained in videos when detecting the objects in dynamic environment like videos. To improve the accuracy of the state-of-the-art detectors in the dynamic environment like videos, different methods have been developed which takes into consideration temporal information present in videos.In this thesis work, we have tried to increase the accuracy of state-of-the-art object detectors by trying to use the knowledge of the previously trained model as a reference to another model. In our work, we have also tried to simplify the architecture when we combine two different models without incurring a loss in the accuracy. In our thesis work, the first model that we have trained is an pixel-level method and the second model that we have trained is an instancelevel method. We have tested our approach on the ImageNet VID dataset and YouTube-8M dataset. Results show that our approach has obtained improved results in instance-level object detection methods.
dc.publisherDhirubhai Ambani Institute of Information and Communication Technology
dc.subjectcomputer science
dc.subjectdynamic environments
dc.subjectImageNet VID dataset
dc.classification.ddc006.31 SHA
dc.titleVideo Object Detection and Identification in Dynamic Environment
dc.typeDissertation
dc.degreeM. Tech
dc.student.id202011002
dc.accession.numberT01004


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record