Video Object Detection and Identification in Dynamic Environment

Shah, Mahir Manishbhai

Please use this identifier to cite or link to this item: http://drsr.daiict.ac.in//handle/123456789/1084

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Khare, Manish	-
dc.contributor.advisor	Kumar, Ahlad	-
dc.contributor.author	Shah, Mahir Manishbhai	-
dc.date.accessioned	2024-08-22T05:21:00Z	-
dc.date.available	2024-08-22T05:21:00Z	-
dc.date.issued	2022	-
dc.identifier.citation	Shah, Mahir Manishbhai (2022). Video Object Detection and Identification in Dynamic Environment. Dhirubhai Ambani Institute of Information and Communication Technology. viii, 41 p. (Acc. # T01004).	-
dc.identifier.uri	http://drsr.daiict.ac.in//handle/123456789/1084	-
dc.description.abstract	Object Detection and Identification in the field of computer vision is widely regarded as one of the most difficult problems in computer science. Yet it is one of the most rising topics in recent years due to the advancement of the computer hardware technologies like GPUs.The task of Object Detection and Identification can be further divided into two categories: 1. Object Detection and Identification in still images. 2. Object Detection and Identification in dynamic environments. Due to the advancements in computer hardware like GPUs, deep neural network based methods have shown great accuracy and most of the state-of-the-art methods for still images are based on deep neural networks. Extending these state-of-the-art object detectors for still images into dynamic environments is not easy as we see a drop in accuracy because of the deteriorated object appearances like rare poses, motion blurs, video defocus, and part or full occlusion. The reason for the decrease in accuracy is that still image detectors do not take into account the temporal information contained in videos when detecting the objects in dynamic environment like videos. To improve the accuracy of the state-of-the-art detectors in the dynamic environment like videos, different methods have been developed which takes into consideration temporal information present in videos.In this thesis work, we have tried to increase the accuracy of state-of-the-art object detectors by trying to use the knowledge of the previously trained model as a reference to another model. In our work, we have also tried to simplify the architecture when we combine two different models without incurring a loss in the accuracy. In our thesis work, the first model that we have trained is an pixel-level method and the second model that we have trained is an instancelevel method. We have tested our approach on the ImageNet VID dataset and YouTube-8M dataset. Results show that our approach has obtained improved results in instance-level object detection methods.	-
dc.publisher	Dhirubhai Ambani Institute of Information and Communication Technology	-
dc.subject	computer science	-
dc.subject	dynamic environments	-
dc.subject	ImageNet VID dataset	-
dc.classification.ddc	006.31 SHA	-
dc.title	Video Object Detection and Identification in Dynamic Environment	-
dc.type	Dissertation	-
dc.degree	M. Tech	-
dc.student.id	202011002	-
dc.accession.number	T01004	-
Appears in Collections:	M Tech Dissertations

Files in This Item:

File	Size	Format
202011002.pdf	1.45 MB	Adobe PDF	View/Open

Show simple item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets