Video captioning

dc.accession.number	T00866
dc.classification.ddc	621.367 LAH
dc.contributor.advisor	Mandal, Srimanta
dc.contributor.author	Laheri, Vishal Bharatkumar
dc.date.accessioned	2020-09-22T19:41:36Z
dc.date.accessioned	2025-06-28T10:28:22Z
dc.date.available	2023-02-16T19:41:36Z
dc.date.issued	2020
dc.degree	M. Tech
dc.description.abstract	In recent years, models for video captioning task has been improved very much. Despite advancement, it is still impeded by hardware constraints. Video captioning models takes a sequence of images and caption as inputs, which makes it one of the most memory consuming and computation required task. In this project work, we exploit the importance of required frames from the video to get the desired performance. We also propose the use of a video summarizing model embedded with the captioning model for dynamically selecting frames, which allows the reduction of required frames without losing Spatio-temporal information of the video.
dc.identifier.citation	Laheri, Vishal Bharatkumar (2020). Video captioning. Dhirubhai Ambani Institute of Information and Communication Technology. vi, 25 p. (Acc.No: T00866)
dc.identifier.uri	http://drsr.daiict.ac.in/handle/123456789/944
dc.student.id	201811037
dc.subject	Deep Learning
dc.subject	Computer Vision
dc.subject	LSTM
dc.subject	Video Description
dc.subject	Video Captioning
dc.title	Video captioning
dc.type	Dissertation

Files

Now showing 1 - 1 of 1

Now showing 1 - 1 of 1