Video captioning

Please use this identifier to cite or link to this item: http://drsr.daiict.ac.in//handle/123456789/944

Title:	Video captioning
Authors:	Mandal, Srimanta; Laheri, Vishal Bharatkumar
Keywords:	Deep Learning; Computer Vision; LSTM; Video Description; Video Captioning
Issue Date:	2020
Citation:	Laheri, Vishal Bharatkumar (2020). Video captioning. Dhirubhai Ambani Institute of Information and Communication Technology. vi, 25 p. (Acc.No: T00866)
Abstract:	In recent years, models for the video captioning task have improved considerably. Despite these advances, the task is still impeded by hardware constraints. Video captioning models take a sequence of images and a caption as inputs, which makes video captioning one of the most memory- and computation-intensive tasks. In this project work, we exploit the importance of individual frames, using only the frames required from the video to reach the desired performance. We also propose embedding a video summarization model in the captioning model to select frames dynamically, which reduces the number of frames required without losing the spatio-temporal information of the video.
URI:	http://drsr.daiict.ac.in//handle/123456789/944
Appears in Collections:	M Tech Dissertations

Files in This Item:

File	Description	Size	Format
201811037.pdf Restricted Access		4.16 MB	Adobe PDF
