Please use this identifier to cite or link to this item: http://drsr.daiict.ac.in//handle/123456789/944
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorMandal, Srimanta
dc.contributor.authorLaheri, Vishal Bharatkumar
dc.date.accessioned2020-09-22T19:41:36Z
dc.date.available2023-02-16T19:41:36Z
dc.date.issued2020
dc.identifier.citationLaheri, Vishal Bharatkumar (2020). Video captioning. Dhirubhai Ambani Institute of Information and Communication Technology. vi, 25 p. (Acc.No: T00866)
dc.identifier.urihttp://drsr.daiict.ac.in//handle/123456789/944
dc.description.abstractIn recent years, models for video captioning task has been improved very much. Despite advancement, it is still impeded by hardware constraints. Video captioning models takes a sequence of images and caption as inputs, which makes it one of the most memory consuming and computation required task. In this project work, we exploit the importance of required frames from the video to get the desired performance. We also propose the use of a video summarizing model embedded with the captioning model for dynamically selecting frames, which allows the reduction of required frames without losing Spatio-temporal information of the video.
dc.subjectDeep Learning
dc.subjectComputer Vision
dc.subjectLSTM
dc.subjectVideo Description
dc.subjectVideo Captioning
dc.classification.ddc621.367 LAH
dc.titleVideo captioning
dc.typeDissertation
dc.degreeM. Tech
dc.student.id201811037
dc.accession.numberT00866
Appears in Collections:M Tech Dissertations

Files in This Item:
File Description SizeFormat 
201811037.pdf
  Restricted Access
4.16 MBAdobe PDFView/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.