Please use this identifier to cite or link to this item: http://drsr.daiict.ac.in//handle/123456789/570
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorPatil, Hemant A.
dc.contributor.authorRajpal, Avni
dc.date.accessioned2017-06-10T14:43:27Z-
dc.date.available2017-06-10T14:43:27Z-
dc.date.issued2015
dc.identifier.citationRajpal, Avni (2015). Acoustic-to-articulatory inversion: speech quality assessment and smoothness constraint. Dhirubhai Ambani Institute of Information and Communication Technology, xii, 73 p. (Acc.No: T00533)
dc.identifier.urihttp://drsr.daiict.ac.in/handle/123456789/570-
dc.description.abstractThe ability of humans to speak effortlessly, require coordinated movements of various articulators, muscles, etc. This effortless movement contributes towards naturalness, intelligibility and speaker identity in human speech, which is only partially present in speech, obtained from most of voice conversion (VC) systems. Hence, during voice conversion, the information related to speech production is lost. For quantification of the loss in information two quantities, i.e., mutual information (I) and estimation error were calculated. In this thesis, the differences in the estimated articulator trajectories are exploited to propose articulatory features based objective measure for assessing the quality of voice conversion. Moreover, a new smoothness criterion, i.e., jerk minimization is explored to deal non-uniqueness of speech inversion mapping. Speech is the result of coordinated movements of the articulators such as lips, tongue, jaw, velum, etc. Therefore, measured trajectories obtained are smooth and slowly varying. However, the trajectories estimated from acoustic-to-articulatory inversion are found to be jagged. Thus, energy minimization is used as smoothness constraint for improving performance of the acoustic-to-articulatory inversion. Moreover, jerk (i.e., rate of change of acceleration) is known for quantification of smoothness in case of human motor movements. This motivates us to propose jerk minimization as the smoothness criteria for frame-based acoustic-to-articulatory inversion.
dc.publisherDhirubhai Ambani Institute of Information and Communication Technology
dc.subjectAcoustic-to-articulatory
dc.subjectAcoustics in engineering
dc.subjectGeneral
dc.subjectAutomatic speech recognition
dc.classification.ddc006.454 RAJ
dc.titleAcoustic-to-articulatory inversion: speech quality assessment and smoothness constraint
dc.typeDissertation
dc.degreeM. Tech
dc.student.id201311039
dc.accession.numberT00533
Appears in Collections:M Tech Dissertations

Files in This Item:
File Description SizeFormat 
201311039.pdf
  Restricted Access
1.8 MBAdobe PDFThumbnail
View/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.