Show simple item record

dc.contributor.advisorPatil, Hemant A.
dc.contributor.authorRajpal, Avni
dc.date.accessioned2017-06-10T14:43:27Z
dc.date.available2017-06-10T14:43:27Z
dc.date.issued2015
dc.identifier.citationRajpal, Avni (2015). Acoustic-to-articulatory inversion: speech quality assessment and smoothness constraint. Dhirubhai Ambani Institute of Information and Communication Technology, xii, 73 p. (Acc.No: T00533)
dc.identifier.urihttp://drsr.daiict.ac.in/handle/123456789/570
dc.description.abstractThe ability of humans to speak effortlessly, require coordinated movements of various articulators, muscles, etc. This effortless movement contributes towards naturalness, intelligibility and speaker identity in human speech, which is only partially present in speech, obtained from most of voice conversion (VC) systems. Hence, during voice conversion, the information related to speech production is lost. For quantification of the loss in information two quantities, i.e., mutual information (I) and estimation error were calculated. In this thesis, the differences in the estimated articulator trajectories are exploited to propose articulatory features based objective measure for assessing the quality of voice conversion. Moreover, a new smoothness criterion, i.e., jerk minimization is explored to deal non-uniqueness of speech inversion mapping. Speech is the result of coordinated movements of the articulators such as lips, tongue, jaw, velum, etc. Therefore, measured trajectories obtained are smooth and slowly varying. However, the trajectories estimated from acoustic-to-articulatory inversion are found to be jagged. Thus, energy minimization is used as smoothness constraint for improving performance of the acoustic-to-articulatory inversion. Moreover, jerk (i.e., rate of change of acceleration) is known for quantification of smoothness in case of human motor movements. This motivates us to propose jerk minimization as the smoothness criteria for frame-based acoustic-to-articulatory inversion.
dc.publisherDhirubhai Ambani Institute of Information and Communication Technology
dc.subjectAcoustic-to-articulatory
dc.subjectAcoustics in engineering
dc.subjectGeneral
dc.subjectAutomatic speech recognition
dc.classification.ddc006.454 RAJ
dc.titleAcoustic-to-articulatory inversion: speech quality assessment and smoothness constraint
dc.typeDissertation
dc.degreeM. Tech
dc.student.id201311039
dc.accession.numberT00533


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record