Please use this identifier to cite or link to this item: http://drsr.daiict.ac.in//handle/123456789/596
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorPatil, Hemant A.
dc.contributor.authorRao, Sushant V.
dc.date.accessioned2017-06-10T14:44:06Z
dc.date.available2017-06-10T14:44:06Z
dc.date.issued2016
dc.identifier.citationRao, Sushant V. (2016). Pre-processing using outlier removal in voice conversion. Dhirubhai Ambani Institute of Information and Communication Technology, x, 61p. (Acc.No: T00559)
dc.identifier.urihttp://drsr.daiict.ac.in/handle/123456789/596
dc.description.abstractVoice conversion (VC) is a technique that modifies and converts the speech spokenby one speaker to sound as if the same sentence was spoken by another speaker.In short, only the speaker�s identity is converted and the linguistic informationfrom the source speaker remains unchanged. There are numerous methods forVC that have their own strengths and limitations. In this thesis, the problem thatis being dealt with is that of improving the quality of training by proposing a preprocessingmethod to remove the undesired observations. An attempt is made tosuccessfully make the training phase of VC systems, robust by eliminating suchobservations before estimating a mapping function. In particular, for this work,the two state-of-the-art statistical mapping techniques were implemented to testand compare the performance of the proposed approach. Voice conversion usingJoint Density Gaussian Mixture Models (JD-GMM) and Partial Least Squares(PLS) regression were used as the mapping techniques. These undesired observationsare known as outliers. By definition, outliers are observations (frames inspeech signal processing) that do not fit within the regularity of the datatset. Theconcept and effect of outliers will be further studied in this thesis. To evaluate VCconversion systems, there are a set of standard objective and subjective measuresthat are used. The performance of the VC systems is compared based on both thestandard set of measures.
dc.publisherDhirubhai Ambani Institute of Information and Communication Technology
dc.subjectVoice Conversion Systems
dc.subjectPre-processing Method
dc.subjectLine Spectral Frequencies
dc.subjectSpectral Mapping Techniques
dc.classification.ddc006.454 RAO
dc.titlePre-processing using outlier removal in voice conversion
dc.typeDissertation
dc.degreeM. Tech
dc.student.id201411005
dc.accession.numberT00559
Appears in Collections:M Tech Dissertations

Files in This Item:
File Description SizeFormat 
201411005.pdf
  Restricted Access
1.72 MBAdobe PDFThumbnail
View/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.