Novel nonlinear prediction-based features for spoofed speech detection

Bhavsar, Himanshu N.

View/Open

201411029.pdf (794.9Kb)

Date

2016

Author

Bhavsar, Himanshu N.

Metadata

Show full item record

Abstract

Automatic Speaker Verification (ASV) systems are prone to various spoofing attacks.Spoofing is one type of technique in which fake speech signal is given tothe ASV system to get the access of that system without the permission of anauthorized person. There are four types of spoofing attacks, namely, impersonation,replay, speech synthesis (SS) and voice conversion (VC). In impersonationattack, source speaker alter their voice (i.e., mimicking), replay attack record thespeech from target speaker voice, using any arbitrary text spoof speech can begenerated in speech synthesis, VC changes the voice of source-to-target speaker.SS and VC are more practical and they create more threat to the ASV system andhence, in this thesis work, we concentrate on the SS and VC only. For the detectionof spoofed speech, we develop various countermeasures, after analysis ofvarious plots and histograms of these features. We came up with the observationthat these countermeasures might work well, to classify whether these featuresare from natural or spoofed speech. For that purpose, we use GaussianMixture Model (GMM)-based classifier. We built two different GMM for naturaland spoofed speech. At the time of verification, when an unknown speech signalis given as an input to the ASV system, first features are extracted and afterthat, we find the likelihood score from both GMM models, which indicates theprobability of these features are from both the models. If this score is greater thansome threshold value, then it is classified as natural otherwise it is detected asspoofed speech. In this work, we propose linear prediction-nonlinear prediction(LP-NLP)-based countermeasure for the detection of spoofed speech signal. Forexperiments reported in this thesis, we used ASVspoof challenge 2015 databaseand database of Blizzard challenge 2012 and 2014. For the measurement of performanceof the system, we use Detection Error Tradeoff (DET) curve and EqualError Rate (EER).

URI

http://drsr.daiict.ac.in/handle/123456789/615

Collections

M Tech Dissertations [923]