Novel nonlinear prediction-based features for spoofed speech detection

Bhavsar, Himanshu N.

dc.contributor.advisor	Patil, Hemant A.
dc.contributor.author	Bhavsar, Himanshu N.
dc.date.accessioned	2017-06-10T14:44:36Z
dc.date.available	2017-06-10T14:44:36Z
dc.date.issued	2016
dc.identifier.citation	Bhavsar, Himanshu N. (2016). Novel nonlinear prediction-based features for spoofed speech detection. Dhirubhai Ambani Institute of Information and Communication Technology, ix, 52p. (Acc.No: T00578)
dc.identifier.uri	http://drsr.daiict.ac.in/handle/123456789/615
dc.description.abstract	Automatic Speaker Verification (ASV) systems are prone to various spoofing attacks.<p/>Spoofing is one type of technique in which fake speech signal is given to<p/>the ASV system to get the access of that system without the permission of an<p/>authorized person. There are four types of spoofing attacks, namely, impersonation,<p/>replay, speech synthesis (SS) and voice conversion (VC). In impersonation<p/>attack, source speaker alter their voice (i.e., mimicking), replay attack record the<p/>speech from target speaker voice, using any arbitrary text spoof speech can be<p/>generated in speech synthesis, VC changes the voice of source-to-target speaker.<p/>SS and VC are more practical and they create more threat to the ASV system and<p/>hence, in this thesis work, we concentrate on the SS and VC only. For the detection<p/>of spoofed speech, we develop various countermeasures, after analysis of<p/>various plots and histograms of these features. We came up with the observation<p/>that these countermeasures might work well, to classify whether these features<p/>are from natural or spoofed speech. For that purpose, we use Gaussian<p/>Mixture Model (GMM)-based classifier. We built two different GMM for natural<p/>and spoofed speech. At the time of verification, when an unknown speech signal<p/>is given as an input to the ASV system, first features are extracted and after<p/>that, we find the likelihood score from both GMM models, which indicates the<p/>probability of these features are from both the models. If this score is greater than<p/>some threshold value, then it is classified as natural otherwise it is detected as<p/>spoofed speech. In this work, we propose linear prediction-nonlinear prediction<p/>(LP-NLP)-based countermeasure for the detection of spoofed speech signal. For<p/>experiments reported in this thesis, we used ASVspoof challenge 2015 database<p/>and database of Blizzard challenge 2012 and 2014. For the measurement of performance<p/>of the system, we use Detection Error Tradeoff (DET) curve and Equal<p/>Error Rate (EER).
dc.publisher	Dhirubhai Ambani Institute of Information and Communication Technology
dc.subject	Automatic Speaker Verification Systems
dc.subject	Spoofed Speech
dc.subject	Linear Prediction
dc.subject	Gaussian Mixture Model
dc.classification.ddc	519.54 BHA
dc.title	Novel nonlinear prediction-based features for spoofed speech detection
dc.type	Dissertation
dc.degree	M. Tech
dc.student.id	201411029
dc.accession.number	T00578

Files in this item

Name:: 201411029.pdf
Size:: 794.9Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

M Tech Dissertations [923]

Show simple item record