Publication: LP spectra vs. mel spectra for identification of professional mimics in Indian languages
dc.contributor.affiliation | DA-IICT, Gandhinagar | |
dc.contributor.author | Basu, T K | |
dc.contributor.author | Patil, Hemant | |
dc.date.accessioned | 2025-08-01T13:09:00Z | |
dc.date.issued | 19-05-2009 | |
dc.description.abstract | Automatic Speaker Recognition (ASR) is an economic tool for voice biometrics because of availability of low cost and powerful processors. For an ASR system to be successful in practical environments, it must have�high mimic resistance, i.e., the system should not be defeated by determined mimics which may be either identical twins or professional mimics. In this paper, we demonstrate the effectiveness of Linear Prediction (LP)-based features, viz., Linear Prediction Coefficients (LPC) and Linear Prediction Cepstral Coefficients (LPCC) over filterbank-based features such as Mel-Frequency Cepstral Coefficients (MFCC) and newly proposed Teager energy-based MFCC (T-MFCC) for the identification of professional mimics in Indian languages. Results are reported for real and fictitious experiments. On the whole, it is observed that LP-based features perform�better�than filterbank-based features (an average jump of 23.21% and 31.43% for fictitious experiments with professional mimic in Marathi and Hindi, respectively, whereas there is an average jump of 1.64% for real experiments with professional mimic in Hindi) and�we believe that this is the first time such results on identification of professional mimics in ASR are obtained. Analysis of the results is given with the help of Mean Square Error (MSE) between training and testing utterances for mimic�s imitations for target speakers and target speakers� normal voice. Fourier spectra and corresponding LP spectra for target speaker and its impersonations provided by professional mimic are shown to justify the results. Finally, dependence of LPC on physiological characteristics of vocal tract and its relation with respect to the problem addressed in this paper is studied. | |
dc.format.extent | 1-16 | |
dc.identifier.citation | Patil, Hemant A. and Basu, T. K. "LP spectra vs. mel spectra for identification of professional mimics in Indian languages," International Journal of Speech Technology, vol. 11, no. 1, pp. 1-16, May. 2009. | |
dc.identifier.doi | 10.1007/s10772-009-9031-y | |
dc.identifier.issn | 1572-8110 | |
dc.identifier.scopus | 2-s2.0-68849123320 | |
dc.identifier.uri | https://ir.daiict.ac.in/handle/dau.ir/1530 | |
dc.language.iso | en | |
dc.publisher | Springer | |
dc.relation.ispartofseries | Vol. 11; No. 1 | |
dc.source | International Journal of Speech Technology | |
dc.source.uri | https://link.springer.com/article/10.1007/s10772-009-9031-y | |
dc.title | LP spectra vs. mel spectra for identification of professional mimics in Indian languages | |
dspace.entity.type | Publication | |
relation.isAuthorOfPublication | fdb7041b-280e-498b-b2ee-34f9bc351f4c | |
relation.isAuthorOfPublication.latestForDiscovery | fdb7041b-280e-498b-b2ee-34f9bc351f4c |