Publication:
Effectiveness of Teager energy operator for epoch detection from speech signals

dc.contributor.affiliationDA-IICT, Gandhinagar
dc.contributor.authorViswanath, Srikant
dc.contributor.authorPatil, Hemant
dc.contributor.researcherViswanath, Srikant (200701125)
dc.date.accessioned2025-08-01T13:09:00Z
dc.date.issued01-12-2011
dc.description.abstractIn this paper, we try to present the problem of epoch detection from a different perspective that not only deals with estimation of epoch instances (i.e., glottal activity) but also with quantification of the absence of epochs (i.e.,�no�glottal activity) in the unvoiced regions of speech signal. Most of the epoch detection methods perform significantly well in the voiced regions of speech but are not robust enough in the unvoiced regions of speech, i.e., they detect a number of�pseudo�epochs in the unvoiced regions of speech. We propose a simple method based on Teager Energy Operator (TEO) which not only determines the epochs in voiced region (due to its superior temporal resolution and its ability to capture airflow properties through the glottis) but also is very effective in unvoiced region. Recently proposed methods such as 0-Hz resonator-based method and DYPSA method gave a combined rate (CR) (for detecting epochs in voiced and unvoiced regions of speech) of 74.7% and 60%, respectively and a pseudo epoch rate (PER) (i.e., spurious epochs in the unvoiced regions of speech) of 62.9% and 54.04%, respectively. On the other hand, our proposed method gave a CR and PER of 87% and 0.27%, respectively. This result suggests that the proposed method captures�glottal activity�more efficiently both in voiced and unvoiced regions of speech signal. The performance of the proposed method is demonstrated using publicly available CMU-Arctic database using the epoch information from the electro-glottograph (EGG) as reference signal to serve as ground truth for estimation of glottal closure instants (GCI). Due to the noise suppression capability of TEO, the proposed method has almost no or little effect (i.e., robust) against signal degradations like white, babble, high frequency and vehicle noises as compared to 0-Hz resonator and DYPSA methods.
dc.format.extent321-337
dc.identifier.citationPatil, Hemant A, and Srikant Viswanath, "Effectiveness of Teager energy operator for epoch detection from speech signals," International Journal of Speech Technology (IJST), Vol. 14, no. 4, Dec. 2011, pp. 321-337. Doi: 10.1007/s10772-011-9110-8
dc.identifier.doi10.1007/s10772-011-9110-8
dc.identifier.issn1572-8110
dc.identifier.scopus2-s2.0-84864712967
dc.identifier.urihttps://ir.daiict.ac.in/handle/dau.ir/1531
dc.language.isoen
dc.publisherSpringer
dc.relation.ispartofseriesVol. 14; No. 4
dc.sourceInternational Journal of Speech Technology (IJST)
dc.source.urihttps://link.springer.com/article/10.1007/s10772-011-9110-8
dc.titleEffectiveness of Teager energy operator for epoch detection from speech signals
dspace.entity.typePublication
relation.isAuthorOfPublicationfdb7041b-280e-498b-b2ee-34f9bc351f4c
relation.isAuthorOfPublication.latestForDiscoveryfdb7041b-280e-498b-b2ee-34f9bc351f4c

Files

Collections