Search
Now showing items 1-10 of 10
Speaker recognition over VoIP network
(Dhirubhai Ambani Institute of Information and Communication Technology, 2011)
This thesis deals with the Automatic Speaker Recognition (ASR) system over narrowband Voice over Internet Protocol (VoIP) networks. There are several artifacts of VoIP network such as speech codec, packet loss and packet ...
Design of syllable-based speech segmentation methods for text-to-speech (TTS) synthesis system for Gujarati
(Dhirubhai Ambani Institute of Information and Communication Technology, 2013)
Text-to-speech (TTS) synthesizer has been proved to be an aiding tool for many visually challenged people for reading through hearing feedback. Although there are TTS synthesizers available in English and other languages ...
Feature based approach for singer identification
(Dhirubhai Ambani Institute of Information and Communication Technology, 2012)
One of the challenging and difficult problems under the category of Music Information
Retrieval (MIR) is to identify a singer of a given song under strong instrumental
accompaniments. Besides instrumental sounds, other ...
Objective evaluation of speech quality of text-to-speech (TTS) synthesis systems
(Dhirubhai Ambani Institute of Information and Communication Technology, 2013)
Since the use of Text-to-Speech (TTS) technology is increasing, there is a high demand of TTS system that can produce natural and intelligible voice in any environments. In order to improve speech synthesis system, synthesized ...
Person recognition using humming, singing and speech
(Dhirubhai Ambani Institute of Information and Communication Technology, 2013)
In this thesis, person recognition system is designed for three different speech-related biometric signals, i.e., humming, singing and normal speech. As humming is nasalised sound, we have approached Mel filterbank-based ...
Person recognition from their hum
(Dhirubhai Ambani Institute of Information and Communication Technology, 2011)
In this thesis, design of person recognition system based on person's hum is presented. As hum is nasalized sound and LP (Linear Predication) model does not characterize nasal sounds sufficiently, our approach in this work ...
Phonetic segmentation: unsupervised approach
(Dhirubhai Ambani Institute of Information and Communication Technology, 2013)
Phonetic segmentation can find its potential application for Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) Synthesis systems. In this thesis, we propose use of different spectral features viz., Mel Frequency ...
Studies on transcription, classification and detection of obstruents
(Dhirubhai Ambani Institute of Information and Communication Technology, 2013)
Speech is the powerful mode of communication among the people. During the last few decades, there has been growing interest in speech related research all over the world. To develop algorithms for automatic speech recognition ...
Vocal tract length normalization for automatic speech recognition
(Dhirubhai Ambani Institute of Information and Communication Technology, 2014)
Various factors affect the performance of Automatic Speech Recognition (ASR) systems. In this thesis, speaker differences due to variations in vocal tract length (VTL) are taken into account. Vocal Tract Length Normalization ...
Analysis of voice biometric attacks: detection of synthetic vs natural speech
(Dhirubhai Ambani Institute of Information and Communication Technology, 2014)
The improvement in text-to-speech (TTS) synthesis also poses the problem of biometric attack on speaker verification system. In this context, it is required to analyse the performance of these system for false acceptance ...