Now showing items 1-9 of 9

    • Acoustic-to-articulatory inversion: speech quality assessment and smoothness constraint 

      Rajpal, Avni (Dhirubhai Ambani Institute of Information and Communication Technology, 2015)
      The ability of humans to speak effortlessly, require coordinated movements of various articulators, muscles, etc. This effortless movement contributes towards naturalness, intelligibility and speaker identity in human ...
    • Analysis of nonlinearity in speech production mechanism for speaker verification: phase-based approach 

      Agrawal, Purvi (Dhirubhai Ambani Institute of Information and Communication Technology, 2015)
      Many of the real-world signal processing problems can be described using linear models, and can be realized as analog or digital filter, time-invariant filters; finite or infinite impulse response (IIR or FIR) filters. In ...
    • Feature based approach for singer identification 

      Radadia, Purushotam G. (Dhirubhai Ambani Institute of Information and Communication Technology, 2012)
      One of the challenging and difficult problems under the category of Music Information Retrieval (MIR) is to identify a singer of a given song under strong instrumental accompaniments. Besides instrumental sounds, other ...
    • Gaussian mixture models for spoken language identification 

      Manwani, Naresh (Dhirubhai Ambani Institute of Information and Communication Technology, 2006)
      Language Identification (LID) is the problem of identifying the language of any spoken utterance irrespective of the topic, speaker or the duration of the speech. Although A very huge amount of work has been done for ...
    • Phonetic segmentation: unsupervised approach 

      Vachhani, Bhavikkumar Bhagvanbhai (Dhirubhai Ambani Institute of Information and Communication Technology, 2013)
      Phonetic segmentation can find its potential application for Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) Synthesis systems. In this thesis, we propose use of different spectral features viz., Mel Frequency ...
    • Speaker recognition over VoIP network 

      Goswami, Parth A. (Dhirubhai Ambani Institute of Information and Communication Technology, 2011)
      This thesis deals with the Automatic Speaker Recognition (ASR) system over narrowband Voice over Internet Protocol (VoIP) networks. There are several artifacts of VoIP network such as speech codec, packet loss and packet ...
    • Spectro-temporal features based automatic speech recognition 

      Nagpal, Ankit (Dhirubhai Ambani Institute of Information and Communication Technology, 2015)
      ASR technology has found its application in almost every field in life. Today‟s world cannot be considered as noise-free and deploying ASR technology in such environments would incorporate the challenge to deal with various ...
    • Unsupervised speaker-invariant feature representations for QbE-STD 

      R., Sreeraj (Dhirubhai Ambani Institute of Information and Communication Technology, 2018)
      Query-by-Example Spoken Term Detection (QbE-STD) is the task of retrieving audio documents relevant to the user query in spoken form, from a huge collection of audio data. The idea in QbE-STD is to match the audio documents ...
    • Vocal tract length normalization for automatic speech recognition 

      Sharma, Shubham (Dhirubhai Ambani Institute of Information and Communication Technology, 2014)
      Various factors affect the performance of Automatic Speech Recognition (ASR) systems. In this thesis, speaker differences due to variations in vocal tract length (VTL) are taken into account. Vocal Tract Length Normalization ...