Publication:
Voice privacy using time-scale and pitch modification

Date

27-01-2024

Authors

Singh, Dipesh Kumar
Prajapati, Gauri
Patil, Hemant

Publisher

SN Computer Science

Abstract

There is a growing demand for the digitization of day-to-day work, and hence a surge in the use of Intelligent Personal Assistants. The extensive use of these smart digital assistants calls for security and privacy preservation techniques, because they rely on personally identifiable characteristics of the user. To that effect, various privacy preservation techniques for different types of voice assistants have been explored. Hence, voice-based digital assistants need a privacy preservation technique. In this study, we explore prosody modification methods that alter the speaker-specific characteristics of the user, so that the modified utterances can be made publicly available for training different speech-based systems. This study presents three data augmentation techniques as voice anonymization methods that modify the speaker-dependent speech parameters (i.e., […]). Voice anonymization and speech intelligibility are measured objectively using automatic speaker verification (ASV) and automatic speech recognition (ASR) experiments, respectively, on the development and test sets of the LibriSpeech dataset. For speed perturbation-based anonymization, a relative increase in % EER of up to 53.7% is observed for a perturbation factor, […], for both male and female speakers. For the same case, the % WER was adequate (less than that of the baseline system), supporting the use of the speed perturbation method as the anonymization algorithm in a voice privacy system. Similar performance is observed for pitch perturbation with perturbation factor, […]. However, tempo perturbation was not found to be useful for speaker anonymization in the experiments, with % EER on the order of 5–10 %.
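To illustrate the kind of modification the abstract refers to, the following is a minimal sketch of speed perturbation by resampling, together with the relative-increase arithmetic used to report changes in % EER. It is not the authors' implementation: the function names, the linear interpolation, the synthetic test tone, and the factor value are all assumptions made for illustration, and the factor used in the paper is not recoverable from this record. Resampling changes both duration and pitch, which is what distinguishes speed perturbation from tempo perturbation (duration only) and pitch perturbation (pitch only).

```python
import numpy as np


def speed_perturb(signal, alpha):
    """Resample `signal` so that, played at the original sample rate,
    it runs `alpha` times faster (hypothetical helper, linear interpolation).

    Duration scales by 1/alpha and every frequency scales by alpha.
    """
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    signal = np.asarray(signal, dtype=float)
    n_out = int(round(len(signal) / alpha))
    # Input positions (in samples) from which each output sample is read.
    positions = np.arange(n_out) * alpha
    # np.interp holds the last sample for positions past the end of the input.
    return np.interp(positions, np.arange(len(signal)), signal)


def relative_increase_percent(baseline, modified):
    """Relative change of `modified` with respect to `baseline`, in percent."""
    return 100.0 * (modified - baseline) / baseline


# Illustrative input only: one second of a 200 Hz tone at 16 kHz.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 200.0 * t)

# alpha is an arbitrary example value, not the factor reported in the paper.
slowed = speed_perturb(tone, alpha=0.8)
```

With a factor below 1 the utterance becomes longer and lower in pitch; a factor above 1 has the opposite effect. A production system would normally use a band-limited resampler rather than linear interpolation.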

Citation

Dipesh K. Singh, Gauri P. Prajapati, and Hemant A. Patil, "Voice Privacy Using Time-Scale and Pitch Modification," SN Computer Science, Springer, ISSN: 2661-8907, vol. 5, 27 Jan. 2024, Article no. 243, doi: 10.1007/s42979-023-02549-8.

URI

https://ir.daiict.ac.in/handle/dau.ir/1567

Collections

Journal Article
