Skip to main content

Research Repository

Advanced Search

Robust speaker recognition using denoised vocal source and vocal tract features

Wang, Ning; Ching, P. C.; Zheng, Nengheng; Lee, Tan

Authors

P. C. Ching

Nengheng Zheng

Tan Lee



Abstract

To alleviate the problem of severe degradation of speaker recognition performance under noisy environments because of inadequate and inaccurate speaker-discriminative information, a method of robust feature estimation that can capture both vocal source- and vocal tract-related characteristics from noisy speech utterances is proposed. Spectral subtraction, a simple yet useful speech enhancement technique, is employed to remove the noise-specific components prior to the feature extraction process. It has been shown through analytical derivation, as well as by simulation results, that the proposed feature estimation method leads to robust recognition performance, especially at low signal-to-noise ratios. In the context of Gaussian mixture model-based speaker recognition with the presence of additive white Gaussian noise, the new approach produces consistent reduction of both identification error rate and equal error rate at signal-to-noise ratios ranging from 0 to 15 dB. © 2010 IEEE.

Citation

Wang, N., Ching, P. C., Zheng, N., & Lee, T. (2011). Robust speaker recognition using denoised vocal source and vocal tract features. IEEE Transactions on Audio, Speech and Language Processing, 19(1), 196-205. https://doi.org/10.1109/TASL.2010.2045800

Journal Article Type Article
Acceptance Date Jan 26, 2010
Online Publication Date Mar 15, 2010
Publication Date Jan 1, 2011
Deposit Date Jun 19, 2019
Journal IEEE Transactions on Audio, Speech and Language Processing
Print ISSN 1558-7916
Publisher Institute of Electrical and Electronics Engineers
Peer Reviewed Peer Reviewed
Volume 19
Issue 1
Pages 196-205
DOI https://doi.org/10.1109/TASL.2010.2045800
Public URL https://uwe-repository.worktribe.com/output/1494760