Result Details
Towards Noise-Robust Speaker Recognition Using Probabilistic Linear Discriminant Analysis
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Ferrer Luciana
Graciarena Martin
Scheffer Nicolas
We show results on a newly designed noisy corpus for speakerrecognition where real recordings of babble noise were addedto original NIST SRE clean speech data.
Speaker Recognition, noise, robustness,i-vector, PLDA
This work addresses the problem of speaker verification where additive noise is present in the enrollment and testing utterances. We show how the current state-of-the-art framework can be effectively used to mitigate this effect. We first look at the degradation a standard speaker verification system is subjected to when presented with noisy speech waveforms. We designed and generated a corpus with noisy conditions, based on the NIST SRE 2008 and 2010 data, built using open-source tools and freely available noise samples. We then show how adding noisy training data in the current i-vectorbased approach followed by probabilistic linear discriminant analysis (PLDA) can bring significant gains in accuracy at various signal-to-noise ratio (SNR) levels. We demonstrate that this improvement is not feature-specific as we present positive results for three disparate sets of features: standard mel frequency cepstral coefficients, prosodic polynomial coefficients and maximum likelihood linear regression (MLLR) transforms.
@inproceedings{BUT91503,
author="Yun {Lei} and Lukáš {Burget} and Luciana {Ferrer} and Martin {Graciarena} and Nicolas {Scheffer}",
title="Towards Noise-Robust Speaker Recognition Using Probabilistic Linear Discriminant Analysis",
booktitle="Proc. International Conference on Acoustics, Speech, and Signal P",
year="2012",
pages="4253--4256",
publisher="IEEE Signal Processing Society",
address="Kyoto",
doi="10.1109/ICASSP.2012.6288858",
isbn="978-1-4673-0044-5",
url="http://www.fit.vutbr.cz/research/groups/speech/publi/2012/lei_icassp2012_0004253.pdf"
}
Security-Oriented Research in Information Technology, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running