Result Details

Towards Noise-Robust Speaker Recognition Using Probabilistic Linear Discriminant Analysis

LEI, Y.; BURGET, L.; FERRER, L.; GRACIARENA, M.; SCHEFFER, N. Towards Noise-Robust Speaker Recognition Using Probabilistic Linear Discriminant Analysis. Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012. p. 4253-4256. ISBN: 978-1-4673-0044-5.

Type

conference paper

Language

English

Authors

Lei Yun
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Ferrer Luciana
Graciarena Martin
Scheffer Nicolas

Abstract

We show results on a newly designed noisy corpus for speakerrecognition where real recordings of babble noise were addedto original NIST SRE clean speech data.

Keywords

Speaker Recognition, noise, robustness,i-vector, PLDA

URL

https://www.fit.vut.cz/research/group/speech/public/publi/2012/lei_icassp2012…

Annotation

This work addresses the problem of speaker verification where additive noise is present in the enrollment and testing utterances. We show how the current state-of-the-art framework can be effectively used to mitigate this effect. We first look at the degradation a standard speaker verification system is subjected to when presented with noisy speech waveforms. We designed and generated a corpus with noisy conditions, based on the NIST SRE 2008 and 2010 data, built using open-source tools and freely available noise samples. We then show how adding noisy training data in the current i-vectorbased approach followed by probabilistic linear discriminant analysis (PLDA) can bring significant gains in accuracy at various signal-to-noise ratio (SNR) levels. We demonstrate that this improvement is not feature-specific as we present positive results for three disparate sets of features: standard mel frequency cepstral coefficients, prosodic polynomial coefficients and maximum likelihood linear regression (MLLR) transforms.

Published

2012

Pages

4253–4256

Proceedings

Proc. International Conference on Acoustics, Speech, and Signal P

Conference

The 37th International Conference on Acoustics, Speech, and Signal Processing

ISBN

978-1-4673-0044-5

Publisher

IEEE Signal Processing Society

Place

Kyoto

DOI

10.1109/ICASSP.2012.6288858

BibTeX

@inproceedings{BUT91503,
  author="Yun {Lei} and Lukáš {Burget} and Luciana {Ferrer} and Martin {Graciarena} and Nicolas {Scheffer}",
  title="Towards Noise-Robust Speaker Recognition Using Probabilistic Linear Discriminant Analysis",
  booktitle="Proc. International Conference on Acoustics, Speech, and Signal P",
  year="2012",
  pages="4253--4256",
  publisher="IEEE Signal Processing Society",
  address="Kyoto",
  doi="10.1109/ICASSP.2012.6288858",
  isbn="978-1-4673-0044-5",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2012/lei_icassp2012_0004253.pdf"
}

Projects

IARPA Biometrics Exploitation Science and Technology (BEST) - Promoting Robustness in Speaker Modeling (PRISM), IARPA, start: 2009-12-07, end: 2011-12-30, completed
Security-Oriented Research in Information Technology, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running

Research groups

Speech Data Mining Research Group BUT Speech@FIT (RG SPEECH)

Departments

Department of Computer Graphics and Multimedia (DCGM)