Result Details

Discriminative Acoustic Language Recognition via Channel-Compensated GMM Statistics

BRÜMMER, N.; STRASHEIM, A.; HUBEIKA, V.; MATĚJKA, P.; BURGET, L.; GLEMBEK, O. Discriminative Acoustic Language Recognition via Channel-Compensated GMM Statistics. Proc. Interspeech 2009. Proceedings of Interspeech. Brighton: International Speech Communication Association, 2009. no. 9, p. 2187-2190. ISBN: 978-1-61567-692-7. ISSN: 1990-9772.
Type
conference paper
Language
English
Authors
Brümmer Niko
Strasheim Albeert
Hubeika Valiantsina, Ing., DCGM (FIT)
Matějka Pavel, Ing., Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Glembek Ondřej, Ing., Ph.D., DCGM (FIT)
Abstract

The paper is on Discriminative Acoustic Language Recognition via Channel-Compensated GMM Statistics. The results are reported on NIST LRE'07.

Keywords

acoustic language recognition, intersession variabilitycompensation, discriminative training

URL
Annotation

We propose a novel design for acoustic feature-based automatic spoken language recognizers. Our design is inspired by recent advances in text-independent speaker recognition, where intraclass variability is modeled by factor analysis in Gaussian mixture model (GMM) space. We use approximations to GMMlikelihoods which allow variable-length data sequences to be represented as statistics of fixed size. Our experiments on NIST LRE'07 show that variability-compensation of these statistics can reduce error-rates by a factor of three. Finally, we show that further improvements are possible with discriminative logistic regression training.

Published
2009
Pages
2187–2190
Journal
Proceedings of Interspeech, no. 9, ISSN 1990-9772
Proceedings
Proc. Interspeech 2009
Conference
Interspeech Conference
ISBN
978-1-61567-692-7
Publisher
International Speech Communication Association
Place
Brighton
BibTeX
@inproceedings{BUT33741,
  author="Niko {Brümmer} and Albeert {Strasheim} and Valiantsina {Hubeika} and Pavel {Matějka} and Lukáš {Burget} and Ondřej {Glembek}",
  title="Discriminative Acoustic Language Recognition via Channel-Compensated GMM Statistics",
  booktitle="Proc. Interspeech 2009",
  year="2009",
  journal="Proceedings of Interspeech",
  number="9",
  pages="2187--2190",
  publisher="International Speech Communication Association",
  address="Brighton",
  isbn="978-1-61567-692-7",
  issn="1990-9772",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2009/brummer_is2009.pdf"
}
Projects
DIRAC - Detection and Identification of Rare Audio-visual Cues, MŠMT, Šestý rámcový program Evropského společenství pro výzkum, technický rozvoj a demonstrační činnosti, 027787, start: 2006-01-01, end: 2010-12-31, completed
Mobile Biometry, MŠMT, Podpora projektů sedmého rámcového programu Evropského společenství pro výzkum, technologický rozvoj a demonstrace (2007 až 2013) podle zákona č. 171/2007 Sb., 7E08042, start: 2008-01-01, end: 2010-12-31, completed
Overcoming the language barrier complicating investigation into financing terrorism and serious financial crimes, MV, Program bezpečnostního výzkumu, VD20072010B16, start: 2007-08-01, end: 2010-12-31, completed
Speech Recognition under Real-World Conditions, GACR, Standardní projekty, GA102/08/0707, start: 2008-01-01, end: 2011-12-31, completed
Research groups
Departments
Back to top