Result Details

Exploiting i-vector posterior covariances for short-duration language recognition

CUMANI, S.; PLCHOT, O.; FÉR, R. Exploiting i-vector posterior covariances for short-duration language recognition. In Proceedings of Interspeech 2015. Proceedings of Interspeech. Dresden: International Speech Communication Association, 2015. no. 09, p. 1002-1006. ISBN: 978-1-5108-1790-6. ISSN: 1990-9772.
Type
conference paper
Language
English
Authors
Cumani Sandro, Ph.D.
Plchot Oldřich, Ing., Ph.D., DCGM (FIT)
Fér Radek, Ing., DCGM (FIT)
Abstract

In this work we have proposed an approach that accounts forthe uncertainty in the i-vector extraction process in the frameworkof generative Gaussian models for language recognition.

Keywords

i-vector, uncertainty, calibration, stacked bottleneckfeatures, language identification

URL
Annotation

Linear models in i-vector space have shown to be an effective solution not only for speaker identification, but also for language recogniton. The i-vector extraction process, however, is affected by several factors, such as noise level, the acoustic content of the utterance and the duration of the spoken segments. These factors influence both the i-vector estimate and its uncertainty, represented by the i-vector posterior covariance matrix. Modeling of i-vector uncertainty with Probabilistic Linear Discriminant Analysis has shown to be effective for short-duration speaker identification. This paper extends the approach to language recognition, analyzing the effects of i-vector covariances on a state-of-the-art Gaussian classifier, and proposes an effective solution for the reduction of the average detection cost (Cavg) for short segments.

Published
2015
Pages
1002–1006
Journal
Proceedings of Interspeech, vol. 2015, no. 09, ISSN 1990-9772
Proceedings
Proceedings of Interspeech 2015
Conference
Interspeech Conference
ISBN
978-1-5108-1790-6
Publisher
International Speech Communication Association
Place
Dresden
UT WoS
000380581600209
EID Scopus
BibTeX
@inproceedings{BUT119903,
  author="Sandro {Cumani} and Oldřich {Plchot} and Radek {Fér}",
  title="Exploiting i-vector posterior covariances for short-duration language recognition",
  booktitle="Proceedings of Interspeech 2015",
  year="2015",
  journal="Proceedings of Interspeech",
  volume="2015",
  number="09",
  pages="1002--1006",
  publisher="International Speech Communication Association",
  address="Dresden",
  isbn="978-1-5108-1790-6",
  issn="1990-9772",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2015/cumani_interspeech2015_IS150935.pdf"
}
Projects
Centrum excelence IT4Innovations, MŠMT, Operační program Výzkum a vývoj pro inovace, ED1.1.00/02.0070, start: 2011-01-01, end: 2015-12-31, completed
DARPA Robust Automatic Transcription of Speech (RATS) - RATS Patrol II, BBN, start: 2015-02-23, end: 2017-03-31, completed
Enabling automatic speaker verification to broad spectrum of users in the security domain, MV, Program bezpečnostního výzkumu České republiky 2010 - 2015, VG20132015129, start: 2013-04-01, end: 2015-09-30, completed
Research groups
Departments
Back to top