Result Details

Full-covariance UBM and Heavy-tailed PLDA in I-Vector Speaker Verification

MATĚJKA, P.; GLEMBEK, O.; CASTALDO, F.; ALAM, J.; PLCHOT, O.; KENNY, P.; BURGET, L.; ČERNOCKÝ, J. Full-covariance UBM and Heavy-tailed PLDA in I-Vector Speaker Verification. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011. Praha: IEEE Signal Processing Society, 2011. p. 4828-4831. ISBN: 978-1-4577-0537-3.
Type
conference paper
Language
English
Authors
Matějka Pavel, Ing., Ph.D., DCGM (FIT)
Glembek Ondřej, Ing., Ph.D., DCGM (FIT)
Castaldo Fabio
Alam Jahangir
Plchot Oldřich, Ing., Ph.D., DCGM (FIT)
Kenny Patrick
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Abstract

This work aims at the best possible performance of a single stand-alone system. We present a full-covariance UBM and i-vector extraction combined with different kinds of modeling. Our analysis shows that the best performance requires full-covariance i-vectors without any approximation.

Keywords

GMM, speaker recognition, PLDA, heavy-tailed PLDA, full-covariance UBM, i-vectors

Annotation

In this paper, we describe recent progress in i-vector based speaker verification. We propose the use of universal background models (UBM) with full-covariance matrices and test it thoroughly in experiments. The i-vectors are scored using a simple cosine distance as well as more advanced techniques: Probabilistic Linear Discriminant Analysis (PLDA) and a heavy-tailed variant of PLDA (PLDA-HT). Finally, we investigate dimensionality reduction of the i-vectors before they enter the PLDA-HT modeling. The results are very competitive: on the NIST 2010 SRE task, the results of a single full-covariance LDA-PLDA-HT system approach those of a complex fused system.
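
The cosine-distance scoring mentioned in the annotation can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `cosine_score` and `verify`, the toy vectors, and the threshold value are all hypothetical, and a real system would tune the threshold on held-out data and work with i-vectors of several hundred dimensions.

```python
import math

def cosine_score(enroll_ivec, test_ivec):
    """Cosine similarity between two i-vectors (higher means more similar)."""
    dot = sum(x * y for x, y in zip(enroll_ivec, test_ivec))
    norm_enroll = math.sqrt(sum(x * x for x in enroll_ivec))
    norm_test = math.sqrt(sum(y * y for y in test_ivec))
    return dot / (norm_enroll * norm_test)

def verify(enroll_ivec, test_ivec, threshold=0.5):
    """Accept the trial if the score reaches the threshold.

    The default threshold is an arbitrary placeholder, not a value
    taken from the paper.
    """
    return cosine_score(enroll_ivec, test_ivec) >= threshold

# Toy 3-dimensional vectors, for illustration only.
same_direction = verify([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
orthogonal = verify([1.0, 0.0, 0.0], [0.0, 1.0, 0.0])
```

Because the score depends only on the angle between the two vectors, scaling an i-vector does not change the decision, which is one reason this scoring is considered simple compared with PLDA.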

Published
2011
Pages
4828–4831
Proceedings
Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011
Conference
International Conference on Acoustics, Speech and Signal Processing 2011
ISBN
978-1-4577-0537-3
Publisher
IEEE Signal Processing Society
Place
Praha
DOI
10.1109/ICASSP.2011.5947436
BibTeX
@inproceedings{BUT76387,
  author="Pavel {Matějka} and Ondřej {Glembek} and Fabio {Castaldo} and Jahangir {Alam} and Oldřich {Plchot} and Patrick {Kenny} and Lukáš {Burget} and Jan {Černocký}",
  title="Full-covariance UBM and Heavy-tailed PLDA in I-Vector Speaker Verification",
  booktitle="Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011",
  year="2011",
  pages="4828--4831",
  publisher="IEEE Signal Processing Society",
  address="Praha",
  doi="10.1109/ICASSP.2011.5947436",
  isbn="978-1-4577-0537-3",
  url="https://www.fit.vut.cz/research/publication/9657/"
}
Projects
Mobile Biometry, MŠMT, Support for projects of the Seventh Framework Programme of the European Community for research, technological development and demonstration (2007 to 2013) under Act No. 171/2007 Coll., 7E08042, start: 2008-01-01, end: 2010-12-31, completed
Recognition and presentation of multimedia data, BUT, BUT internal projects, FIT-S-10-2, 2010, start: 2010-04-01, end: 2010-12-31, completed
Security-Oriented Research in Information Technology, MŠMT, Institutional funds from the state budget of the Czech Republic (e.g. VZ, VC), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running
Speech Recognition under Real-World Conditions, GACR, Standard projects, GA102/08/0707, start: 2008-01-01, end: 2011-12-31, completed