Result Details

Fast variational Bayes for heavy-tailed PLDA applied to i-vectors and x-vectors

SILNOVA, A.; BRUMMER, J.; GARCÍA-ROMERO, D.; SNYDER, D.; BURGET, L. Fast variational Bayes for heavy-tailed PLDA applied to i-vectors and x-vectors. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. no. 9, p. 72-76. ISSN: 1990-9772.
Type
conference paper
Language
English
Authors
Silnova Anna, M.Sc., Ph.D., DCGM (FIT)
Brummer Johan Nikolaas Langenhoven, Dr.
García-Romero Daniel
SNYDER, D.
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Abstract

The standard state-of-the-art backend for text-independentspeaker recognizers that use i-vectors or x-vectors, is GaussianPLDA (G-PLDA), assisted by a Gaussianization step involvinglength normalization. G-PLDA can be trained withboth generative or discriminative methods. It has long beenknown that heavy-tailed PLDA (HT-PLDA), applied withoutlength normalization, gives similar accuracy, but at considerableextra computational cost. We have recently introduced afast scoring algorithm for a discriminatively trained HT-PLDAbackend. This paper extends that work by introducing a fast,variational Bayes, generative training algorithm. We compareold and new backends, with and without length-normalization,with i-vectors and x-vectors, on SRE10, SRE16 and SITW.

Keywords

peaker recognition, variational Bayes, heavytailed PLDA

URL
Published
2018
Pages
72–76
Journal
Proceedings of Interspeech, vol. 2018, no. 9, ISSN 1990-9772
Proceedings
Proceedings of Interspeech 2018
Conference
Interspeech Conference
Publisher
International Speech Communication Association
Place
Hyderabad
DOI
UT WoS
000465363900015
EID Scopus
BibTeX
@inproceedings{BUT155098,
  author="SILNOVA, A. and BRUMMER, J. and GARCÍA-ROMERO, D. and SNYDER, D. and BURGET, L.",
  title="Fast variational Bayes for heavy-tailed PLDA applied to i-vectors and x-vectors",
  booktitle="Proceedings of Interspeech 2018",
  year="2018",
  journal="Proceedings of Interspeech",
  volume="2018",
  number="9",
  pages="72--76",
  publisher="International Speech Communication Association",
  address="Hyderabad",
  doi="10.21437/Interspeech.2018-2128",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/2128.html"
}
Files
Projects
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Neural networks for signal processing and speech data mining, TAČR, Program na podporu aplikovaného výzkumu ZÉTA, TJ01000208, start: 2018-01-01, end: 2019-12-31, completed
Research groups
Departments
Back to top