Result detail

On the use of X-vectors for Robust Speaker Recognition

NOVOTNÝ, O.; PLCHOT, O.; MATĚJKA, P.; MOŠNER, L.; GLEMBEK, O. On the use of X-vectors for Robust Speaker Recognition. Proceedings of Odyssey 2018. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Les Sables d'Olonne: International Speech Communication Association, 2018. no. 6, p. 168-175. ISSN: 2312-2846.
Type
conference proceedings paper
Language
English
Authors
Novotný Ondřej, Ing., Ph.D., UPGM (FIT)
Plchot Oldřich, Ing., Ph.D., UPGM (FIT)
Matějka Pavel, Ing., Ph.D., UPGM (FIT)
Mošner Ladislav, Ing., UPGM (FIT)
Glembek Ondřej, Ing., Ph.D., UPGM (FIT)
Abstract

Text-independent speaker verification (SV) is currently in the process of embracing DNN modeling in every stage of SV system. Slowly, the DNN-based approaches such as end-to-end modelling and systems based on DNN embeddings start to be competitive even in challenging and diverse channel conditions of recent NIST SREs. Domain adaptation and the need for a large amount of training data are still a challenge for current discriminative systems and (unlike with generative models), we see significant gains from data augmentation, simulation and other techniques designed to overcome lack of training data. We present an analysis of a SV system based on DNN embeddings (x-vectors) and focus on robustness across diverse data domains such as standard telephone and microphone conversations, both in clean, noisy and reverberant environments. We also evaluate the system on challenging far-field data created by re-transmitting a subset of NIST SRE 2008 and 2010 microphone interviews. We compare our results with the state-of-the-art i-vector system. In general, we were able to achieve better performance with the DNN-based systems, but most importantly, we have confirmed the robustness of such systems across multiple data domains.

Keywords

Speaker Recognition, Embedding, X-vectors, DNN

Year
2018
Pages
168–175
Journal
Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland, vol. 2018, no. 6, ISSN 2312-2846
Proceedings
Proceedings of Odyssey 2018
Conference
Odyssey 2018
Publisher
International Speech Communication Association
Place
Les Sables d'Olonne
DOI
10.21437/Odyssey.2018-24
BibTeX
@inproceedings{BUT155075,
  author="Ondřej {Novotný} and Oldřich {Plchot} and Pavel {Matějka} and Ladislav {Mošner} and Ondřej {Glembek}",
  title="On the use of X-vectors for Robust Speaker Recognition",
  booktitle="Proceedings of Odyssey 2018",
  year="2018",
  journal="Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland",
  volume="2018",
  number="6",
  pages="168--175",
  publisher="International Speech Communication Association",
  address="Les Sables d'Olonne",
  doi="10.21437/Odyssey.2018-24",
  issn="2312-2846",
  url="https://www.fit.vut.cz/research/publication/11787/"
}
Projects
Dolování infoRmAcí z řeči Pořízené vzdÁlenými miKrofony, MV, Security Research of the Czech Republic 2015-2020, VI20152020025, start: 2015-10-01, end: 2020-09-30, completed
IT4Innovations excellence in science, MŠMT, National Sustainability Programme II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Zvýšení spolehlivosti v automatickém rozpoznávání řečníka, GAČR, Junior grants, GJ17-23870Y, start: 2017-01-01, end: 2019-12-31, completed