Detail výsledku

Speaker Verification Using End-To-End Adversarial Language Adaptation

ROHDIN, J.; STAFYLAKIS, T.; SILNOVA, A.; ZEINALI, H.; BURGET, L.; PLCHOT, O. Speaker Verification Using End-To-End Adversarial Language Adaptation. In Proceedings of ICASSP 2019. Brighton: IEEE Signal Processing Society, 2019. p. 6006-6010. ISBN: 978-1-5386-4658-8.
Typ
článek ve sborníku konference
Jazyk
anglicky
Autoři
Abstrakt

In this paper we investigate the use of adversarial domainadaptation for addressing the problem of language mismatchbetween speaker recognition corpora. In the context ofspeaker verification, adversarial domain adaptation methodsaim at minimizing certain divergences between the distributionthat the utterance-level features follow (i.e. speakerembeddings) when drawn from source and target domains(i.e. languages), while preserving their capacity in recognizingspeakers. Neural architectures for extracting utterancelevelrepresentations enable us to apply adversarial adaptationmethods in an end-to-end fashion and train the networkjointly with the standard cross-entropy loss. We examineseveral configurations, such as the use of (pseudo-)labels onthe target domain as well as domain labels in the feature extractor,and we demonstrate the effectiveness of our methodon the challenging NIST SRE16 and SRE18 benchmarks.

Klíčová slova

Speaker recognition, domain adaptation

URL
Rok
2019
Strany
6006–6010
Sborník
Proceedings of ICASSP 2019
Konference
2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
ISBN
978-1-5386-4658-8
Vydavatel
IEEE Signal Processing Society
Místo
Brighton
DOI
UT WoS
000482554006047
EID Scopus
BibTeX
@inproceedings{BUT158086,
  author="Johan Andréas {Rohdin} and Themos {Stafylakis} and Anna {Silnova} and Hossein {Zeinali} and Lukáš {Burget} and Oldřich {Plchot}",
  title="Speaker Verification Using End-To-End Adversarial Language Adaptation",
  booktitle="Proceedings of ICASSP 2019",
  year="2019",
  pages="6006--6010",
  publisher="IEEE Signal Processing Society",
  address="Brighton",
  doi="10.1109/ICASSP.2019.8683616",
  isbn="978-1-5386-4658-8",
  url="https://ieeexplore.ieee.org/abstract/document/8683616"
}
Soubory
Projekty
Neuronové sítě shrnující sekvence pro rozpoznávání mluvčího, EU, Horizon 2020, 5SA15094, zahájení: 2016-07-01, ukončení: 2019-06-30, ukončen
Zpracování, zobrazování a analýza multimediálních a 3D dat, VUT, Vnitřní projekty VUT, FIT-S-17-3984, zahájení: 2017-03-01, ukončení: 2020-02-29, ukončen
Zvýšení spolehlivosti v automatickém rozpoznávání řečníka, GAČR, Juniorské granty, GJ17-23870Y, zahájení: 2017-01-01, ukončení: 2019-12-31, ukončen
Výzkumné skupiny
Pracoviště
Nahoru