Result Details

Speaker Verification Using End-To-End Adversarial Language Adaptation

ROHDIN, J.; STAFYLAKIS, T.; SILNOVA, A.; ZEINALI, H.; BURGET, L.; PLCHOT, O. Speaker Verification Using End-To-End Adversarial Language Adaptation. In Proceedings of ICASSP 2019. Brighton: IEEE Signal Processing Society, 2019. p. 6006-6010. ISBN: 978-1-5386-4658-8.
Type
conference paper
Language
English
Abstract

In this paper we investigate the use of adversarial domain adaptation for addressing the problem of language mismatch between speaker recognition corpora. In the context of speaker verification, adversarial domain adaptation methods aim at minimizing certain divergences between the distributions that the utterance-level features (i.e., speaker embeddings) follow when drawn from the source and target domains (i.e., languages), while preserving their capacity for recognizing speakers. Neural architectures for extracting utterance-level representations enable us to apply adversarial adaptation methods in an end-to-end fashion and train the network jointly with the standard cross-entropy loss. We examine several configurations, such as the use of (pseudo-)labels on the target domain as well as domain labels in the feature extractor, and we demonstrate the effectiveness of our method on the challenging NIST SRE16 and SRE18 benchmarks.
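
The joint objective described in the abstract can be illustrated with a minimal sketch. It assumes one common formulation of adversarial domain adaptation, in which a domain (language) discriminator is trained with cross-entropy and the embedding extractor is trained to minimize the speaker cross-entropy while maximizing the discriminator's loss. This is not necessarily the divergence or training scheme used in the paper. The function names and the `adv_weight` parameter are hypothetical and chosen only for illustration.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, label):
    """Cross-entropy of a single example given logits and the true class index."""
    return -math.log(softmax(logits)[label])

def discriminator_objective(domain_logits, domain_label):
    """Loss minimized by the domain (language) discriminator."""
    return cross_entropy(domain_logits, domain_label)

def encoder_objective(speaker_logits, speaker_label,
                      domain_logits, domain_label, adv_weight=0.1):
    """Loss minimized by the embedding extractor (assumed formulation).

    The speaker term keeps embeddings speaker-discriminative; the
    subtracted domain term rewards embeddings that confuse the
    discriminator. adv_weight is a hypothetical trade-off parameter.
    """
    spk = cross_entropy(speaker_logits, speaker_label)
    dom = cross_entropy(domain_logits, domain_label)
    return spk - adv_weight * dom

# One utterance: 3 candidate speakers, 2 domains (source/target language).
confused = encoder_objective([2.0, 0.0, 0.0], 0, [0.0, 0.0], 1, adv_weight=0.5)
confident = encoder_objective([2.0, 0.0, 0.0], 0, [0.0, 5.0], 1, adv_weight=0.5)
print(confused < confident)
```

In this sketch, the extractor's loss is lower when the discriminator cannot tell the two languages apart, which is the behaviour an adversarial adaptation scheme of this kind is meant to encourage. In practice the two objectives would be optimized alternately or via a gradient-reversal layer inside a neural network framework.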

Keywords

Speaker recognition, domain adaptation

URL
https://ieeexplore.ieee.org/abstract/document/8683616
Published
2019
Pages
6006–6010
Proceedings
Proceedings of ICASSP 2019
Conference
2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
ISBN
978-1-5386-4658-8
Publisher
IEEE Signal Processing Society
Place
Brighton
DOI
10.1109/ICASSP.2019.8683616
UT WoS
000482554006047
BibTeX
@inproceedings{BUT158086,
  author="Johan Andréas {Rohdin} and Themos {Stafylakis} and Anna {Silnova} and Hossein {Zeinali} and Lukáš {Burget} and Oldřich {Plchot}",
  title="Speaker Verification Using End-To-End Adversarial Language Adaptation",
  booktitle="Proceedings of ICASSP 2019",
  year="2019",
  pages="6006--6010",
  publisher="IEEE Signal Processing Society",
  address="Brighton",
  doi="10.1109/ICASSP.2019.8683616",
  isbn="978-1-5386-4658-8",
  url="https://ieeexplore.ieee.org/abstract/document/8683616"
}
Projects
Improving Robustness in Automatic Speaker Recognition, GACR, Junior grants, GJ17-23870Y, start: 2017-01-01, end: 2019-12-31, completed
Sequence summarizing neural networks for speaker recognition, EU, Horizon 2020, 5SA15094, start: 2016-07-01, end: 2019-06-30, completed
Processing, visualization and analysis of multimedia and 3D data, BUT, BUT internal projects, FIT-S-17-3984, start: 2017-03-01, end: 2020-02-29, completed