Result Details

Speaker Verification Using End-To-End Adversarial Language Adaptation

ROHDIN, J.; STAFYLAKIS, T.; SILNOVA, A.; ZEINALI, H.; BURGET, L.; PLCHOT, O. Speaker Verification Using End-To-End Adversarial Language Adaptation. In Proceedings of ICASSP 2019. Brighton: IEEE Signal Processing Society, 2019. p. 6006-6010. ISBN: 978-1-5386-4658-8.

Type

conference paper

Language

English

Authors

Rohdin Johan Andréas, M.Sc., Ph.D., FIT (FIT), DCGM (FIT)
Stafylakis Themos
Silnova Anna, M.Sc., Ph.D., DCGM (FIT)
Zeinali Hossein, Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Plchot Oldřich, Ing., Ph.D., DCGM (FIT)

Abstract

In this paper we investigate the use of adversarial domainadaptation for addressing the problem of language mismatchbetween speaker recognition corpora. In the context ofspeaker verification, adversarial domain adaptation methodsaim at minimizing certain divergences between the distributionthat the utterance-level features follow (i.e. speakerembeddings) when drawn from source and target domains(i.e. languages), while preserving their capacity in recognizingspeakers. Neural architectures for extracting utterancelevelrepresentations enable us to apply adversarial adaptationmethods in an end-to-end fashion and train the networkjointly with the standard cross-entropy loss. We examineseveral configurations, such as the use of (pseudo-)labels onthe target domain as well as domain labels in the feature extractor,and we demonstrate the effectiveness of our methodon the challenging NIST SRE16 and SRE18 benchmarks.

Keywords

Speaker recognition, domain adaptation

URL

Published

2019

Pages

6006–6010

Proceedings

Proceedings of ICASSP 2019

Conference

2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

ISBN

978-1-5386-4658-8

Publisher

IEEE Signal Processing Society

Place

Brighton

DOI

10.1109/ICASSP.2019.8683616

UT WoS

000482554006047

EID Scopus

2-s2.0-85069003393

BibTeX

@inproceedings{BUT158086,
  author="Johan Andréas {Rohdin} and Themos {Stafylakis} and Anna {Silnova} and Hossein {Zeinali} and Lukáš {Burget} and Oldřich {Plchot}",
  title="Speaker Verification Using End-To-End Adversarial Language Adaptation",
  booktitle="Proceedings of ICASSP 2019",
  year="2019",
  pages="6006--6010",
  publisher="IEEE Signal Processing Society",
  address="Brighton",
  doi="10.1109/ICASSP.2019.8683616",
  isbn="978-1-5386-4658-8",
  url="https://ieeexplore.ieee.org/abstract/document/8683616"
}

Files

pdf rohdin_icassp2019_0006006.pdf 336 kB

Projects

Improving Robustnes in Automatic Speaker Recognition, GACR, Juniorské granty, GJ17-23870Y, start: 2017-01-01, end: 2019-12-31, completed
Sequence summarizing neural networks for speaker recognition, EU, Horizon 2020, 5SA15094, start: 2016-07-01, end: 2019-06-30, completed
Zpracování, zobrazování a analýza multimediálních a 3D dat, BUT, Vnitřní projekty VUT, FIT-S-17-3984, start: 2017-03-01, end: 2020-02-29, completed

Research groups

Speech Data Mining Research Group BUT Speech@FIT (RG SPEECH)

Departments

Department of Computer Graphics and Multimedia (DCGM)