Result Details
Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge
Boulianne Gilles
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
DAHMANE, M.
DIEZ SÁNCHEZ, M.
Glembek Ondřej, Ing., Ph.D., DCGM (FIT)
LALONDE, M.
LOZANO DÍEZ, A.
Matějka Pavel, Ing., Ph.D., DCGM (FIT)
MIZERA, P.
Mošner Ladislav, Ing., DCGM (FIT)
NOISEUX, C.
MONTEIRO, J.
Novotný Ondřej, Ing., Ph.D., DCGM (FIT)
Plchot Oldřich, Ing., Ph.D., DCGM (FIT)
Rohdin Johan Andréas, M.Sc., Ph.D., FIT (FIT), DCGM (FIT)
Silnova Anna, M.Sc., Ph.D., DCGM (FIT)
SLAVÍČEK, J.
Stafylakis Themos
ST-CHARLES, P.
Wang Shuai
Zeinali Hossein, Ph.D.
We present a condensed description and analysis of the jointsubmission of ABC team for NIST SRE 2019, by BUT, CRIM,Phonexia, Omilia and UAM. We concentrate on challenges thatarose during development and we analyze the results obtainedon the evaluation data and on our development sets. The conversationaltelephone speech (CMN2) condition is challengingfor current state-of-the-art systems, mainly due to the languagemismatch between training and test data. We show that a combinationof adversarial domain adaptation, backend adaptationand score normalization can mitigate this mismatch. On theVAST condition, we demonstrate the importance of deployingdiarization when dealing with multi-speaker utterances and thedrastic improvements that can be obtained by combining audioand visual modalities.
speaker verification, NIST SRE, CMN, VAST, system fusion.
@inproceedings{BUT164070,
author="ALAM, J. and BOULIANNE, G. and BURGET, L. and DAHMANE, M. and DIEZ SÁNCHEZ, M. and GLEMBEK, O. and LALONDE, M. and LOZANO DÍEZ, A. and MATĚJKA, P. and MIZERA, P. and MOŠNER, L. and NOISEUX, C. and MONTEIRO, J. and NOVOTNÝ, O. and PLCHOT, O. and ROHDIN, J. and SILNOVA, A. and SLAVÍČEK, J. and STAFYLAKIS, T. and ST-CHARLES, P. and WANG, S. and ZEINALI, H.",
title="Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge",
booktitle="Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop",
year="2020",
journal="Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland",
volume="2020",
number="11",
pages="289--295",
publisher="International Speech Communication Association",
address="Tokyo",
doi="10.21437/Odyssey.2020-41",
issn="2312-2846",
url="https://www.isca-speech.org/archive/Odyssey_2020/abstracts/73.html"
}
Information mining in speech acquired by distant microphones, MV, Bezpečnostní výzkum České republiky 2015-2020, VI20152020025, start: 2015-10-01, end: 2020-09-30, completed
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Moderní metody zpracování, analýzy a zobrazování multimediálních a 3D dat, BUT, Vnitřní projekty VUT, FIT-S-20-6460, start: 2020-03-01, end: 2023-02-28, completed
Neural Representations in multi-modal and multi-lingual modeling, GACR, Grantové projekty exelence v základním výzkumu EXPRO - 2019, GX19-26934X, start: 2019-01-01, end: 2023-12-31, completed
Real time network, text, and speaker analytics for combating organized crime, EU, Horizon 2020, start: 2019-09-01, end: 2022-12-31, completed
Robust End-To-End SPEAKER recognition based on deep learning and attention models, EU, Horizon 2020, start: 2019-06-01, end: 2021-01-31, completed