Result Details

BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020

LOZANO DÍEZ, A.; SILNOVA, A.; PULUGUNDLA, B.; ROHDIN, J.; VESELÝ, K.; BURGET, L.; PLCHOT, O.; GLEMBEK, O.; NOVOTNÝ, O.; MATĚJKA, P. BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Shanghai: International Speech Communication Association, 2020. no. 10, p. 761-765. ISSN: 1990-9772.
Type
conference paper
Language
English
Authors
Lozano Díez Alicia, Ph.D., DCGM (FIT)
Silnova Anna, M.Sc., Ph.D., DCGM (FIT)
Pulugundla Bhargav, M.Sc., DCGM (FIT)
Rohdin Johan Andréas, M.Sc., Ph.D., FIT (FIT), DCGM (FIT)
Veselý Karel, Ing., Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Plchot Oldřich, Ing., Ph.D., DCGM (FIT)
Glembek Ondřej, Ing., Ph.D., DCGM (FIT)
Novotný Ondřej, Ing., Ph.D., DCGM (FIT)
Matějka Pavel, Ing., Ph.D., DCGM (FIT)
Abstract

In this paper, we present the winning BUT submission for the text-dependent task of the SdSV Challenge 2020. Given the large amount of training data available in this challenge, we explore successful techniques from text-independent systems in the text-dependent scenario. In particular, we train x-vector extractors on both in-domain and out-of-domain datasets and combine them with i-vectors trained on concatenated MFCCs and bottleneck features, which have proven effective for the text-dependent scenario. Moreover, we propose the use of a phrase-dependent PLDA backend for scoring and its combination with a simple phrase recognizer, which brings up to 63% relative improvement on our development set with respect to using standard PLDA. Finally, we combine our different i-vector and x-vector based systems using a simple linear logistic regression score-level fusion, which provides 28% relative improvement on the evaluation set with respect to our best single system.
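
The score-level fusion mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no details beyond "linear logistic regression", so the function names (`train_fusion`, `fuse`), the plain gradient-descent training, the learning rate, and the synthetic two-system scores below are all assumptions made for illustration. The idea shown is only that each trial's scores from several systems are combined into one fused score as a weighted sum plus a bias, with the weights learned from labeled target and non-target trials.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fuse(trial_scores, weights, bias):
    # Fused score: weighted sum of the per-system scores plus a bias.
    return sum(w * s for w, s in zip(weights, trial_scores)) + bias


def train_fusion(scores, labels, lr=0.1, epochs=500):
    # scores: one list of per-system scores per trial.
    # labels: 1 for target trials, 0 for non-target trials.
    # Batch gradient descent on the mean cross-entropy loss (an assumed
    # training recipe, chosen for brevity).
    n_trials = len(scores)
    n_systems = len(scores[0])
    weights = [0.0] * n_systems
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_systems
        grad_b = 0.0
        for x, y in zip(scores, labels):
            err = sigmoid(fuse(x, weights, bias)) - y
            for k in range(n_systems):
                grad_w[k] += err * x[k]
            grad_b += err
        weights = [w - lr * g / n_trials for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n_trials
    return weights, bias


# Synthetic scores for two hypothetical systems (not data from the paper):
# target trials score higher on average than non-target trials.
rng = random.Random(0)
scores = [[rng.gauss(1.0, 1.0), rng.gauss(1.0, 1.0)] for _ in range(200)]
scores += [[rng.gauss(-1.0, 1.0), rng.gauss(-1.0, 1.0)] for _ in range(200)]
labels = [1] * 200 + [0] * 200

weights, bias = train_fusion(scores, labels)
print("fusion weights:", weights, "bias:", bias)
print("fused score for a new trial:", fuse([0.8, 1.3], weights, bias))
```

In this sketch the learned weights indicate how much each system contributes to the fused score; a real setup would train them on held-out development trials rather than on synthetic data.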

Keywords

text-dependent speaker verification, phrase-dependent PLDA, phrase recognizer

URL
https://www.isca-speech.org/archive/Interspeech_2020/pdfs/2882.pdf
Published
2020
Pages
761–765
Journal
Proceedings of Interspeech, vol. 2020, no. 10, ISSN 1990-9772
Proceedings
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Conference
Interspeech
Publisher
International Speech Communication Association
Place
Shanghai
DOI
10.21437/Interspeech.2020-2882
UT WoS
000833594100158
EID Scopus
BibTeX
@inproceedings{BUT168145,
  author="Alicia {Lozano Díez} and Anna {Silnova} and Bhargav {Pulugundla} and Johan Andréas {Rohdin} and Karel {Veselý} and Lukáš {Burget} and Oldřich {Plchot} and Ondřej {Glembek} and Ondřej {Novotný} and Pavel {Matějka}",
  title="BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020",
  booktitle="Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH",
  year="2020",
  journal="Proceedings of Interspeech",
  volume="2020",
  number="10",
  pages="761--765",
  publisher="International Speech Communication Association",
  address="Shanghai",
  doi="10.21437/Interspeech.2020-2882",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2020/pdfs/2882.pdf"
}
Projects
Employment of artificial intelligence into an emergency call reception, MV, Program bezpečnostního výzkumu ČR v letech 2015-2022 (BV III/1-VS), VI20192022169, start: 2019-07-04, end: 2022-05-31, completed
Information mining in speech acquired by distant microphones, MV, Bezpečnostní výzkum České republiky 2015-2020, VI20152020025, start: 2015-10-01, end: 2020-09-30, completed
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Modern methods for processing, analysis, and visualization of multimedia and 3D data, BUT, Vnitřní projekty VUT, FIT-S-20-6460, start: 2020-03-01, end: 2023-02-28, completed
Multi-linguality in speech technologies, MŠMT, INTER-EXCELLENCE - Podprogram INTER-ACTION, LTAIN19087, start: 2020-01-01, end: 2023-08-31, completed
Neural Representations in multi-modal and multi-lingual modeling, GACR, Grantové projekty exelence v základním výzkumu EXPRO - 2019, GX19-26934X, start: 2019-01-01, end: 2023-12-31, completed
Real time network, text, and speaker analytics for combating organized crime, EU, Horizon 2020, start: 2019-09-01, end: 2022-12-31, completed
Robust End-To-End SPEAKER recognition based on deep learning and attention models, EU, Horizon 2020, start: 2019-06-01, end: 2021-01-31, completed