Result Details
BUT system for DIHARD Speech Diarization Challenge 2018
Landini Federico Nicolás, Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Rohdin Johan Andréas, M.Sc., Ph.D., FIT (FIT), DCGM (FIT)
Silnova Anna, M.Sc., Ph.D., DCGM (FIT)
Žmolíková Kateřina, Ing., Ph.D., DCGM (FIT)
Novotný Ondřej, Ing., Ph.D., DCGM (FIT)
Veselý Karel, Ing., Ph.D., DCGM (FIT)
Glembek Ondřej, Ing., Ph.D., DCGM (FIT)
Plchot Oldřich, Ing., Ph.D., DCGM (FIT)
Mošner Ladislav, Ing., DCGM (FIT)
Matějka Pavel, Ing., Ph.D., DCGM (FIT)
This paper presents the approach developed by the BUT teamfor the first DIHARD speech diarization challenge, which isbased on our Bayesian Hidden Markov Model with eigenvoicepriors system. Besides the description of the approach, we providea brief analysis of different techniques and data processingmethods tested on the development set. We also introducea simple attempt for overlapped speech detection that we usedfor attaining cleaner speaker models and reassigning overlappedspeech to multiple speakers. Finally, we present results obtainedon the evaluation set and discuss findings we made during thedevelopment phase and with the help of the DIHARD leaderboardfeedback.
Speaker Diarization, Variational Bayes, HMM,i-vector, x-vector, Overlapped speech, DIHARD
@inproceedings{BUT155100,
author="Mireia {Diez Sánchez} and Federico Nicolás {Landini} and Lukáš {Burget} and Johan Andréas {Rohdin} and Anna {Silnova} and Kateřina {Žmolíková} and Ondřej {Novotný} and Karel {Veselý} and Ondřej {Glembek} and Oldřich {Plchot} and Ladislav {Mošner} and Pavel {Matějka}",
title="BUT system for DIHARD Speech Diarization Challenge 2018",
booktitle="Proceedings of Interspeech 2018",
year="2018",
journal="Proceedings of Interspeech",
volume="2018",
number="9",
pages="2798--2802",
publisher="International Speech Communication Association",
address="Hyderabad",
doi="10.21437/Interspeech.2018-1749",
issn="1990-9772",
url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1749.html"
}
Information mining in speech acquired by distant microphones, MV, Bezpečnostní výzkum České republiky 2015-2020, VI20152020025, start: 2015-10-01, end: 2020-09-30, completed
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Neural networks for signal processing and speech data mining, TAČR, Program na podporu aplikovaného výzkumu ZÉTA, TJ01000208, start: 2018-01-01, end: 2019-12-31, completed
Robust SPEAKER DIariazation systems using Bayesian inferenCE and deep learning methods, EU, Horizon 2020, start: 2017-03-01, end: 2019-02-28, completed
Sequence summarizing neural networks for speaker recognition, EU, Horizon 2020, 5SA15094, start: 2016-07-01, end: 2019-06-30, completed