Result Details

But System for the Second Dihard Speech Diarization Challenge

LANDINI, F.; WANG, S.; DIEZ SÁNCHEZ, M.; BURGET, L.; MATĚJKA, P.; ŽMOLÍKOVÁ, K.; MOŠNER, L.; SILNOVA, A.; PLCHOT, O.; NOVOTNÝ, O.; ZEINALI, H.; ROHDIN, J. But System for the Second Dihard Speech Diarization Challenge. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020. p. 6529-6533. ISBN: 978-1-5090-6631-5.
Type
conference paper
Language
English
Authors
Landini Federico Nicolás, Ph.D., DCGM (FIT)
Wang Shuai
Diez Sánchez Mireia, M.Sc., Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Matějka Pavel, Ing., Ph.D., DCGM (FIT)
Žmolíková Kateřina, Ing., Ph.D., DCGM (FIT)
Mošner Ladislav, Ing., DCGM (FIT)
Silnova Anna, M.Sc., Ph.D., DCGM (FIT)
Plchot Oldřich, Ing., Ph.D., DCGM (FIT)
Novotný Ondřej, Ing., Ph.D., DCGM (FIT)
Zeinali Hossein, Ph.D.
Rohdin Johan Andréas, M.Sc., Ph.D., FIT (FIT), DCGM (FIT)
Abstract

This paper describes the winning systems developed by theBUT team for the four tracks of the Second DIHARD SpeechDiarization Challenge. For tracks 1 and 2 the systems weremainly based on performing agglomerative hierarchical clustering(AHC) of x-vectors, followed by another x-vectorclustering based on Bayes hidden Markov model and variationalBayes inference. We provide a comparison of theimprovement given by each step and share the implementationof the core of the system. For tracks 3 and 4 withrecordings from the Fifth CHiME Challenge, we exploreddifferent approaches for doing multi-channel diarization andour best performance was obtained when applying AHC onthe fusion of per channel probabilistic linear discriminantanalysis scores.

Keywords

Speaker Diarization, Variational Bayes, HMM, DIHARD, CHiME

URL
Published
2020
Pages
6529–6533
Proceedings
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Conference
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)
ISBN
978-1-5090-6631-5
Publisher
IEEE Signal Processing Society
Place
Barcelona
DOI
UT WoS
000615970406158
EID Scopus
BibTeX
@inproceedings{BUT163962,
  author="Federico Nicolás {Landini} and Shuai {Wang} and Mireia {Diez Sánchez} and Lukáš {Burget} and Pavel {Matějka} and Kateřina {Žmolíková} and Ladislav {Mošner} and Anna {Silnova} and Oldřich {Plchot} and Ondřej {Novotný} and Hossein {Zeinali} and Johan Andréas {Rohdin}",
  title="But System for the Second Dihard Speech Diarization Challenge",
  booktitle="ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",
  year="2020",
  pages="6529--6533",
  publisher="IEEE Signal Processing Society",
  address="Barcelona",
  doi="10.1109/ICASSP40776.2020.9054251",
  isbn="978-1-5090-6631-5",
  url="https://ieeexplore.ieee.org/document/9054251"
}
Files
Projects
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Moderní metody zpracování, analýzy a zobrazování multimediálních a 3D dat, BUT, Vnitřní projekty VUT, FIT-S-20-6460, start: 2020-03-01, end: 2023-02-28, completed
Research groups
Departments
Back to top