Result Details
BUT System Description for The Third DIHARD Speech Diarization Challenge
LANDINI, F.; LOZANO DÍEZ, A.; BURGET, L.; DIEZ SÁNCHEZ, M.; SILNOVA, A.; ŽMOLÍKOVÁ, K.; GLEMBEK, O.; MATĚJKA, P.; STAFYLAKIS, T.; BRUMMER, J. BUT System Description for The Third DIHARD Speech Diarization Challenge. Proceedings available at Dihard Challenge Github. on-line by LDC and University of Pennsylvania: 2021. p. 1-5.
Type
conference paper
Language
English
Authors
Landini Federico Nicolás, Ph.D., DCGM (FIT)
Lozano Díez Alicia, Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Diez Sánchez Mireia, M.Sc., Ph.D., DCGM (FIT)
Silnova Anna, M.Sc., Ph.D., DCGM (FIT)
Žmolíková Kateřina, Ing., Ph.D., DCGM (FIT)
Glembek Ondřej, Ing., Ph.D., DCGM (FIT)
Matějka Pavel, Ing., Ph.D., DCGM (FIT)
Stafylakis Themos
Brummer Johan Nikolaas Langenhoven, Dr.
Abstract
This is the system description corresponding to the systems developed by the BUT team for The Third DIHARD Speech Diarization Challenge. The systems for both tracks consist of a DOVERlap fusion of an end-to-end NN system with x-vector-based clustering systems in the form of spectral clustering and VBx. Given that the x-vector clustering systems do not provide overlapping speakers, overlapped speech is detected by a TasNet-based detector before the final fusion with the end-to-end approach.
Keywords
Speaker Diarization, DIHARD, VBx diarization, end-to-end diarization, overlapped speech detection
URL
Published
2021
Pages
1–5
Proceedings
Proceedings available at Dihard Challenge Github
Conference
The Third DIHARD Speech Diarization Challenge Workshop
Place
on-line by LDC and University of Pennsylvania
BibTeX
@inproceedings{BUT170909,
author="Federico Nicolás {Landini} and Alicia {Lozano Díez} and Lukáš {Burget} and Mireia {Diez Sánchez} and Anna {Silnova} and Kateřina {Žmolíková} and Ondřej {Glembek} and Pavel {Matějka} and Themos {Stafylakis} and Johan Nikolaas Langenhoven {Brummer}",
title="BUT System Description for The Third DIHARD Speech Diarization Challenge",
booktitle="Proceedings available at Dihard Challenge Github",
year="2021",
pages="1--5",
address="on-line by LDC and University of Pennsylvania",
url="https://dihardchallenge.github.io/dihard3/system_descriptions/dihard3_system_description_team55.pdf"
}
Files
Projects
Neural Representations in multi-modal and multi-lingual modeling, GACR, Grantové projekty excelence v základním výzkumu EXPRO - 2019, GX19-26934X, start: 2019-01-01, end: 2023-12-31, completed
Real time network, text, and speaker analytics for combating organized crime, EU, Horizon 2020, start: 2019-09-01, end: 2022-12-31, completed
Robust End-To-End SPEAKER recognition based on deep learning and attention models, EU, Horizon 2020, start: 2019-06-01, end: 2021-01-31, completed
Research groups
Speech Data Mining Research Group BUT Speech@FIT (RG SPEECH)
Departments