Result Details

BUT System Description for The Third DIHARD Speech Diarization Challenge

LANDINI, F.; LOZANO DÍEZ, A.; BURGET, L.; DIEZ SÁNCHEZ, M.; SILNOVA, A.; ŽMOLÍKOVÁ, K.; GLEMBEK, O.; MATĚJKA, P.; STAFYLAKIS, T.; BRUMMER, J. BUT System Description for The Third DIHARD Speech Diarization Challenge. Proceedings available at Dihard Challenge Github. on-line by LDC and University of Pennsylvania: 2021. p. 1-5.

Type

conference paper

Language

English

Authors

Landini Federico Nicolás, Ph.D., DCGM (FIT)
Lozano Díez Alicia, Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Diez Sánchez Mireia, M.Sc., Ph.D., DCGM (FIT)
Silnova Anna, M.Sc., Ph.D., DCGM (FIT)
Žmolíková Kateřina, Ing., Ph.D., DCGM (FIT)
Glembek Ondřej, Ing., Ph.D., DCGM (FIT)
Matějka Pavel, Ing., Ph.D., DCGM (FIT)
Stafylakis Themos
Brummer Johan Nikolaas Langenhoven, Dr.

Abstract

This is the system description corresponding to thesystems developed by the BUT team for The Third DIHARDSpeech Diarization Challenge. The systems for both tracks consistof a DOVERlap fusion of an end-to-end NN system with xvectorbased clustering systems in the form of spectral clusteringand VBx. Given that the x-vector clustering systems do notprovide overlapping speakers, overlapped speech is detected by aTasNet-based detector before the final fusion with the end-to-endapproach.

Keywords

Speaker Diarization, DIHARD, VBx diarization,end-to-end diarization, overlapped speech detection

URL

Published

2021

Pages

1–5

Proceedings

Proceedings available at Dihard Challenge Github

Conference

The Third DIHARD Speech Diarization Challenge Workshop

Place

on-line by LDC and University of Pennsylvania

BibTeX

@inproceedings{BUT170909,
  author="Federico Nicolás {Landini} and Alicia {Lozano Díez} and Lukáš {Burget} and Mireia {Diez Sánchez} and Anna {Silnova} and Kateřina {Žmolíková} and Ondřej {Glembek} and Pavel {Matějka} and Themos {Stafylakis} and Johan Nikolaas Langenhoven {Brummer}",
  title="BUT System Description for The Third DIHARD Speech Diarization Challenge",
  booktitle="Proceedings available at Dihard Challenge Github",
  year="2021",
  pages="1--5",
  address="on-line by LDC and University of Pennsylvania",
  url="https://dihardchallenge.github.io/dihard3/system_descriptions/dihard3_system_description_team55.pdf"
}

Files

pdf landini_dihard3_system_description_team55.pdf 165 kB

Projects

Neural Representations in multi-modal and multi-lingual modeling, GACR, Grantové projekty exelence v základním výzkumu EXPRO - 2019, GX19-26934X, start: 2019-01-01, end: 2023-12-31, completed
Real time network, text, and speaker analytics for combating organized crime, EU, Horizon 2020, start: 2019-09-01, end: 2022-12-31, completed
Robust End-To-End SPEAKER recognition based on deep learning and attention models, EU, Horizon 2020, start: 2019-06-01, end: 2021-01-31, completed

Research groups

Speech Data Mining Research Group BUT Speech@FIT (RG SPEECH)

Departments

Department of Computer Graphics and Multimedia (DCGM)