Result Details
BUT System Description for The Third DIHARD Speech Diarization Challenge
        LANDINI, F.; LOZANO DÍEZ, A.; BURGET, L.; DIEZ SÁNCHEZ, M.; SILNOVA, A.; ŽMOLÍKOVÁ, K.; GLEMBEK, O.; MATĚJKA, P.; STAFYLAKIS, T.; BRUMMER, J. BUT System Description for The Third DIHARD Speech Diarization Challenge. Proceedings available at Dihard Challenge Github. on-line by LDC and University of Pennsylvania: 2021. p. 1-5.  
    
                Type
            
        
                conference paper
            
        
                Language
            
        
                English
            
        
            Authors
            
        
                Landini Federico Nicolás, Ph.D., DCGM (FIT)
                
Lozano Díez Alicia, Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Diez Sánchez Mireia, M.Sc., Ph.D., DCGM (FIT)
Silnova Anna, M.Sc., Ph.D., DCGM (FIT)
Žmolíková Kateřina, Ing., Ph.D., DCGM (FIT)
Glembek Ondřej, Ing., Ph.D., DCGM (FIT)
Matějka Pavel, Ing., Ph.D., DCGM (FIT)
Stafylakis Themos
Brummer Johan Nikolaas Langenhoven, Dr.
        Lozano Díez Alicia, Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Diez Sánchez Mireia, M.Sc., Ph.D., DCGM (FIT)
Silnova Anna, M.Sc., Ph.D., DCGM (FIT)
Žmolíková Kateřina, Ing., Ph.D., DCGM (FIT)
Glembek Ondřej, Ing., Ph.D., DCGM (FIT)
Matějka Pavel, Ing., Ph.D., DCGM (FIT)
Stafylakis Themos
Brummer Johan Nikolaas Langenhoven, Dr.
                    Abstract
            
        This is the system description corresponding to thesystems developed by the BUT team for The Third DIHARDSpeech Diarization Challenge. The systems for both tracks consistof a DOVERlap fusion of an end-to-end NN system with xvectorbased clustering systems in the form of spectral clusteringand VBx. Given that the x-vector clustering systems do notprovide overlapping speakers, overlapped speech is detected by aTasNet-based detector before the final fusion with the end-to-endapproach.
                Keywords
            
        Speaker Diarization, DIHARD, VBx diarization,end-to-end diarization, overlapped speech detection
                URL
            
        
                Published
            
            
                    2021
                    
                
            
                    Pages
                
            
                        1–5
                
            
                        Proceedings
                
            
                    Proceedings available at Dihard Challenge Github
                
            
                    Conference
                
            
                    The Third DIHARD Speech Diarization Challenge Workshop
                
            
                    Place
                
            
                    on-line by LDC and University of Pennsylvania
                
            
                    BibTeX
                
            @inproceedings{BUT170909,
  author="Federico Nicolás {Landini} and Alicia {Lozano Díez} and Lukáš {Burget} and Mireia {Diez Sánchez} and Anna {Silnova} and Kateřina {Žmolíková} and Ondřej {Glembek} and Pavel {Matějka} and Themos {Stafylakis} and Johan Nikolaas Langenhoven {Brummer}",
  title="BUT System Description for The Third DIHARD Speech Diarization Challenge",
  booktitle="Proceedings available at Dihard Challenge Github",
  year="2021",
  pages="1--5",
  address="on-line by LDC and University of Pennsylvania",
  url="https://dihardchallenge.github.io/dihard3/system_descriptions/dihard3_system_description_team55.pdf"
}
                Files
            
        
                Projects
            
        
        
            
        
    
    
        Neural Representations in multi-modal and multi-lingual modeling, GACR, Grantové projekty exelence v základním výzkumu EXPRO - 2019, GX19-26934X, start: 2019-01-01, end: 2023-12-31, completed
                
Real time network, text, and speaker analytics for combating organized crime, EU, Horizon 2020, start: 2019-09-01, end: 2022-12-31, completed
Robust End-To-End SPEAKER recognition based on deep learning and attention models, EU, Horizon 2020, start: 2019-06-01, end: 2021-01-31, completed
        Real time network, text, and speaker analytics for combating organized crime, EU, Horizon 2020, start: 2019-09-01, end: 2022-12-31, completed
Robust End-To-End SPEAKER recognition based on deep learning and attention models, EU, Horizon 2020, start: 2019-06-01, end: 2021-01-31, completed
                Research groups
            
        
                Speech Data Mining Research Group BUT Speech@FIT (RG SPEECH)
            
        
                Departments