Result Details

BUT System for CHiME-6 Challenge

ŽMOLÍKOVÁ, K.; KOCOUR, M.; LANDINI, F.; BENEŠ, K.; KARAFIÁT, M.; VYDANA, H.; LOZANO DÍEZ, A.; PLCHOT, O.; BASKAR, M.; ŠVEC, J.; MOŠNER, L.; MALENOVSKÝ, V.; BURGET, L.; YUSUF, B.; NOVOTNÝ, O.; GRÉZL, F.; SZŐKE, I.; ČERNOCKÝ, J. BUT System for CHiME-6 Challenge. Proceedings of CHiME 2020 Virtual Workshop. Barcelona: University of Sheffield, 2020. p. 1-3.
Type
conference paper
Language
English
Authors
Žmolíková Kateřina, Ing., Ph.D., DCGM (FIT)
Kocour Martin, Ing., DCGM (FIT)
Landini Federico Nicolás, Ph.D., DCGM (FIT)
Beneš Karel, Ing., Ph.D., DCGM (FIT)
Karafiát Martin, Ing., Ph.D., DCGM (FIT)
Vydana Hari Krishna, DCGM (FIT)
Lozano Díez Alicia, Ph.D., DCGM (FIT)
Plchot Oldřich, Ing., Ph.D., DCGM (FIT)
Baskar Murali Karthick, Ing., Ph.D., DCGM (FIT)
Švec Ján, Ing., DCGM (FIT)
Mošner Ladislav, Ing., DCGM (FIT)
Malenovský Vladimír, Ing., Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Yusuf Bolaji, FIT (FIT), DCGM (FIT)
Novotný Ondřej, Ing., Ph.D., DCGM (FIT)
Grézl František, Ing., Ph.D., DCGM (FIT)
Szőke Igor, Ing., Ph.D., DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Abstract

This paper describes BUTs efforts in the development of thesystem for the CHiME-6 challenge with far-field dinner partyrecordings [1]. Our experiments are on both diarization andspeech recognition parts of the system. For diarization, we employthe VBx framework which uses Bayesian hidden Markovmodel with eigenvoice priors on x-vectors. For acoustic modeling,we explore using different subsets of data for training,different neural network architectures, discriminative training,more robust i-vectors, and semi-supervised training on Vox-Celeb data. Besides, we perform experiments with a neuralnetwork-based language model, exploring how to overcome thesmall size of the text corpus and incorporate across-segmentcontext. When fusing our best systems, we achieve 41.21 %/ 42.55 % WER on Track 1, for development and evaluation respectively,and 55.15% / 69.04 % on Track 2, for developmentand evaluation respectively.

Keywords

diarization, neural network, acoustic model, language model, enhancement

URL
Published
2020
Pages
1–3
Proceedings
Proceedings of CHiME 2020 Virtual Workshop
Conference
The 6th International Workshop on Speech Processing in Everyday Environments
Publisher
University of Sheffield
Place
Barcelona
DOI
BibTeX
@inproceedings{BUT164067,
  author="Kateřina {Žmolíková} and Martin {Kocour} and Federico Nicolás {Landini} and Karel {Beneš} and Martin {Karafiát} and Hari Krishna {Vydana} and Alicia {Lozano Díez} and Oldřich {Plchot} and Murali Karthick {Baskar} and Ján {Švec} and Ladislav {Mošner} and Vladimír {Malenovský} and Lukáš {Burget} and Bolaji {Yusuf} and Ondřej {Novotný} and František {Grézl} and Igor {Szőke} and Jan {Černocký}",
  title="BUT System for CHiME-6 Challenge",
  booktitle="Proceedings of CHiME 2020 Virtual Workshop",
  year="2020",
  pages="1--3",
  publisher="University of Sheffield",
  address="Barcelona",
  doi="10.21437/CHiME.2020-13",
  url="https://www.isca-speech.org/archive/CHiME_2020/pdfs/CHiME_2020_paper_zmolikova.pdf"
}
Files
Projects
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Moderní metody zpracování, analýzy a zobrazování multimediálních a 3D dat, BUT, Vnitřní projekty VUT, FIT-S-20-6460, start: 2020-03-01, end: 2023-02-28, completed
Research groups
Departments
Back to top