Detail výsledku

BUT System for CHiME-6 Challenge

ŽMOLÍKOVÁ, K.; KOCOUR, M.; LANDINI, F.; BENEŠ, K.; KARAFIÁT, M.; VYDANA, H.; LOZANO DÍEZ, A.; PLCHOT, O.; BASKAR, M.; ŠVEC, J.; MOŠNER, L.; MALENOVSKÝ, V.; BURGET, L.; YUSUF, B.; NOVOTNÝ, O.; GRÉZL, F.; SZŐKE, I.; ČERNOCKÝ, J. BUT System for CHiME-6 Challenge. Proceedings of CHiME 2020 Virtual Workshop. Barcelona: University of Sheffield, 2020. p. 1-3.
Typ
článek ve sborníku konference
Jazyk
anglicky
Autoři
Žmolíková Kateřina, Ing., Ph.D., UPGM (FIT)
Kocour Martin, Ing., UPGM (FIT)
Landini Federico Nicolás, Ph.D., UPGM (FIT)
Beneš Karel, Ing., Ph.D., UPGM (FIT)
Karafiát Martin, Ing., Ph.D., UPGM (FIT)
Vydana Hari Krishna, UPGM (FIT)
Lozano Díez Alicia, Ph.D., UPGM (FIT)
Plchot Oldřich, Ing., Ph.D., UPGM (FIT)
Baskar Murali Karthick, Ing., Ph.D.
Švec Ján, Ing., UPGM (FIT)
Mošner Ladislav, Ing., UPGM (FIT)
Malenovský Vladimír, Ing., Ph.D., UPGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., UPGM (FIT)
Yusuf Bolaji, UPGM (FIT)
Novotný Ondřej, Ing., Ph.D., UPGM (FIT)
Grézl František, Ing., Ph.D., UPGM (FIT)
Szőke Igor, Ing., Ph.D., UPGM (FIT)
Černocký Jan, prof. Dr. Ing., UPGM (FIT)
Abstrakt

This paper describes BUTs efforts in the development of thesystem for the CHiME-6 challenge with far-field dinner partyrecordings [1]. Our experiments are on both diarization andspeech recognition parts of the system. For diarization, we employthe VBx framework which uses Bayesian hidden Markovmodel with eigenvoice priors on x-vectors. For acoustic modeling,we explore using different subsets of data for training,different neural network architectures, discriminative training,more robust i-vectors, and semi-supervised training on Vox-Celeb data. Besides, we perform experiments with a neuralnetwork-based language model, exploring how to overcome thesmall size of the text corpus and incorporate across-segmentcontext. When fusing our best systems, we achieve 41.21 %/ 42.55 % WER on Track 1, for development and evaluation respectively,and 55.15% / 69.04 % on Track 2, for developmentand evaluation respectively.

Klíčová slova

diarization, neural network, acoustic model, language model, enhancement

URL
Rok
2020
Strany
1–3
Sborník
Proceedings of CHiME 2020 Virtual Workshop
Konference
The 6th International Workshop on Speech Processing in Everyday Environments
Vydavatel
University of Sheffield
Místo
Barcelona
DOI
BibTeX
@inproceedings{BUT164067,
  author="Kateřina {Žmolíková} and Martin {Kocour} and Federico Nicolás {Landini} and Karel {Beneš} and Martin {Karafiát} and Hari Krishna {Vydana} and Alicia {Lozano Díez} and Oldřich {Plchot} and Murali Karthick {Baskar} and Ján {Švec} and Ladislav {Mošner} and Vladimír {Malenovský} and Lukáš {Burget} and Bolaji {Yusuf} and Ondřej {Novotný} and František {Grézl} and Igor {Szőke} and Jan {Černocký}",
  title="BUT System for CHiME-6 Challenge",
  booktitle="Proceedings of CHiME 2020 Virtual Workshop",
  year="2020",
  pages="1--3",
  publisher="University of Sheffield",
  address="Barcelona",
  doi="10.21437/CHiME.2020-13",
  url="https://www.isca-speech.org/archive/CHiME_2020/pdfs/CHiME_2020_paper_zmolikova.pdf"
}
Soubory
Projekty
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, zahájení: 2016-01-01, ukončení: 2020-12-31, ukončen
Moderní metody zpracování, analýzy a zobrazování multimediálních a 3D dat, VUT, Vnitřní projekty VUT, FIT-S-20-6460, zahájení: 2020-03-01, ukončení: 2023-02-28, ukončen
Výzkumné skupiny
Pracoviště
Nahoru