Detail výsledku
Data selection by sequence summarizing neural network in mismatch condition training
Karafiát Martin, Ing., Ph.D., UPGM (FIT)
Veselý Karel, Ing., Ph.D., FIT (FIT), UPGM (FIT)
Delcroix Marc
Watanabe Shinji, FIT (FIT)
Burget Lukáš, doc. Ing., Ph.D., UPGM (FIT)
Černocký Jan, prof. Dr. Ing., UPGM (FIT)
Data augmentation is a simple and efficient technique to improvethe robustness of a speech recognizer when deployed inmismatched training-test conditions. Our paper proposes a newapproach for selecting data with respect to similarity of acousticconditions. The similarity is computed based on a sequencesummarizing neural network which extracts vectors containingacoustic summary (e.g. noise and reverberation characteristics)of an utterance. Several configurations of this network and differentmethods of selecting data using these "summary-vectors"were explored. The results are reported on a mismatched conditionusing AMI training set with the proposed data selectionand CHiME3 test set.
Automatic speech recognition, Data augmentation,Data selection, Mismatch training condition, Sequencesummarization
@inproceedings{BUT132600,
author="Kateřina {Žmolíková} and Martin {Karafiát} and Karel {Veselý} and Marc {Delcroix} and Shinji {Watanabe} and Lukáš {Burget} and Jan {Černocký}",
title="Data selection by sequence summarizing neural network in mismatch condition training",
booktitle="Proceedings of Interspeech 2016",
year="2016",
pages="2354--2358",
publisher="International Speech Communication Association",
address="San Francisco",
doi="10.21437/Interspeech.2016-741",
isbn="978-1-5108-3313-5",
url="https://www.semanticscholar.org/paper/Data-Selection-by-Sequence-Summarizing-Neural-Zmol%C3%ADkov%C3%A1-Karafi%C3%A1t/bc1832e8b8d4e5edf987e1562b578bd9aa5e18a9"
}
Meeting assistant (MINT), TAČR, Program aplikovaného výzkumu a experimentálního vývoje ALFA, TA04011311, zahájení: 2014-10-01, ukončení: 2017-12-31, ukončen
Zpracování, rozpoznávání a zobrazování multimediálních a 3D dat, VUT, Vnitřní projekty VUT, FIT-S-14-2506, zahájení: 2014-01-01, ukončení: 2016-12-31, ukončen