Faculty of Information Technology, BUT

Publication Details

Multilingual Region-Dependent Transforms

KARAFIÁT Martin, BURGET Lukáš, GRÉZL František, VESELÝ Karel and ČERNOCKÝ Jan. Multilingual Region-Dependent Transforms. In: Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5430-5434. ISBN 978-1-4799-9988-0.
Czech title
Multilingvální transformace závislé na regionech
Type
conference paper
Language
english
Authors
URL
Keywords
Automatic speech recognition, Region-Dependent Transforms, Multilingual speech recognition, Feedforward neural networks
Abstract
This paper presented our further steps in the development of a feature extraction scheme easily transferable to a new language with severely limited training data.
Annotation
In recent years, trained feature extraction (FE) schemes based on neural networks have replaced or complemented traditional approaches in top performing systems. This paper deals with FE in multilingual scenarios with a target language with low amount of transcribed data. Continuing our previous work on multilingual training of Stacked Bottle-Neck Neural Network FE schemes, we concentrate on improving the discriminatively trained Region- Dependent Transforms. We show that multilingual training of RDT can be implemented by merging statistics from several languages. In our case we used up to 11 source languages to build a FE which generalize well for a new language. This allows us to build a strong bootstrapping model for the final ASR system. The results are produced on IARPA Babel data.
Published
2016
Pages
5430-5434
Proceedings
Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016
Conference
41th IEEE International Conference on Acoustics, Speech and Signal Processing, Shanghai, CN
ISBN
978-1-4799-9988-0
Publisher
IEEE Signal Processing Society
Place
Shanghai, CN
DOI
BibTeX
@INPROCEEDINGS{FITPUB11146,
   author = "Martin Karafi\'{a}t and Luk\'{a}\v{s} Burget and Franti\v{s}ek Gr\'{e}zl and Karel Vesel\'{y} and Jan \v{C}ernock\'{y}",
   title = "Multilingual Region-Dependent Transforms",
   pages = "5430--5434",
   booktitle = "Proceedings of the 41th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016",
   year = 2016,
   location = "Shanghai, CN",
   publisher = "IEEE Signal Processing Society",
   ISBN = "978-1-4799-9988-0",
   doi = "10.1109/ICASSP.2016.7472715",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/11146"
}
Back to top