Publication Details

BUT system for low resource Indian language ASR

PULUGUNDLA Bhargav, BASKAR Murali K., KESIRAJU Santosh, EGOROVA Ekaterina, KARAFIÁT Martin, BURGET Lukáš and ČERNOCKÝ Jan. BUT system for low resource Indian language ASR. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 3182-3186. ISSN 1990-9772. Available from: https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1302.html
Czech title
VUT systém pro rozpoznávání indických jazyků s omezenými zdroji
Type
conference paper
Language
English
Keywords

Indian languages, low resource ASR, multilingual, LF-MMI

Abstract

This paper describes the BUT Jilebi team's speech recognition systems created for the 2018 low resource speech recognition challenge for Indian languages. We investigate modifications of multilingual time-delay neural network (TDNN) architectures with transfer learning and compare them to bi-directional residual memory networks (BRMN) and bi-directional LSTMs. Our best submission, based on system combination, achieved word error rates of 13.92% (Tamil), 14.71% (Telugu) and 14.06% (Gujarati). We present the details of the submitted systems and the post-evaluation analysis of lexicon discovery using unsupervised word segmentation.

Published
2018
Pages
3182-3186
Journal
Proceedings of Interspeech, vol. 2018, no. 9, ISSN 1990-9772
Proceedings
Proceedings of Interspeech 2018
Conference
19th Annual Conference of the International Speech Communication Association (INTERSPEECH 2018), Hyderabad, India
Publisher
International Speech Communication Association
Place
Hyderabad, IN
DOI
10.21437/Interspeech.2018-1302
UT WoS
000465363900663
BibTeX
@INPROCEEDINGS{FITPUB11841,
   author = "Bhargav Pulugundla and Murali K. Baskar and Santosh Kesiraju and Ekaterina Egorova and Martin Karafi\'{a}t and Luk\'{a}\v{s} Burget and Jan \v{C}ernock\'{y}",
   title = "BUT system for low resource Indian language ASR",
   pages = "3182--3186",
   booktitle = "Proceedings of Interspeech 2018",
   journal = "Proceedings of Interspeech",
   volume = 2018,
   number = 9,
   year = 2018,
   location = "Hyderabad, IN",
   publisher = "International Speech Communication Association",
   ISSN = "1990-9772",
   doi = "10.21437/Interspeech.2018-1302",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/11841"
}