Result Details
BUT system for low resource Indian language ASR
PULUGUNDLA, B.; BASKAR, M.; KESIRAJU, S.; EGOROVA, E.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. BUT system for low resource Indian language ASR. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. no. 9, p. 3182-3186. ISSN: 1990-9772.
Type
conference paper
Language
English
Authors
Pulugundla Bhargav, M.Sc., DCGM (FIT)
Baskar Murali Karthick, Ing., Ph.D., DCGM (FIT)
Kesiraju Santosh, Ph.D., DCGM (FIT)
Egorova Ekaterina, Ing., Ph.D., DCGM (FIT)
Karafiát Martin, Ing., Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Abstract
This paper describes the BUT Jilebi team's speech recognition systems created for the 2018 low resource speech recognition challenge for Indian languages. We investigate modifications of multilingual time-delay neural network (TDNN) architectures with transfer learning and compare them to bi-directional residual memory networks (BRMN) and bi-directional LSTM. Our best submission, based on system combination, achieved word error rates of 13.92% (Tamil), 14.71% (Telugu) and 14.06% (Gujarati). We present the details of the submitted systems and also the post-evaluation analysis done for lexicon discovery using unsupervised word segmentation.
Keywords
Indian languages, low resource ASR, multilingual, LF-MMI
URL
https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1302.html
Published
2018
Pages
3182–3186
Journal
Proceedings of Interspeech, vol. 2018, no. 9, ISSN 1990-9772
Proceedings
Proceedings of Interspeech 2018
Conference
Interspeech Conference
Publisher
International Speech Communication Association
Place
Hyderabad
DOI
10.21437/Interspeech.2018-1302
UT WoS
000465363900663
BibTeX
@inproceedings{BUT155101,
author="Bhargav {Pulugundla} and Murali Karthick {Baskar} and Santosh {Kesiraju} and Ekaterina {Egorova} and Martin {Karafiát} and Lukáš {Burget} and Jan {Černocký}",
title="BUT system for low resource Indian language ASR",
booktitle="Proceedings of Interspeech 2018",
year="2018",
journal="Proceedings of Interspeech",
volume="2018",
number="9",
pages="3182--3186",
publisher="International Speech Communication Association",
address="Hyderabad",
doi="10.21437/Interspeech.2018-1302",
issn="1990-9772",
url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1302.html"
}
Projects
DARPA Low Resource Languages for Emergent Incidents (LORELEI) - Exploiting Language Information for Situational Awareness (ELISA), University of Southern California, start: 2015-09-01, end: 2020-03-31, completed
IT4Innovations excellence in science, MŠMT, National Sustainability Programme II (Národní program udržitelnosti II), LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Neural networks for signal processing and speech data mining, TAČR, Programme for the Support of Applied Research ZÉTA (Program na podporu aplikovaného výzkumu ZÉTA), TJ01000208, start: 2018-01-01, end: 2019-12-31, completed
Research groups
Speech Data Mining Research Group BUT Speech@FIT (RG SPEECH)