Result Details

BUT system for low resource Indian language ASR

PULUGUNDLA, B.; BASKAR, M.; KESIRAJU, S.; EGOROVA, E.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. BUT system for low resource Indian language ASR. In Proceedings of Interspeech 2018. Proceedings of Interspeech. Hyderabad: International Speech Communication Association, 2018. no. 9, p. 3182-3186. ISSN: 1990-9772.
Type
conference paper
Language
English
Authors
Pulugundla Bhargav, M.Sc., DCGM (FIT)
Baskar Murali Karthick, Ing., Ph.D., DCGM (FIT)
Kesiraju Santosh, Ph.D., DCGM (FIT)
Egorova Ekaterina, Ing., Ph.D., DCGM (FIT)
Karafiát Martin, Ing., Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Abstract

This paper describes the BUT Jilebi teams speech recognitionsystems created for the 2018 low resource speech recognitionchallenge for Indian languages. We investigate modifications ofmultilingual time-delay neural network (TDNN) architectureswith transfer learning and compare them to bi-directionalresidual memory networks (BRMN) and bi-directional LSTM.Our best submission based on system combination achievedword error rates of 13.92% (Tamil), 14.71% (Telugu) and14.06% (Gujarati). We present the details of submitted systemsand also the post-evaluation analysis done for lexicon discoveryusing unsupervised word segmentation.

Keywords

Indian languages, low resource ASR, multilingual, LF-MMI

URL
Published
2018
Pages
3182–3186
Journal
Proceedings of Interspeech, vol. 2018, no. 9, ISSN 1990-9772
Proceedings
Proceedings of Interspeech 2018
Conference
Interspeech Conference
Publisher
International Speech Communication Association
Place
Hyderabad
DOI
UT WoS
000465363900663
EID Scopus
BibTeX
@inproceedings{BUT155101,
  author="Bhargav {Pulugundla} and Murali Karthick {Baskar} and Santosh {Kesiraju} and Ekaterina {Egorova} and Martin {Karafiát} and Lukáš {Burget} and Jan {Černocký}",
  title="BUT system for low resource Indian language ASR",
  booktitle="Proceedings of Interspeech 2018",
  year="2018",
  journal="Proceedings of Interspeech",
  volume="2018",
  number="9",
  pages="3182--3186",
  publisher="International Speech Communication Association",
  address="Hyderabad",
  doi="10.21437/Interspeech.2018-1302",
  issn="1990-9772",
  url="https://www.isca-speech.org/archive/Interspeech_2018/abstracts/1302.html"
}
Files
Projects
DARPA Low Resource Languages for Emergent Incidents (LORELEI) - Exploiting Language Information for Situational Awareness (ELISA), University of Southern California, start: 2015-09-01, end: 2020-03-31, completed
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Neural networks for signal processing and speech data mining, TAČR, Program na podporu aplikovaného výzkumu ZÉTA, TJ01000208, start: 2018-01-01, end: 2019-12-31, completed
Research groups
Departments
Back to top