Result Details
Dealing with Numbers in Grapheme-Based Speech Recognition
Karafiát Martin, Ing., Ph.D., DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Grapheme-based speech recognition approach is suitable in situation of low resource languages, where obtaining of pronunciation dictionary is time- and cost-consuming. The paper describes the process of automatic generation of pronunciation dictionaries with emphasis on the expansion of numbers and presents results on GlobalPhone database.
LVCSR, ASR, grapheme, phoneme, speech recognition.
This article presents the results of grapheme-based speech recognition for eight languages. The need for this approach arises in situation of low resource languages, where obtaining a pronunciation dictionary is time- and cost-consuming or impossible. In such scenarios, usage of grapheme dictionaries is the most simplest and straight-forward. The paper describes the process of automatic generation of pronunciation dictionaries with emphasis on the expansion of numbers. Experiments on GlobalPhone database show that grapheme-based systems have results comparable to the phoneme-based ones, especially for phonetic languages.
@inproceedings{BUT97033,
author="Miloš {Janda} and Martin {Karafiát} and Jan {Černocký}",
title="Dealing with Numbers in Grapheme-Based Speech Recognition",
booktitle="Proceedings of 15th International Conference on Text, Speech and Dialogue",
year="2012",
series="Lecture Notes in Computer Science, 2012, Volume 7499",
journal="Lecture Notes in Computer Science",
volume="2012",
number="9",
pages="438--445",
publisher="Springer Verlag",
address="Springer-Verlag Berlin Heidelberg 2012",
doi="10.1007/978-3-642-32790-2\{_}53",
isbn="978-3-642-32789-6",
issn="0302-9743",
url="http://www.springerlink.com/content/yx9807202033v381/"
}
Multilingual recognition and search in speech for electronic dictionaries, MPO, TIP, FR-TI1/034, start: 2009-09-01, end: 2013-08-31, completed
Security-Oriented Research in Information Technology, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running