Result detail

Czech Speech Recognizer for Multiple Environments

GLEMBEK, O.; KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. Czech Speech Recognizer for Multiple Environments. Radioelektronika 2006. Bratislava: 2006. p. 1-4.
Type
conference proceedings paper
Language
English
Authors
Glembek Ondřej, Ing., Ph.D., UPGM (FIT)
Karafiát Martin, Ing., Ph.D., UPGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., UPGM (FIT)
Černocký Jan, prof. Dr. Ing., UPGM (FIT)
Abstract

This paper presents our work on building a large-vocabulary continuous speech recognition (LVCSR) system for Czech, capable of operation in multiple environments. The SpeeCon and Temic speech databases were used to define a data set for training acoustic models, with attention paid to unifying these two resources. The test set was also defined using these corpora, with segments carefully chosen so that they do not overlap with the training data. The system was completed by a language model trained on the Czech National Corpus. Recognition was performed using DUCoder, an LVCSR stack decoder. Experimental results on the LVCSR task give a reference score of the system for future improvements.

Keywords

speech, recognition, automatic, artificial intelligence, training, czech, database, acoustic, modelling, modeling, language

URL
Year
2006
Pages
1–4
Proceedings
Radioelektronika 2006
Conference
16th International Czech-Slovak Scientific Conference Radioelektronika 2006
Location
Bratislava
BibTeX
@inproceedings{BUT22376,
  author="Ondřej {Glembek} and Martin {Karafiát} and Lukáš {Burget} and Jan {Černocký}",
  title="Czech Speech Recognizer for Multiple Environments",
  booktitle="Radioelektronika 2006",
  year="2006",
  pages="1--4",
  address="Bratislava",
  url="http://www.fit.vutbr.cz/~glembek/papers/radioeletronika_2006.pdf"
}
Projects
New Trends in Research and Application of Voice Technology, GAČR, Standard projects, GA102/05/0278, start: 2005-01-01, end: 2007-12-31, completed
Research groups
Departments