Detail výsledku

Towards Lower Error Rates in Phoneme Recognition

SCHWARZ, P.; MATĚJKA, P.; ČERNOCKÝ, J. Towards Lower Error Rates in Phoneme Recognition. Lecture Notes in Computer Science, 2004, vol. 2004, no. 3206, p. 465-472. ISSN: 0302-9743.
Typ
článek v časopise
Jazyk
anglicky
Autoři
Schwarz Petr, Ing., Ph.D., UPGM (FIT)
Matějka Pavel, Ing., Ph.D., UPGM (FIT)
Černocký Jan, prof. Dr. Ing., UPGM (FIT)
Abstrakt

We investigate techniques for acoustic modeling in automaticrecognition of context-independent phoneme strings from the TIMITdatabase. The baseline phoneme recognizer is based on TempoRAl Patterns(TRAP). This recognizer is simplified to shorten processing times andreduce computational requirements. More states per phoneme and bi-gramlanguage models are incorporated into the system and evaluated. Thequestion of insufficient amount of training data is discussed and thesystem is improved. All modifications lead to a faster system withabout 23.6% relative improvement over the baseline in phoneme errorrate.

Klíčová slova

phoneme recognition, traps, speech recognition, feature extraction

URL
Rok
2004
Strany
465–472
Časopis
Lecture Notes in Computer Science, roč. 2004, č. 3206, ISSN 0302-9743
BibTeX
@article{BUT45739,
  author="Petr {Schwarz} and Pavel {Matějka} and Jan {Černocký}",
  title="Towards  Lower Error Rates in Phoneme Recognition",
  journal="Lecture Notes in Computer Science",
  year="2004",
  volume="2004",
  number="3206",
  pages="465--472",
  issn="0302-9743",
  url="http://www.springerlink.com/index/KBY35VBXY16WHV56"
}
Projekty
Daty řízené a antropické kódování a rozpoznávání řeči, GAČR, Postdoktorandské granty, GP102/02/D108, zahájení: 2002-09-01, ukončení: 2005-08-30, ukončen
Hlasové technologie v podpoře informační společnosti, GAČR, Standardní projekty, GA102/02/0124, zahájení: 2002-01-01, ukončení: 2004-12-31, ukončen
Výzkumné skupiny
Pracoviště
Nahoru