Result Details
Phoneme Recognition using Temporal Patterns
Schwarz Petr, Ing., Ph.D., FIT (FIT)
Černocký Jan, prof. Dr. Ing.
Heřmanský Hynek, prof. Ing., Dr. Eng.
We investigate and compare several techniques for automatic recognition of
unconstrained context-independent phoneme strings from TIMIT and NTIMIT
databases. Among the compared techniques, the technique based on TempoRAl
Patterns (TRAP) achieves the best results in the clean speech, it
achieves about 10% relative improovements against baseline system.
Its advantage is also observed
in the presence of mismatch between training and testing conditions.
Issues such as the optimal length of temporal patterns in the
TRAP technique and the effectiveness of mean and variance normalization of
the patterns and the multi-band input the TRAP estimations, are also explored.
Speech,recognition,phoneme,temporal pattern
@inproceedings{BUT7941,
author="Pavel {Matějka} and Petr {Schwarz} and Jan {Černocký} and Hynek {Heřmanský}",
title="Phoneme Recognition using Temporal Patterns",
booktitle="In Proceedings of the conference TSD'2003. International Conference on Text Speech and Dialogue, TSD 2003",
year="2003",
volume="2003",
pages="8",
isbn="3-540-20024-X"
}
Multi Modal Meeting Manager, IST-2001-34485, start: 2002-03-01, end: 2005-02-28, completed
Voice technologies for support of information society, GACR, Standardní projekty, GA102/02/0124, start: 2002-01-01, end: 2004-12-31, completed