Result Details
Phoneme Recognition using Temporal Patterns
Schwarz Petr, Ing., Ph.D., FIT (FIT)
Heřmanský Hynek, prof. Ing., Dr. Eng.
Černocký Jan, prof. Dr. Ing.
We investigate and compare several techniques for automatic recognition of unconstrained context-independent phoneme strings from the TIMIT and NTIMIT databases. Among the compared techniques, the one based on TempoRAl Patterns (TRAP) achieves the best results on clean speech, with about a 10% relative improvement over the baseline system. Its advantage is also observed in the presence of mismatch between training and testing conditions. Issues such as the optimal length of temporal patterns in the TRAP technique, the effectiveness of mean and variance normalization of the patterns, and the multi-band input to the TRAP estimations are also explored.
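As a rough illustration of the temporal patterns and the mean and variance normalization mentioned in the abstract, the sketch below cuts a fixed-length window out of the energy trajectory of a single frequency band around each frame and normalizes each window. It is only a minimal sketch under stated assumptions, not the authors' implementation: the function name `extract_traps`, the `context` parameter, the edge padding by frame repetition, and the `eps` guard are all hypothetical choices made for this example, and the abstract does not state the pattern length that was actually used.

```python
from statistics import fmean, pstdev


def extract_traps(band_energies, context=50, eps=1e-8):
    """Return one normalized temporal pattern per frame for a single band.

    band_energies: non-empty list of (log) energies of ONE frequency band,
                   one value per frame.
    context:       frames taken on each side of the current frame, so each
                   pattern has 2 * context + 1 values (hypothetical default).
    eps:           small constant that avoids division by zero for flat
                   windows (an assumption of this sketch).
    """
    # Pad the edges by repeating the first and last frame (an assumption).
    padded = (
        [band_energies[0]] * context
        + list(band_energies)
        + [band_energies[-1]] * context
    )
    length = 2 * context + 1
    patterns = []
    for t in range(len(band_energies)):
        window = padded[t:t + length]
        mean = fmean(window)
        std = pstdev(window, mu=mean)
        # Mean and variance normalization of the individual pattern.
        patterns.append([(x - mean) / (std + eps) for x in window])
    return patterns


# Toy trajectory of one band; real input would come from a filter bank.
traps = extract_traps([0.0, 1.0, 2.0, 3.0, 4.0], context=1)
```

In a multi-band setup, a function like this would be applied to each band separately and the resulting patterns passed to per-band estimators; how the bands are combined is outside the scope of this sketch.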
speech recognition, feature extraction, temporal patterns
@inproceedings{BUT14184,
  author="Pavel {Matějka} and Petr {Schwarz} and Hynek {Heřmanský} and Jan {Černocký}",
  title="Phoneme Recognition using Temporal Patterns",
  booktitle="Proc. 6th International Conference Text, Speech and Dialogue, TSD2003",
  year="2003",
  pages="465--472",
  publisher="Springer Verlag",
  address="Ceske Budejovice",
  isbn="3-540-20024-X"
}
Voice technologies for support of information society, GACR, Standardní projekty, GA102/02/0124, start: 2002-01-01, end: 2004-12-31, completed