Result Details

Data Driven Design of Filter Bank for Speech Recognition

BURGET, L., HERMANSKY, H. Data Driven Design of Filter Bank for Speech Recognition. In Proc. 4th Intl. Conference Text, Speech Dialogue. Zelezna Ruda: Springer Verlag, 2001. 6 p. ISBN: 3-540-42557-8.
Type
conference paper
Language
English
Authors
Burget Lukáš, doc. Ing., Ph.D.
Hermansky Hynek, prof.
Abstract

The filter bank approach is commonly used in the feature extraction phase of speech recognition (e.g. Mel frequency cepstral coefficients). A filter bank is applied to modify the magnitude spectrum according to physiological and psychological findings. However, since the mechanism of the human auditory system is not fully understood, the optimal filter bank parameters are not known. This work presents a method in which the filter bank, optimized for discriminability between phonemes, is derived directly from phonetically labeled speech data using Linear Discriminant Analysis. The work can be seen as further evidence that incorporating psychoacoustic findings into feature extraction can lead to better recognition performance.
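
The idea of deriving filters from labeled data with Linear Discriminant Analysis can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the spectral representation, the number of filters, or any regularization, so the function name `lda_filter_bank`, the ridge term, and the synthetic two-class data below are all assumptions made for illustration. Each row of the returned matrix is one discriminant vector over frequency bins, which can be read as the frequency response of one data-derived filter.

```python
import numpy as np


def lda_filter_bank(spectra, labels, n_filters):
    """Hypothetical helper: LDA basis vectors over spectral bins.

    spectra: (n_frames, n_bins) array, e.g. log magnitude spectra.
    labels:  (n_frames,) array of phoneme class labels.
    Returns (filters, eigvals); filters has shape (n_filters, n_bins),
    with rows ordered by decreasing discriminability.
    """
    spectra = np.asarray(spectra, dtype=float)
    labels = np.asarray(labels)
    n_bins = spectra.shape[1]
    global_mean = spectra.mean(axis=0)

    s_within = np.zeros((n_bins, n_bins))   # within-class scatter
    s_between = np.zeros((n_bins, n_bins))  # between-class scatter
    for c in np.unique(labels):
        x = spectra[labels == c]
        mean_c = x.mean(axis=0)
        centered = x - mean_c
        s_within += centered.T @ centered
        diff = (mean_c - global_mean)[:, None]
        s_between += len(x) * (diff @ diff.T)

    # Assumption: a small ridge keeps the within-class scatter invertible.
    s_within += 1e-6 * np.trace(s_within) / n_bins * np.eye(n_bins)

    # Solve S_b w = lambda S_w w by whitening with the Cholesky factor of S_w.
    chol = np.linalg.cholesky(s_within)
    inv_chol = np.linalg.inv(chol)
    eigvals, eigvecs = np.linalg.eigh(inv_chol @ s_between @ inv_chol.T)

    order = np.argsort(eigvals)[::-1][:n_filters]
    filters = (inv_chol.T @ eigvecs[:, order]).T
    return filters, eigvals[order]


# Synthetic stand-in for labeled speech: two classes that differ only in bin 3.
rng = np.random.default_rng(0)
class_a = rng.normal(0.0, 1.0, size=(500, 8))
class_b = rng.normal(0.0, 1.0, size=(500, 8))
class_b[:, 3] += 5.0
spectra = np.vstack([class_a, class_b])
labels = np.array([0] * 500 + [1] * 500)

filters, eigvals = lda_filter_bank(spectra, labels, n_filters=2)
print(np.round(np.abs(filters[0]), 4))  # the largest weight falls on bin 3
```

In this toy setup the first discriminant vector concentrates its weight on the single bin that separates the classes. With real phonetically labeled speech, the corresponding vectors would instead show which frequency regions carry the most phoneme-discriminative information.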

Published
2001
Pages
6
Proceedings
Proc. 4th Intl. Conference Text, Speech Dialogue
Conference
International Conference on Text Speech and Dialogue, TSD 2001
ISBN
3-540-42557-8
Publisher
Springer Verlag
Place
Zelezna Ruda
BibTeX
@inproceedings{BUT3682,
  author="Lukáš {Burget} and Hynek {Hermansky}",
  title="Data Driven Design of Filter Bank for Speech Recognition",
  booktitle="Proc. 4th Intl. Conference Text, Speech Dialogue",
  year="2001",
  pages="6",
  publisher="Springer Verlag",
  address="Zelezna Ruda",
  isbn="3-540-42557-8"
}