Result Details

TRAP based features for LVCSR of meeting data

KARAFIÁT, M.; GRÉZL, F.; ČERNOCKÝ, J. TRAP based features for LVCSR of meeting data. Proc. 8th International Conference on Spoken Language Processing. Jeju Island: Sunjin Printing Co, 2004. no. 10, p. 437-440. ISSN: 1225-4111.
Type
conference paper
Language
English
Authors
Abstract

This paper describes using temporal patterns (TRAPs) feature extraction in large vocabulary continuous speech recognition (LVCSR) of meeting data. Frequency differentiation and local operators are applied to critical-band speech spectrum. Tests are performed with HMM recognizer on ICSI meetings database. We show that TRAP features in combination with standard ones lead to improvement of word-error rate (WER).

Keywords

Speech recognition, TRAP, LVCSR

URL
Annotation

This paper describes using temporal patterns (TRAPs) feature extraction in large vocabulary continuous speech recognition (LVCSR) of meeting data. Frequency differentiation and local operators are applied to critical-band speech spectrum. Tests are performed with HMM recognizer on ICSI meetings database. We show that TRAP features in combination with standard ones lead to improvement of word-error rate (WER).

Published
2004
Pages
437–440
Proceedings
Proc. 8th International Conference on Spoken Language Processing
Volume
2004
Number
10
Conference
8th International Conference on Spoken Language Processing
Publisher
Sunjin Printing Co,
Place
Jeju Island
BibTeX
@inproceedings{BUT17131,
  author="Martin {Karafiát} and František {Grézl} and Jan {Černocký}",
  title="TRAP based features for LVCSR of meeting data",
  booktitle="Proc. 8th International Conference on Spoken Language Processing",
  year="2004",
  volume="2004",
  number="10",
  pages="437--440",
  publisher="Sunjin Printing Co,",
  address="Jeju Island",
  url="http://www.fit.vutbr.cz/~karafiat/publi/2004/karafiat_icslp2004.pdf"
}
Projects
Augmented Multi-party Interaction, EU, Sixth Framework programme, 506811-AMI, start: 2004-01-01, end: 2006-12-31, completed
Data driven and anthropic coding and recognition of speech, GACR, Postdoktorandské granty, GP102/02/D108, start: 2002-09-01, end: 2005-08-30, completed
Research groups
Departments
Back to top