Result Details

Using Smoothed Heteroscedastic Linear Discriminant Analysis in Large Vocabulary Continuous Speech Recognition System

KARAFIÁT, M.; BURGET, L.; ČERNOCKÝ, J. Using Smoothed Heteroscedastic Linear Discriminant Analysis in Large Vocabulary Continuous Speech Recognition System. 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms. tento článek nebyl zařazen mezi Revised Selected Papers, nevyšel v LNCS 3869. Edinbourgh, Scotland: University of Edinburgh, 2005. p. 1-8.
Type
conference paper
Language
English
Authors
Abstract

In this work, we verify that SHLDA can be advantageously used also for Large Vocabulary
Continuous Speech Recognition.

Keywords

speech recognition, LVCSR, HLDA, feature transform, dimensionality reduction

URL
Annotation

In the state-of-the-art speech recognition systems, Heteroscedastic Linear Discriminant Analysis (HLDA)
is becoming popular technique allowing for feature decorrelation and dimensionality reduction.
However, HLDA relies on statistics, which may not be reliably estimated when only limited amount of
training data is available. Recently, Smoothed HLDA (SHLDA) was proposed
as a robust modification of
HLDA. Previously, SHLDA was successfully used for feature combination in
small vocabulary recognition
experiments. In this work, we verify that SHLDA can be advantageously used also for Large Vocabulary
Continuous Speech Recognition.

Published
2005
Pages
1–8
Proceedings
2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms
Series
tento článek nebyl zařazen mezi Revised Selected Papers, nevyšel v LNCS 3869
Conference
2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms
Publisher
University of Edinburgh
Place
Edinbourgh, Scotland
BibTeX
@inproceedings{BUT18264,
  author="Martin {Karafiát} and Lukáš {Burget} and Jan {Černocký}",
  title="Using Smoothed Heteroscedastic Linear Discriminant Analysis in Large Vocabulary Continuous Speech Recognition System",
  booktitle="2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms",
  year="2005",
  series="tento článek nebyl zařazen mezi Revised Selected Papers, nevyšel v LNCS 3869",
  pages="1--8",
  publisher="University of Edinburgh",
  address="Edinbourgh, Scotland",
  url="https://www.fit.vutbr.cz/~karafiat/publi/2005/karafiat_mlmi2005.pdf"
}
Projects
Augmented Multi-party Interaction, EU, Sixth Framework programme, 506811-AMI, start: 2004-01-01, end: 2006-12-31, completed
New trends in research and application of voice technology, GACR, Standardní projekty, GA102/05/0278, start: 2005-01-01, end: 2007-12-31, completed
Research groups
Departments
Back to top