Faculty of Information Technology, BUT

Publication Details

Independent Component Analysis and MLLR Transforms for Speaker Identification

CUMANI Sandro, PLCHOT Oldřich and KARAFIÁT Martin. Independent Component Analysis and MLLR Transforms for Speaker Identification. In: Proc. International Conference on Acoustics, Speech, and Signal P. Kyoto: IEEE Signal Processing Society, 2012, pp. 4365-4368. ISBN 978-1-4673-0044-5.
Czech title
Analýza nezávislých komponent a MLLT transformace pro identifikaci řečníka
Type
conference paper
Language
english
Authors
Cumani Sandro (POLITO)
Plchot Oldřich, Ing., Ph.D. (DCGM FIT BUT)
Karafiát Martin, Ing., Ph.D. (DCGM FIT BUT)
URL
Keywords
Speaker Recognition, MLLR, ICA, PLDA, SVM
Abstract
This paper describes the use of of Independent Component Analysis (ICA) and Principal Component Analysis (PCA) techniques to reduce the dimensionality of high-level LVCSR features.
Annotation
In this paper, we explore the use of Independent Component Analysis (ICA) and Principal Component Analysis (PCA) techniques to reduce the dimensionality of high-level LVCSR features and at the same time to enable modelling them with state-of-the-art techniques like Probabilistic Linear Discriminant Analysis or Pairwise Support Vector Machines (PSVM). The high-level features are the coefficients from Constrained Maximum-Likelihood Linear Regression (CMLLR) and Maximum-Likelihood Linear Regression (MLLR) transforms estimated in an Automatic Speech Recognition (ASR) system. We also compare a classical approach of modeling every speaker by a single SVM classifier with the recent state-of-the-art modelling techniques in Speaker Identification. We report performance of the systems and score-level combination with a current state-of-the-art acoustic i-vector system on the NIST SRE2010 dataset.
Published
2012
Pages
4365-4368
Proceedings
Proc. International Conference on Acoustics, Speech, and Signal P
Conference
The 37th International Conference on Acoustics, Speech, and Signal Processing, Kyoto, JP
ISBN
978-1-4673-0044-5
Publisher
IEEE Signal Processing Society
Place
Kyoto, JP
DOI
BibTeX
@INPROCEEDINGS{FITPUB9941,
   author = "Sandro Cumani and Old\v{r}ich Plchot and Martin Karafi\'{a}t",
   title = "Independent Component Analysis and MLLR Transforms for Speaker Identification",
   pages = "4365--4368",
   booktitle = "Proc. International Conference on Acoustics, Speech, and Signal P",
   year = 2012,
   location = "Kyoto, JP",
   publisher = "IEEE Signal Processing Society",
   ISBN = "978-1-4673-0044-5",
   doi = "10.1109/ICASSP.2012.6288886",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/9941"
}
Back to top