Result Details

Discriminative Classifiers for Phonotactic Language Recognition with iVectors

SOUFIFAR, M.; CUMANI, S.; BURGET, L.; ČERNOCKÝ, J. Discriminative Classifiers for Phonotactic Language Recognition with iVectors. Proc. International Conference on Acoustics, Speech, and Signal Processing 2012. Kyoto: IEEE Signal Processing Society, 2012. p. 4853-4856. ISBN: 978-1-4673-0044-5.
Type
conference paper
Language
English
Authors
Soufifar Mehdi Mohammad, Ing.
Cumani Sandro, Ph.D.
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Abstract

The paper addresses phonotactic language recognition, where models based on bag-of-n-grams representations and discriminative classifiers are a popular approach.

Keywords

Phonotactic iVector, Discriminative classifier, Support vector machine, Logistic regression

URL
http://www.fit.vutbr.cz/research/groups/speech/publi/2012/soufifar_icassp2012_0004853.pdf
Annotation

Phonotactic models based on bag-of-n-grams representations and discriminative classifiers are a popular approach to the language recognition problem. However, the large size of the n-gram count vectors makes discriminative classifiers difficult to train. The Subspace Multinomial Model was recently proposed to represent the information contained in the n-grams effectively using low-dimensional iVectors. The availability of a low-dimensional feature vector makes it possible to investigate different post-processing techniques and different classifiers to improve recognition performance. In this work, we analyze a set of discriminative classifiers based on Support Vector Machines and Logistic Regression, and we propose an iVector post-processing technique that improves recognition performance. The proposed systems are evaluated on the NIST LRE 2009 task.
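
As a rough illustration of the bag-of-n-grams representation mentioned in the annotation, the following Python sketch counts phoneme trigrams in a decoded utterance and lays them out as a dense vector. It is a minimal sketch, not the authors' pipeline: the toy phoneme inventory, the sample utterance, the use of hard counts from a single decoded string, and the helper names `ngram_counts` and `to_dense_vector` are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product


def ngram_counts(phonemes, n=3):
    """Count every length-n window of consecutive phoneme labels."""
    return Counter(
        tuple(phonemes[i:i + n]) for i in range(len(phonemes) - n + 1)
    )


def to_dense_vector(counts, inventory, n=3):
    """Lay the counts out over all len(inventory)**n possible n-grams."""
    return [counts.get(gram, 0) for gram in product(inventory, repeat=n)]


# Hypothetical toy phoneme inventory and decoded utterance (assumptions).
inventory = ["a", "b", "k"]
utterance = ["a", "b", "a", "b", "k"]

counts = ngram_counts(utterance, n=3)
vector = to_dense_vector(counts, inventory, n=3)

print(len(vector), sum(vector))
```

The vector length grows as the inventory size raised to the power n, so a hypothetical inventory of 50 phonemes would already give 125,000 trigram dimensions. This growth is the size problem that motivates a low-dimensional representation such as the iVectors described in the annotation.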

Published
2012
Pages
4853–4856
Proceedings
Proc. International Conference on Acoustics, Speech, and Signal Processing 2012
Conference
The 37th International Conference on Acoustics, Speech, and Signal Processing
ISBN
978-1-4673-0044-5
Publisher
IEEE Signal Processing Society
Place
Kyoto
DOI
10.1109/ICASSP.2012.6289006
BibTeX
@inproceedings{BUT91475,
  author="Mehdi Mohammad {Soufifar} and Sandro {Cumani} and Lukáš {Burget} and Jan {Černocký}",
  title="Discriminative Classifiers for Phonotactic Language Recognition with iVectors",
  booktitle="Proc. International Conference on Acoustics, Speech, and Signal Processing 2012",
  year="2012",
  pages="4853--4856",
  publisher="IEEE Signal Processing Society",
  address="Kyoto",
  doi="10.1109/ICASSP.2012.6289006",
  isbn="978-1-4673-0044-5",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2012/soufifar_icassp2012_0004853.pdf"
}
Projects
Security-Oriented Research in Information Technology, MŠMT, Institutional funds from the Czech state budget (e.g. research plans, research centres), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running
Technologies of speech processing for efficient human-machine communication, TAČR, ALFA Programme of Applied Research and Experimental Development, TA01011328, start: 2011-01-01, end: 2014-12-31, completed