Faculty of Information Technology, BUT

Publication Details

Automatic Language Identification Using Deep Neural Networks

LOPEZ-MORENO Ignacio, GONZALEZ-DOMINGUEZ Javier, MARTÍNEZ González David, PLCHOT Oldřich, GONZALEZ-RODRIGUEZ Joaquin and MORENO Pedro. Automatic Language Identification Using Deep Neural Networks. In: Proceeding of ICASSP 2014. Florence: IEEE Signal Processing Society, 2014, pp. 5374-5378. ISBN 978-1-4799-2892-7.
Czech title
Automatická identifikace mluvčího pomocí hlubokých neuronových sítí
Type
conference paper
Language
English
Authors
Lopez-Moreno Ignacio (Google)
Gonzalez-Dominguez Javier (Google)
Martínez González David (UNIZAR)
Plchot Oldřich, Ing., Ph.D. (DCGM FIT BUT)
Gonzalez-Rodriguez Joaquin (UAM)
Moreno Pedro (Google)
Keywords
Automatic Language Identification, ivectors, DNNs
Abstract
In this work, we experimented with the use of deep neural networks (DNNs) for automatic language identification (LID). Guided by the success of DNNs in acoustic modelling, we explored their capability to learn discriminative language information from speech signals.
Annotation
This work studies the use of deep neural networks (DNNs) to address automatic language identification (LID). Motivated by their recent success in acoustic modelling, we adapt DNNs to the problem of identifying the language of a given spoken utterance from short-term acoustic features. The proposed approach is compared to state-of-the-art i-vector based acoustic systems on two different datasets: the Google 5M LID corpus and NIST LRE 2009. Results show that LID can benefit substantially from using DNNs, especially when a large amount of training data is available. We found relative improvements of up to 70% in Cavg over the baseline system.
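The "relative improvement" quoted above is, under the usual reading for a cost metric where lower is better, the reduction in Cavg expressed as a fraction of the baseline value. The following is a minimal sketch of that arithmetic only; the function name and the numeric values are hypothetical and are not results taken from the paper.

```python
def relative_improvement(baseline_cost: float, system_cost: float) -> float:
    """Relative reduction of a lower-is-better cost, as a fraction of the baseline."""
    if baseline_cost <= 0:
        raise ValueError("baseline_cost must be positive")
    return (baseline_cost - system_cost) / baseline_cost


# Hypothetical Cavg values, chosen only to illustrate the calculation.
baseline_cavg = 0.10  # assumed baseline (e.g. an i-vector system)
dnn_cavg = 0.03       # assumed DNN system

print(f"Relative improvement: {relative_improvement(baseline_cavg, dnn_cavg):.0%}")
```

Read this way, a 70% relative improvement means the DNN system's Cavg is 30% of the baseline's, not that Cavg dropped by 70 absolute percentage points.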
Published
2014
Pages
5374-5378
Proceedings
Proceeding of ICASSP 2014
Conference
The 39th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, IT
ISBN
978-1-4799-2892-7
Publisher
IEEE Signal Processing Society
Place
Florence, IT
DOI
10.1109/ICASSP.2014.6854622
BibTeX
@INPROCEEDINGS{FITPUB10562,
   author = "Ignacio Lopez-Moreno and Javier Gonzalez-Dominguez and David Mart\'{i}nez Gonz\'{a}lez and Old\v{r}ich Plchot and Joaquin Gonzalez-Rodriguez and Pedro Moreno",
   title = "Automatic Language Identification Using Deep Neural Networks",
   pages = "5374--5378",
   booktitle = "Proceeding of ICASSP 2014",
   year = 2014,
   location = "Florence, IT",
   publisher = "IEEE Signal Processing Society",
   ISBN = "978-1-4799-2892-7",
   doi = "10.1109/ICASSP.2014.6854622",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/10562"
}