Result Details

Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system

JANČÍK, Z.; PLCHOT, O.; BRUMMER, J.; BURGET, L.; GLEMBEK, O.; HUBEIKA, V.; KARAFIÁT, M.; MATĚJKA, P.; MIKOLOV, T.; STRASHEIM, A.; ČERNOCKÝ, J. Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system. In Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop. Brno: International Speech Communication Association, 2010. p. 215-221. ISBN: 978-80-214-4114-9.
Type
conference paper
Language
English
Authors
Jančík Zdeněk, Ing.
Plchot Oldřich, Ing., Ph.D., DCGM (FIT)
Brummer Johan Nikolaas Langenhoven, Dr.
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Glembek Ondřej, Ing., Ph.D., DCGM (FIT)
Hubeika Valiantsina, Ing., DCGM (FIT)
Karafiát Martin, Ing., Ph.D., DCGM (FIT)
Matějka Pavel, Ing., Ph.D., DCGM (FIT)
Mikolov Tomáš, Ing., Ph.D., DCGM (FIT)
Strasheim Albert
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Abstract

This paper addresses data selection and calibration issues in automatic language recognition, based on an investigation with the BUT-AGNITIO NIST LRE 2009 system.

Keywords

speech, automatic, language, recognition, evaluation

URL
http://www.fit.vutbr.cz/research/groups/speech/publi/2010/jancik_odys2010.pdf
Annotation

This paper summarizes the BUT-AGNITIO system for the NIST Language Recognition Evaluation 2009. The post-evaluation analysis aimed mainly at improving the quality of the data (fixing language label problems and detecting overlapping speakers in the training and development sets) and at investigating different compositions of the development set. The paper further investigates the JFA-based acoustic system and reports results for new SVM-PCA systems that go beyond the original BUT-AGNITIO NIST LRE 2009 submission. All results are reported on the evaluation data of the NIST LRE 2009 task.

Published
2010
Pages
215–221
Proceedings
Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop
Conference
The Speaker and Language Recognition Workshop
ISBN
978-80-214-4114-9
Publisher
International Speech Communication Association
Place
Brno
BibTeX
@inproceedings{BUT34924,
  author="Zdeněk {Jančík} and Oldřich {Plchot} and Johan Nikolaas Langenhoven {Brummer} and Lukáš {Burget} and Ondřej {Glembek} and Valiantsina {Hubeika} and Martin {Karafiát} and Pavel {Matějka} and Tomáš {Mikolov} and Albert {Strasheim} and Jan {Černocký}",
  title="Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system",
  booktitle="Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop",
  year="2010",
  pages="215--221",
  publisher="International Speech Communication Association",
  address="Brno",
  isbn="978-80-214-4114-9",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2010/jancik_odys2010.pdf"
}
Projects
EOARD - Improving the capacity of language recognition systems to handle rare languages using radio broadcast data, start: 2008-10-15, end: 2010-12-14, completed
Mobile Biometry, MŠMT, Support for projects of the Seventh Framework Programme of the European Community for research, technological development and demonstration activities (2007 to 2013) under Act No. 171/2007 Coll., 7E08042, start: 2008-01-01, end: 2010-12-31, completed
Security-Oriented Research in Information Technology, MŠMT, Institutional funds from the state budget of the Czech Republic (e.g. research plans, research centres), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running
Speech Recognition under Real-World Conditions, GACR, Standard projects, GA102/08/0707, start: 2008-01-01, end: 2011-12-31, completed