Result Details

Brno University of Technology System for Interspeech 2010 Paralinguistic Challenge

KOCKMANN, M.; BURGET, L.; ČERNOCKÝ, J. Brno University of Technology System for Interspeech 2010 Paralinguistic Challenge. Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Proceedings of Interspeech. Makuhari, Chiba: International Speech Communication Association, 2010. no. 9, p. 2822-2825. ISBN: 978-1-61782-123-3. ISSN: 1990-9772.
Type
conference paper
Language
English
Authors
Kockmann Marcel, Dipl.-Ing., Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Abstract

This paper describes systems for the Age- and Gender-Sub-Challenges employ fusions of several sub-systems. We make use of our own acoustic frame-based feature sets

Keywords

Age- and gender-recognition, GMM, MMI, eigenvoice, fusion

URL
Annotation

This paper describes Brno University of Technology (BUT) system for the Interspeech 2010 Paralinguistic Challenge. Our submitted systems for the Age- and Gender-Sub-Challenges employ fusions of several sub-systems. We make use of our own acoustic frame-based feature sets, as well as the provided utterance-based acoustic, prosodic and voice quality features. Modeling is based on Gaussian Mixture Models (GMM) and Support Vector Machines (SVM), followed by linear Gaussian backends and logistic regression-based fusion. For a single subsystem, we obtain improvement of about 2% absolute, for both tasks, on the development-set. Our final fusion results in nearly 9% absolute improvement for the Age task and about 4.5% for the Gender task on the development set. On the final test set we obtain 3.5% and 2% absolute improvement, respectively.

Published
2010
Pages
2822–2825
Journal
Proceedings of Interspeech, vol. 2010, no. 9, ISSN 1990-9772
Proceedings
Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010)
Conference
Interspeech Conference, Tokyo, JP
ISBN
978-1-61782-123-3
Publisher
International Speech Communication Association
Place
Makuhari, Chiba
BibTeX
@inproceedings{BUT35025,
  author="Marcel {Kockmann} and Lukáš {Burget} and Jan {Černocký}",
  title="Brno University of Technology System for Interspeech 2010 Paralinguistic Challenge",
  booktitle="Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010)",
  year="2010",
  journal="Proceedings of Interspeech",
  volume="2010",
  number="9",
  pages="2822--2825",
  publisher="International Speech Communication Association",
  address="Makuhari, Chiba",
  isbn="978-1-61782-123-3",
  issn="1990-9772",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2010/kockmann_interspeech2010_IS100567.pdf"
}
Projects
Mobile Biometry, MŠMT, Podpora projektů sedmého rámcového programu Evropského společenství pro výzkum, technologický rozvoj a demonstrace (2007 až 2013) podle zákona č. 171/2007 Sb., 7E08042, start: 2008-01-01, end: 2010-12-31, completed
Recognition and presentation of multimedia data, BUT, Vnitřní projekty VUT, FIT-S-10-2, 2010, start: 2010-04-01, end: 2010-12-31, completed
Security-Oriented Research in Information Technology, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running
Speech Recognition under Real-World Conditions, GACR, Standardní projekty, GA102/08/0707, start: 2008-01-01, end: 2011-12-31, completed
Research groups
Departments
Back to top