Result Details
Brno University of Technology System for Interspeech 2010 Paralinguistic Challenge
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
This paper describes systems for the Age and Gender Sub-Challenges that employ fusions of several sub-systems. We make use of our own acoustic frame-based feature sets.
Age- and gender-recognition, GMM, MMI, eigenvoice, fusion
This paper describes the Brno University of Technology (BUT) system for the Interspeech 2010 Paralinguistic Challenge. Our submitted systems for the Age and Gender Sub-Challenges employ fusions of several sub-systems. We make use of our own acoustic frame-based feature sets, as well as the provided utterance-based acoustic, prosodic and voice quality features. Modeling is based on Gaussian Mixture Models (GMM) and Support Vector Machines (SVM), followed by linear Gaussian backends and logistic regression-based fusion. For a single subsystem, we obtain an improvement of about 2% absolute for both tasks on the development set. Our final fusion results in nearly 9% absolute improvement for the Age task and about 4.5% for the Gender task on the development set. On the final test set we obtain 3.5% and 2% absolute improvement, respectively.
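The abstract names logistic regression-based fusion of sub-system scores but gives no details here. As a rough illustration only, the following is a minimal sketch of a generic linear fusion, assuming each sub-system outputs one score per class and the fused score is a weighted sum plus a per-class offset, trained by gradient descent on the multiclass cross-entropy. All function names, the toy data, and the hyperparameters are hypothetical and are not taken from the paper.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of class scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse(sub_scores, alphas, betas):
    """Fused score per class: sum_k alphas[k] * sub_scores[k][c] + betas[c]."""
    return [
        sum(a * s[c] for a, s in zip(alphas, sub_scores)) + betas[c]
        for c in range(len(betas))
    ]


def mean_cross_entropy(data, labels, alphas, betas):
    """Average negative log posterior of the true class."""
    total = 0.0
    for sub_scores, y in zip(data, labels):
        total -= math.log(softmax(fuse(sub_scores, alphas, betas))[y])
    return total / len(data)


def train_fusion(data, labels, n_sub, n_classes, lr=0.05, epochs=300):
    """Batch gradient descent on the cross-entropy (assumed training recipe)."""
    alphas = [1.0 / n_sub] * n_sub      # start from an equal-weight average
    betas = [0.0] * n_classes
    n = len(data)
    for _ in range(epochs):
        g_a = [0.0] * n_sub
        g_b = [0.0] * n_classes
        for sub_scores, y in zip(data, labels):
            post = softmax(fuse(sub_scores, alphas, betas))
            for c in range(n_classes):
                err = post[c] - (1.0 if c == y else 0.0)
                g_b[c] += err
                for k in range(n_sub):
                    g_a[k] += err * sub_scores[k][c]
        alphas = [a - lr * g / n for a, g in zip(alphas, g_a)]
        betas = [b - lr * g / n for b, g in zip(betas, g_b)]
    return alphas, betas


def make_toy_data(n_trials=200, n_classes=3, noise=(1.0, 2.0), seed=0):
    """Synthetic scores: each sub-system adds 1.0 to the true class plus noise."""
    rng = random.Random(seed)
    data, labels = [], []
    for _ in range(n_trials):
        y = rng.randrange(n_classes)
        trial = []
        for sigma in noise:
            trial.append(
                [rng.gauss(0.0, sigma) + (1.0 if c == y else 0.0)
                 for c in range(n_classes)]
            )
        data.append(trial)
        labels.append(y)
    return data, labels


if __name__ == "__main__":
    data, labels = make_toy_data()
    alphas, betas = train_fusion(data, labels, n_sub=2, n_classes=3)
    print("fusion weights:", alphas)
    print("class offsets:", betas)
```

In this toy setup the second sub-system is noisier than the first, so the learned weights can be inspected to see how the fusion balances the two. In practice the fusion parameters would be trained on held-out development data rather than on the data used to train the sub-systems.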
@inproceedings{BUT35025,
author="Marcel {Kockmann} and Lukáš {Burget} and Jan {Černocký}",
title="Brno University of Technology System for Interspeech 2010 Paralinguistic Challenge",
booktitle="Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010)",
year="2010",
journal="Proceedings of Interspeech",
volume="2010",
number="9",
pages="2822--2825",
publisher="International Speech Communication Association",
address="Makuhari, Chiba",
isbn="978-1-61782-123-3",
issn="1990-9772",
url="http://www.fit.vutbr.cz/research/groups/speech/publi/2010/kockmann_interspeech2010_IS100567.pdf"
}
Recognition and presentation of multimedia data, BUT, BUT internal projects, FIT-S-10-2, 2010, start: 2010-04-01, end: 2010-12-31, completed
Security-Oriented Research in Information Technology, MŠMT, Institutional funds from the Czech state budget (e.g. research plans, research centres), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running
Speech Recognition under Real-World Conditions, GACR, Standard projects, GA102/08/0707, start: 2008-01-01, end: 2011-12-31, completed