Result Details

Analysis of the DNN-Based SRE Systems in Multi-language Conditions

NOVOTNÝ, O.; MATĚJKA, P.; GLEMBEK, O.; PLCHOT, O.; GRÉZL, F.; BURGET, L.; ČERNOCKÝ, J. Analysis of the DNN-Based SRE Systems in Multi-language Conditions. In Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016. p. 199-204. ISBN: 978-1-5090-4903-5.
Type
conference paper
Language
English
Authors
Novotný Ondřej, Ing., Ph.D., DCGM (FIT)
Matějka Pavel, Ing., Ph.D., DCGM (FIT)
Glembek Ondřej, Ing., Ph.D., DCGM (FIT)
Plchot Oldřich, Ing., Ph.D., DCGM (FIT)
Grézl František, Ing., Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Abstract

This paper analyzes the behavior of our state-of-the-art Deep Neural Network/i-vector/PLDA-based speaker recognition systems in multi-language conditions. On the "Language Pack" of the PRISM set, we evaluate the systems performance using the NISTs standard metrics. We show that not only the gain from using DNNs vanishes, nor using dedicated DNNs for target conditions helps, but also the DNN-based systems tend to produce de-calibrated scores under the studied conditions. This work gives suggestions for directions of future research rather than any particular solutions to these issues.

Keywords

DNN, Multi-Language, Speaker Recognition

URL
Annotation

In this work, we have studied the behavior of the DNN techniques in SRE i-vector/PLDA systems, currently considered to be state-ofthe- art, as evaluated on the most common NIST SRE English test sets, such as the NIST SRE 2010, condition 5.

Published
2016
Pages
199–204
Proceedings
Proceedings of SLT 2016
Conference
2016 IEEE Workshop on Spoken Language Technology
ISBN
978-1-5090-4903-5
Publisher
IEEE Signal Processing Society
Place
San Diego
DOI
UT WoS
000399128000029
EID Scopus
BibTeX
@inproceedings{BUT132603,
  author="Ondřej {Novotný} and Pavel {Matějka} and Ondřej {Glembek} and Oldřich {Plchot} and František {Grézl} and Lukáš {Burget} and Jan {Černocký}",
  title="Analysis of the DNN-Based SRE Systems in Multi-language Conditions",
  booktitle="Proceedings of SLT 2016",
  year="2016",
  pages="199--204",
  publisher="IEEE Signal Processing Society",
  address="San Diego",
  doi="10.1109/slt.2016.7846265",
  isbn="978-1-5090-4903-5",
  url="http://ieeexplore.ieee.org/document/7846265/"
}
Files
Projects
Big speech data analytics for contact centers, EU, Horizon 2020, start: 2015-01-01, end: 2017-12-31, completed
DARPA Robust Automatic Transcription of Speech (RATS) - RATS Patrol II, BBN, start: 2015-02-23, end: 2017-03-31, completed
IARPA Building Speech Recognition for Keyword Search in a New Language in a Week with Limited Training Data (BABEL) - Babelon, BBN, start: 2012-03-05, end: 2016-11-04, completed
Information mining in speech acquired by distant microphones, MV, Bezpečnostní výzkum České republiky 2015-2020, VI20152020025, start: 2015-10-01, end: 2020-09-30, completed
Research groups
Departments
Back to top