Result Details

BUT/Phonexia Bottleneck Feature Extractor

SILNOVA, A.; MATĚJKA, P.; GLEMBEK, O.; PLCHOT, O.; NOVOTNÝ, O.; GRÉZL, F.; SCHWARZ, P.; ČERNOCKÝ, J. BUT/Phonexia Bottleneck Feature Extractor. In Proceedings of Odyssey 2018. Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland. Les Sables d´Olonne: International Speech Communication Association, 2018. no. 6, p. 283-287. ISSN: 2312-2846.
Type
conference paper
Language
English
Authors
Silnova Anna, M.Sc., Ph.D., DCGM (FIT)
Matějka Pavel, Ing., Ph.D., DCGM (FIT)
Glembek Ondřej, Ing., Ph.D., DCGM (FIT)
Plchot Oldřich, Ing., Ph.D., DCGM (FIT)
Novotný Ondřej, Ing., Ph.D., DCGM (FIT)
Grézl František, Ing., Ph.D., DCGM (FIT)
Schwarz Petr, Ing., Ph.D., DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Abstract

This paper complements the public release of theBUT/Phonexia bottleneck (BN) feature extractor. Startingwith a brief history of Neural Network (NN)-based andBN-based approaches to speech feature extraction, it describesthe structure of the released software. It follows by describingthe three provided NNs: the first two trained on the US EnglishFisher corpus with monophone-state and tied-state targets,and the third network trained in a multi-lingual fashion on17 Babel languages. The NNs were technically trained toclassify acoustic units, however the networks were optimizedwith respect to the language recognition task, which is themain focus of this paper. Nevertheless, it is worth noting thatapart from language recognition, the provided software can beused with any speech-related task. The paper concludes with acomprehensive summary of the results obtained on the NIST2015 and 2017 Language Recognition Evaluations tasks.

Keywords

bottlneck feature extractor, speech recognition, language recognition

URL
Published
2018
Pages
283–287
Journal
Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland, vol. 2018, no. 6, ISSN 2312-2846
Proceedings
Proceedings of Odyssey 2018
Conference
Odyssey 2018
Publisher
International Speech Communication Association
Place
Les Sables d´Olonne
DOI
EID Scopus
BibTeX
@inproceedings{BUT155076,
  author="Anna {Silnova} and Pavel {Matějka} and Ondřej {Glembek} and Oldřich {Plchot} and Ondřej {Novotný} and František {Grézl} and Petr {Schwarz} and Jan {Černocký}",
  title="BUT/Phonexia Bottleneck Feature Extractor",
  booktitle="Proceedings of Odyssey 2018",
  year="2018",
  journal="Proceedings of Odyssey: The Speaker and Language Recognition Workshop Odyssey 2014, Joensuu, Finland",
  volume="2018",
  number="6",
  pages="283--287",
  publisher="International Speech Communication Association",
  address="Les Sables d´Olonne",
  doi="10.21437/Odyssey.2018-40",
  issn="2312-2846",
  url="https://www.fit.vut.cz/research/publication/11789/"
}
Files
Projects
Improving Robustnes in Automatic Speaker Recognition, GACR, Juniorské granty, GJ17-23870Y, start: 2017-01-01, end: 2019-12-31, completed
Information mining in speech acquired by distant microphones, MV, Bezpečnostní výzkum České republiky 2015-2020, VI20152020025, start: 2015-10-01, end: 2020-09-30, completed
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Neural networks for signal processing and speech data mining, TAČR, Program na podporu aplikovaného výzkumu ZÉTA, TJ01000208, start: 2018-01-01, end: 2019-12-31, completed
Research groups
Departments
Back to top