Result Details

Alternative Approaches to Neural Network based Speaker Verification

SILNOVA, A.; BURGET, L.; ČERNOCKÝ, J. Alternative Approaches to Neural Network based Speaker Verification. In Proceedings of Interspeech 2017. Proceedings of Interspeech. Stockholm: International Speech Communication Association, 2017. no. 08, p. 1572-1575. ISSN: 1990-9772.
Type
conference paper
Language
English
Authors
Abstract

This paper describes experiment with the standard ivector/PLDA system trained on the different NN based features.The results are reported on female part of NIST SRE 2010, condition5 (English telephone data).

Keywords

automatic speaker recognition, deep neural networks,bottleneck features

URL
Annotation

Just like in other areas of automatic speech processing, feature extraction based on bottleneck neural networks was recently found very effective for the speaker verification task. However, better results are usually reported with more complex neural network architectures (e.g. stacked bottlenecks), which are difficult to reproduce. In this work, we experiment with the so called deep features, which are based on a simple feed-forward neural network architecture. We study various forms of applying deep features to i-vector/PDA based speaker verification. With proper settings, better verification performance can be obtained by means of this simple architecture as compared to the more elaborate bottleneck features. Also, we further experiment with multi-task training, where the neural network is trained for both speaker recognition and senone recognition objectives. Results indicate that, with a careful weighting of the two objectives, multi-task training can result in significantly better performing deep features.

Published
2017
Pages
1572–1575
Journal
Proceedings of Interspeech, vol. 2017, no. 08, ISSN 1990-9772
Proceedings
Proceedings of Interspeech 2017
Conference
Interspeech Conference
Publisher
International Speech Communication Association
Place
Stockholm
DOI
UT WoS
000457505000325
EID Scopus
BibTeX
@inproceedings{BUT144491,
  author="Anna {Silnova} and Lukáš {Burget} and Jan {Černocký}",
  title="Alternative Approaches to Neural Network based Speaker Verification",
  booktitle="Proceedings of Interspeech 2017",
  year="2017",
  journal="Proceedings of Interspeech",
  volume="2017",
  number="08",
  pages="1572--1575",
  publisher="International Speech Communication Association",
  address="Stockholm",
  doi="10.21437/Interspeech.2017-1062",
  issn="1990-9772",
  url="http://www.isca-speech.org/archive/Interspeech_2017/pdfs/1062.PDF"
}
Files
Projects
Big speech data analytics for contact centers, EU, Horizon 2020, start: 2015-01-01, end: 2017-12-31, completed
Improving Robustnes in Automatic Speaker Recognition, GACR, Juniorské granty, GJ17-23870Y, start: 2017-01-01, end: 2019-12-31, completed
Information mining in speech acquired by distant microphones, MV, Bezpečnostní výzkum České republiky 2015-2020, VI20152020025, start: 2015-10-01, end: 2020-09-30, completed
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Research groups
Departments
Back to top