Result Details

Speaker Verification with Application-Aware Beamforming

MOŠNER, L.; PLCHOT, O.; ROHDIN, J.; BURGET, L.; ČERNOCKÝ, J. Speaker Verification with Application-Aware Beamforming. In IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU). Sentosa, Singapore: IEEE Signal Processing Society, 2019. p. 411-418. ISBN: 978-1-7281-0306-8.
Type
conference paper
Language
English
Authors
Abstract

Multichannel speech processing applications usually employbeamformers as means of speech enhancement through spatialfiltering. Beamformers with learnable parameters requiretraining to minimize a loss function that is not necessarilycorrelated with the final objective. In this paper, we presenta framework employing recent neural network based generalizedeigenvalue beamformer and application-specific modelthat allows for optimization of beamformer w.r.t. target application.In our case, the application is speaker verificationwhich utilizes a speaker embedding (x-vector) extractorthat conveniently comes with desired loss. We show thatapplication-specific training of the beamformer brings performanceimprovements over a system trained in the standardway. We perform our analysis on the recently introducedVOiCES corpus which contains multichannel data and allowsus to modify the evaluation trials such that enrollment recordingsremain single-channel and test utterances are multichannel.

Keywords

Speaker verification, beamforming, xvector, generalized eigenvalue problem

URL
Published
2019
Pages
411–418
Proceedings
IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU)
Conference
2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019)
ISBN
978-1-7281-0306-8
Publisher
IEEE Signal Processing Society
Place
Sentosa, Singapore
DOI
UT WoS
000539883100055
EID Scopus
BibTeX
@inproceedings{BUT161476,
  author="Ladislav {Mošner} and Oldřich {Plchot} and Johan Andréas {Rohdin} and Lukáš {Burget} and Jan {Černocký}",
  title="Speaker Verification with Application-Aware Beamforming",
  booktitle="IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU)",
  year="2019",
  pages="411--418",
  publisher="IEEE Signal Processing Society",
  address="Sentosa, Singapore",
  doi="10.1109/ASRU46091.2019.9003932",
  isbn="978-1-7281-0306-8",
  url="https://www.fit.vut.cz/research/publication/12152/"
}
Files
Projects
Information mining in speech acquired by distant microphones, MV, Bezpečnostní výzkum České republiky 2015-2020, VI20152020025, start: 2015-10-01, end: 2020-09-30, completed
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Research groups
Departments
Back to top