Publication details

Speaker Verification with Application-Aware Beamforming

MOŠNER, L.; PLCHOT, O.; ROHDIN, J.; BURGET, L.; ČERNOCKÝ, J. Speaker Verification with Application-Aware Beamforming. In IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU). Sentosa, Singapore: IEEE Signal Processing Society, 2019. p. 411-418. ISBN: 978-1-7281-0306-8.
Type
Conference proceedings paper
Language
English
Authors
Ladislav Mošner, Oldřich Plchot, Johan Andréas Rohdin, Lukáš Burget, Jan Černocký
Abstract

Multichannel speech processing applications usually employ beamformers as means of speech enhancement through spatial filtering. Beamformers with learnable parameters require training to minimize a loss function that is not necessarily correlated with the final objective. In this paper, we present a framework employing recent neural network based generalized eigenvalue beamformer and application-specific model that allows for optimization of beamformer w.r.t. target application. In our case, the application is speaker verification which utilizes a speaker embedding (x-vector) extractor that conveniently comes with desired loss. We show that application-specific training of the beamformer brings performance improvements over a system trained in the standard way. We perform our analysis on the recently introduced VOiCES corpus which contains multichannel data and allows us to modify the evaluation trials such that enrollment recordings remain single-channel and test utterances are multichannel.
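
For readers unfamiliar with the generalized eigenvalue (GEV) beamformer mentioned in the abstract, the following is a minimal NumPy sketch of the underlying max-SNR criterion for a single frequency bin. It is not the authors' implementation. It assumes the speech and noise spatial covariance matrices are already available (here they are synthetic, and the names `gev_beamformer`, `output_snr`, `phi_speech`, and `phi_noise` are illustrative), and it leaves out the neural network and the speaker verification loss that the paper trains against.

```python
import numpy as np


def gev_beamformer(phi_speech, phi_noise):
    """Return unit-norm max-SNR (GEV) weights for one frequency bin.

    Solves phi_speech @ w = lam * phi_noise @ w for the largest lam.
    Both inputs are (channels, channels) Hermitian matrices, and
    phi_noise is assumed to be positive definite.
    """
    # Whiten with the Cholesky factor of the noise covariance so the
    # generalized problem becomes an ordinary Hermitian eigenproblem.
    chol = np.linalg.cholesky(phi_noise)  # phi_noise = chol @ chol^H
    chol_inv = np.linalg.inv(chol)
    whitened = chol_inv @ phi_speech @ chol_inv.conj().T
    whitened = 0.5 * (whitened + whitened.conj().T)  # enforce Hermitian symmetry
    _, eigvecs = np.linalg.eigh(whitened)  # eigenvalues in ascending order
    w = chol_inv.conj().T @ eigvecs[:, -1]  # map back: w = chol^{-H} v
    return w / np.linalg.norm(w)


def output_snr(w, phi_speech, phi_noise):
    """Ratio of speech power to noise power at the beamformer output."""
    speech_power = np.real(w.conj() @ phi_speech @ w)
    noise_power = np.real(w.conj() @ phi_noise @ w)
    return speech_power / noise_power


# Synthetic example: 4 microphones, rank-1 speech, random full-rank noise.
rng = np.random.default_rng(0)
n_ch = 4
d = rng.standard_normal(n_ch) + 1j * rng.standard_normal(n_ch)
phi_speech = np.outer(d, d.conj())
a = rng.standard_normal((n_ch, n_ch)) + 1j * rng.standard_normal((n_ch, n_ch))
phi_noise = a @ a.conj().T + 0.1 * np.eye(n_ch)

w = gev_beamformer(phi_speech, phi_noise)
snr_gev = output_snr(w, phi_speech, phi_noise)
snr_ch0 = output_snr(np.eye(n_ch)[0], phi_speech, phi_noise)  # first mic only
print("GEV output SNR exceeds single-channel SNR:", snr_gev >= snr_ch0)
```

The weights maximize the output SNR by construction, so no other weight vector, including the selection of a single microphone, can score higher on the same pair of covariance matrices.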

Keywords

Speaker verification, beamforming, x-vector, generalized eigenvalue problem

Year
2019
Pages
411–418
Proceedings
IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU)
Conference
2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019)
ISBN
978-1-7281-0306-8
Publisher
IEEE Signal Processing Society
Location
Sentosa, Singapore
DOI
10.1109/ASRU46091.2019.9003932
UT WoS
000539883100055
BibTeX
@inproceedings{BUT161476,
  author="Ladislav {Mošner} and Oldřich {Plchot} and Johan Andréas {Rohdin} and Lukáš {Burget} and Jan {Černocký}",
  title="Speaker Verification with Application-Aware Beamforming",
  booktitle="IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU)",
  year="2019",
  pages="411--418",
  publisher="IEEE Signal Processing Society",
  address="Sentosa, Singapore",
  doi="10.1109/ASRU46091.2019.9003932",
  isbn="978-1-7281-0306-8",
  url="https://www.fit.vut.cz/research/publication/12152/"
}
Projects
Dolování infoRmAcí z řeči Pořízené vzdÁlenými miKrofony, MV, Bezpečnostní výzkum České republiky 2015-2020, VI20152020025, start: 2015-10-01, end: 2020-09-30, completed
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed