Result details

Dereverberation and Beamforming in Far-Field Speaker Recognition

MOŠNER, L.; MATĚJKA, P.; NOVOTNÝ, O.; ČERNOCKÝ, J. Dereverberation and Beamforming in Far-Field Speaker Recognition. In Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018. p. 5254-5258. ISBN: 978-1-5386-4658-8.

Type

paper in conference proceedings

Language

English

Authors

Mošner Ladislav, Ing., UPGM (FIT)
Matějka Pavel, Ing., Ph.D., UPGM (FIT)
Novotný Ondřej, Ing., Ph.D., UPGM (FIT)
Černocký Jan, prof. Dr. Ing., UPGM (FIT)

Abstract

This paper deals with far-field speaker recognition. On a corpus of NIST SRE 2010 data retransmitted in a real room with multiple microphones, we first demonstrate how room acoustics cause significant degradation of a state-of-the-art i-vector-based speaker recognition system. We then investigate several techniques to improve the performance, ranging from probabilistic linear discriminant analysis (PLDA) re-training, through dereverberation, to beamforming. We found that weighted prediction error (WPE) based dereverberation combined with a generalized eigenvalue beamformer with power spectral density (PSD) weighting masks generated by neural networks (NN) provides results approaching the clean close-microphone setup. Further improvement was obtained by re-training PLDA or the mask-generating NNs on simulated target data. The work shows that a speaker recognition system working robustly in the far-field scenario can be developed.

Keywords

Speaker recognition, microphone array, beamforming, dereverberation, audio retransmission

URL

https://www.fit.vut.cz/research/group/speech/public/publi/2018/mosner… PDF

Year

2018

Pages

5254–5258

Proceedings

Proceedings of ICASSP 2018

Conference

IEEE International Conference on Acoustics, Speech and Signal Processing

ISBN

978-1-5386-4658-8

Publisher

IEEE Signal Processing Society

Place

Calgary

DOI

10.1109/ICASSP.2018.8462365

UT WoS

000446384605085

EID Scopus

2-s2.0-85054214985

BibTeX

@inproceedings{BUT155039,
  author="Ladislav {Mošner} and Pavel {Matějka} and Ondřej {Novotný} and Jan {Černocký}",
  title="Dereverberation and Beamforming in Far-Field Speaker Recognition",
  booktitle="Proceedings of ICASSP 2018",
  year="2018",
  pages="5254--5258",
  publisher="IEEE Signal Processing Society",
  address="Calgary",
  doi="10.1109/ICASSP.2018.8462365",
  isbn="978-1-5386-4658-8",
  url="https://www.fit.vut.cz/research/publication/11717/"
}

Files

pdf mosner_icassp2018_0005254.pdf 212 kB

Projects

Dolování infoRmAcí z řeči Pořízené vzdÁlenými miKrofony, MV, Security Research of the Czech Republic 2015-2020, VI20152020025, start: 2015-10-01, end: 2020-09-30, completed
IT4Innovations excellence in science, MŠMT, National Sustainability Programme II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Zvýšení spolehlivosti v automatickém rozpoznávání řečníka, GAČR, Junior Grants, GJ17-23870Y, start: 2017-01-01, end: 2019-12-31, completed

Department

Department of Computer Graphics and Multimedia (UPGM)