Detail výsledku

Toroidal Probabilistic Spherical Discriminant Analysis

SILNOVA, A.; BRUMMER, J.; SWART, A.; BURGET, L. Toroidal Probabilistic Spherical Discriminant Analysis. In Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023. p. 1-5. ISBN: 978-1-7281-6327-7.
Typ
článek ve sborníku konference
Jazyk
anglicky
Autoři
Silnova Anna, M.Sc., Ph.D., UPGM (FIT)
Brummer Johan Nikolaas Langenhoven, Dr., FIT (FIT)
Swart Albert du Preez
Burget Lukáš, doc. Ing., Ph.D., UPGM (FIT)
Abstrakt

n speaker recognition, where speech segments are mapped to
embeddings on the unit hypersphere, two scoring back-ends are
commonly used, namely cosine scoring and PLDA. We have
recently proposed PSDA, an analog to PLDA that uses Von
Mises-Fisher distributions instead of Gaussians. In this paper,
we present toroidal PSDA (T-PSDA). It extends PSDA with
the ability to model within and between-speaker variabilities
in toroidal submanifolds of the hypersphere. Like PLDA and
PSDA, the model allows closed-form scoring and closed-form
EM updates for training. On VoxCeleb, we find T-PSDA accu-
racy on par with cosine scoring, while PLDA accuracy is infe-
rior. On NIST SRE'21 we find that T-PSDA gives large accu-
racy gains compared to both cosine scoring and PLDA.

Klíčová slova

speaker recognition, PSDA, Von Mises-Fishe

URL
Rok
2023
Strany
1–5
Sborník
Proceedings of ICASSP 2023
Konference
2023 IEEE International Conference on Acoustics, Speech and Signal Processing IEEE
ISBN
978-1-7281-6327-7
Vydavatel
IEEE Signal Processing Society
Místo
Rhodes Island
DOI
EID Scopus
BibTeX
@inproceedings{BUT185199,
  author="Anna {Silnova} and Johan Nikolaas Langenhoven {Brummer} and Albert du Preez {Swart} and Lukáš {Burget}",
  title="Toroidal Probabilistic Spherical Discriminant Analysis",
  booktitle="Proceedings of ICASSP 2023",
  year="2023",
  pages="1--5",
  publisher="IEEE Signal Processing Society",
  address="Rhodes Island",
  doi="10.1109/ICASSP49357.2023.10095580",
  isbn="978-1-7281-6327-7",
  url="https://ieeexplore.ieee.org/document/10095580"
}
Soubory
Projekty
Neuronové reprezentace v multimodálním a mnohojazyčném modelování, GAČR, Grantové projekty exelence v základním výzkumu EXPRO - 2019, GX19-26934X, zahájení: 2019-01-01, ukončení: 2023-12-31, ukončen
Výměny pro výzkum řeči a technologií, EU, Horizon 2020, zahájení: 2021-01-01, ukončení: 2025-12-31, řešení
Výzkumné skupiny
Pracoviště
Nahoru