Result Details

Analysis of the ABC classification backends for NIST SRE24

CUMANI, S.; SILNOVA, A.; BARAHONA, S.; MOŠNER, L.; PLCHOT, O.; ROHDIN, J. Analysis of the ABC classification backends for NIST SRE24. In Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. Interspeech. Rotterdam: International Speech Communication Association, 2025. p. 3978-3982.

Type

conference paper

Language

English

Authors

Cumani Sandro
Silnova Anna, M.Sc., Ph.D., DCGM (FIT)
Barahona Sara
Mošner Ladislav, Ing., Ph.D., DCGM (FIT)
Plchot Oldřich, Ing., Ph.D., DCGM (FIT)
Rohdin Johan Andréas, M.Sc., Ph.D., FIT (FIT), DCGM (FIT)

Abstract

We present an analysis of the classification backends of the ABC submission for the audio tracks of the NIST 2024 Speaker Recognition Evaluation (SRE24). Our analysis covers embedding pre-processing, classification and score-level normalization, calibration and fusion strategies adopted to cope with the source, language and duration mismatch challenges of SRE24. We show that Pairwise Support Vector Machines provide the best results, which can be further improved, for single frontends, through score-level fusion of additional classifiers. We also show that condition-aware score calibration can mitigate the effects of source mismatch, whereas score normalization methods proved ineffective. Finally, we show that generative calibration is able to achieve competitive results with respect to other approaches.

Keywords

Classification backend | Pairwise Support Vector Machine | Score calibration | Speaker Recognition Evaluation | Speaker verification

URL

https://www.isca-archive.org/interspeech_2025/cumani25_interspeech.pdf

Published

2025

Pages

3978–3982

Journal

Interspeech, ISSN

Proceedings

Proceedings of the Annual Conference of the International Speech Communication Association Interspeech

Conference

Interspeech Conference

Publisher

International Speech Communication Association

Place

Rotterdam

DOI

10.21437/Interspeech.2025-146

EID Scopus

2-s2.0-105020055422

BibTeX

@inproceedings{BUT199933,
  author="{} and Anna {Silnova} and  {} and Ladislav {Mošner} and Oldřich {Plchot} and Johan Andréas {Rohdin}",
  title="Analysis of the ABC classification backends for NIST SRE24",
  booktitle="Proceedings of the Annual Conference of the International Speech Communication Association Interspeech",
  year="2025",
  journal="Interspeech",
  pages="3978--3982",
  publisher="International Speech Communication Association",
  address="Rotterdam",
  doi="10.21437/Interspeech.2025-146",
  url="https://www.isca-archive.org/interspeech_2025/cumani25_interspeech.pdf"
}

Projects

Linguistics, Artificial Intelligence and Language and Speech Technologies: from Research to Applications, EU, MEZISEKTOROVÁ SPOLUPRÁCE, EH23_020/0008518, start: 2025-01-01, end: 2028-12-31, running

Research groups

Výzkumná skupina dolování dat z řeči BUT Speech@FIT (RG SPEECH)

Departments

Ústav počítačové grafiky a multimédií (DCGM)
Výzkumná skupina dolování dat z řeči BUT Speech@FIT (RG SPEECH)