Result detail

Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio

ZHANG, L.; WANG, X.; COOPER, E.; DIEZ SÁNCHEZ, M.; LANDINI, F.; EVANS, N.; YAMAGISHI, J. Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio. In Proceedings of Interspeech 2024. Proceedings of Interspeech. Kos: International Speech Communication Association, 2024. no. 9, p. 502-506. ISSN: 1990-9772.
Type
conference proceedings paper
Language
English
Authors
ZHANG, L.
WANG, X.
COOPER, E.
DIEZ SÁNCHEZ, M.
Landini Federico Nicolás, Ph.D., UPGM (FIT)
EVANS, N.
YAMAGISHI, J.
Abstract

This paper defines Spoof Diarization as a novel task in the Partial Spoof (PS) scenario. It aims to determine what spoofed when, which includes not only locating spoof regions but also clustering them according to different spoofing methods. As a pioneering study in spoof diarization, we focus on defining the task, establishing evaluation metrics, and proposing a benchmark model, namely the Countermeasure-Condition Clustering (3C) model. Utilizing this model, we first explore how to effectively train countermeasures to support spoof diarization using three labeling schemes. We then utilize spoof localization predictions to enhance the diarization performance. This first study reveals the high complexity of the task, even in restricted scenarios where only a single speaker per audio file and an oracle number of spoofing methods are considered. Our code is available at https://github.com/nii-yamagishilab/PartialSpoof.

Keywords

partial spoof, spoof diarization, countermeasure, clustering

Year
2024
Pages
502–506
Journal
Proceedings of Interspeech, vol. 2024, no. 9, ISSN 1990-9772
Proceedings
Proceedings of Interspeech 2024
Conference
Interspeech Conference
Publisher
International Speech Communication Association
Place
Kos
BibTeX
@inproceedings{BUT193676,
  author="ZHANG, L. and WANG, X. and COOPER, E. and DIEZ SÁNCHEZ, M. and LANDINI, F. and EVANS, N. and YAMAGISHI, J.",
  title="Spoof Diarization: {"}What Spoofed When{"} in Partially Spoofed Audio",
  booktitle="Proceedings of Interspeech 2024",
  year="2024",
  journal="Proceedings of Interspeech",
  volume="2024",
  number="9",
  pages="502--506",
  publisher="International Speech Communication Association",
  address="Kos",
  doi="10.21437/Interspeech.2024-1365",
  issn="1990-9772",
  url="https://www.isca-archive.org/interspeech_2024/zhang24j_interspeech.pdf"
}
Projects
Robust processing of recordings for operations and security, MV, Programme Strategic Support for the Development of Security Research in the Czech Republic 2019-2025 (IMPAKT 1), Sub-programme 1 Joint Research Projects (BV IMP1/1VS), VJ01010108, start: 2020-10-01, end: 2025-09-30, completed