Detail výsledku

A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery

YUSUF, B.; ONDEL YANG, L.; BURGET, L.; ČERNOCKÝ, J.; SARAÇLAR, M. A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Toronto, Ontario: IEEE Signal Processing Society, 2021. p. 3710-3714. ISBN: 978-1-7281-7605-5.
Typ
článek ve sborníku konference
Jazyk
anglicky
Autoři
Yusuf Bolaji, UPGM (FIT)
ONDEL YANG, L.
Burget Lukáš, doc. Ing., Ph.D., UPGM (FIT)
Černocký Jan, prof. Dr. Ing., UPGM (FIT)
SARAÇLAR, M.
Abstrakt

In this work, we propose a hierarchical subspace model for acousticunit discovery. In this approach, we frame the task as one oflearning embeddings on a low-dimensional phonetic subspace, andsimultaneously specify the subspace itself as an embedding on a hyper-subspace. We train the hyper-subspace on a set of transcribedlanguages and transfer it to the target language. In the target language,we infer both the language and unit embeddings in an unsupervisedmanner, and in so doing, we simultaneously learn a subspaceof units specific to that language and the units that dwell on it.We conduct experiments on TIMIT and two low-resource languages:Mboshi and Yoruba. Results show that our model outperforms majoracoustic unit discovery techniques, both in terms of clusteringquality and segmentation accuracy.

Klíčová slova

acoustic unit discovery, hierarchical subspacemodel, unsupervised learning

URL
Rok
2021
Strany
3710–3714
Sborník
ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Konference
2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
ISBN
978-1-7281-7605-5
Vydavatel
IEEE Signal Processing Society
Místo
Toronto, Ontario
DOI
UT WoS
000704288403193
EID Scopus
BibTeX
@inproceedings{BUT175792,
  author="YUSUF, B. and ONDEL YANG, L. and BURGET, L. and ČERNOCKÝ, J. and SARAÇLAR, M.",
  title="A Hierarchical Subspace Model for Language-Attuned Acoustic Unit Discovery",
  booktitle="ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)",
  year="2021",
  pages="3710--3714",
  publisher="IEEE Signal Processing Society",
  address="Toronto, Ontario",
  doi="10.1109/ICASSP39728.2021.9414899",
  isbn="978-1-7281-7605-5",
  url="https://www.fit.vut.cz/research/publication/12523/"
}
Soubory
Projekty
Multi-lingualita v řečových technologiích, MŠMT, INTER-EXCELLENCE - Podprogram INTER-ACTION, LTAIN19087, zahájení: 2020-01-01, ukončení: 2023-08-31, ukončen
Neuronové reprezentace v multimodálním a mnohojazyčném modelování, GAČR, Grantové projekty exelence v základním výzkumu EXPRO - 2019, GX19-26934X, zahájení: 2019-01-01, ukončení: 2023-12-31, ukončen
Vícenásobné služby inteligentního konverzačního agenta pro přijetí, řízení a integraci občanů třetích zemí v EU, EU, Horizon 2020, zahájení: 2020-02-01, ukončení: 2023-04-30, ukončen
Výzkumné skupiny
Pracoviště
Nahoru