Detail výsledku

Variational Inference for Acoustic Unit Discovery

ONDEL YANG, L.; BURGET, L.; ČERNOCKÝ, J. Variational Inference for Acoustic Unit Discovery. In Procedia Computer Science. Procedia Computer Science. Yogyakarta: Elsevier Science, 2016. no. 81, p. 80-86. ISSN: 1877-0509.
Typ
článek ve sborníku konference
Jazyk
anglicky
Autoři
Abstrakt

In this article we proposed to train a nonparametric Bayesian model for automatic units discovery within the Variational Bayesframework. Besides simplifying the training scheme, this approach proves to be fast and yields better solution whichmakes it more suitable for big databases. However, despite the improvement observed, the model still have difficultieswith the diversity of speech and tends to learn a large part of unwanted variability. The HMM model for speechsegment is convenient but unrealistic and most likely, stronger model will be needed if one wants to achieve accurate automatic units discovery. We plan to extent the present work by using the VB inference with more complex models, as in13, and to gain leverage of Bayesian language models14 to further improve the accuracy of the discovered units.

Klíčová slova

Bayesian non-parametric, Variational Bayes, acoustic unit discovery

URL
Rok
2016
Strany
80–86
Časopis
Procedia Computer Science, roč. 2016, č. 81, ISSN 1877-0509
Sborník
Procedia Computer Science
Konference
The 5th International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU'16)
Vydavatel
Elsevier Science
Místo
Yogyakarta
DOI
UT WoS
000387446500011
EID Scopus
BibTeX
@inproceedings{BUT131006,
  author="Lucas Antoine Francois {Ondel} and Lukáš {Burget} and Jan {Černocký}",
  title="Variational Inference for Acoustic Unit Discovery",
  booktitle="Procedia Computer Science",
  year="2016",
  journal="Procedia Computer Science",
  volume="2016",
  number="81",
  pages="80--86",
  publisher="Elsevier Science",
  address="Yogyakarta",
  doi="10.1016/j.procs.2016.04.033",
  issn="1877-0509",
  url="http://www.sciencedirect.com/science/article/pii/S1877050916300473"
}
Soubory
Projekty
Analytika velkých řečových dat pro kontaktní centra, EU, Horizon 2020, zahájení: 2015-01-01, ukončení: 2017-12-31, ukončen
Meeting assistant (MINT), TAČR, Program aplikovaného výzkumu a experimentálního vývoje ALFA, TA04011311, zahájení: 2014-10-01, ukončení: 2017-12-31, ukončen
Zpracování, rozpoznávání a zobrazování multimediálních a 3D dat, VUT, Vnitřní projekty VUT, FIT-S-14-2506, zahájení: 2014-01-01, ukončení: 2016-12-31, ukončen
Výzkumné skupiny
Pracoviště
Nahoru