Result detail

A Region-specific Feature-space Transformation for Speaker Adaptation and Singularity Analysis of Jacobian Matrix

RATH, S.; BURGET, L.; KARAFIÁT, M.; GLEMBEK, O.; ČERNOCKÝ, J. A Region-specific Feature-space Transformation for Speaker Adaptation and Singularity Analysis of Jacobian Matrix. Proceedings of Interspeech 2013. Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013). Lyon: International Speech Communication Association, 2013. no. 8, p. 1228-1232. ISBN: 978-1-62993-443-3. ISSN: 2308-457X.
Type
conference proceedings paper
Language
English
Authors
Rath Shakti Prasad
Burget Lukáš, doc. Ing., Ph.D., UPGM (FIT)
Karafiát Martin, Ing., Ph.D., UPGM (FIT)
Glembek Ondřej, Ing., Ph.D., UPGM (FIT)
Černocký Jan, prof. Dr. Ing., UPGM (FIT)
Abstract

This paper describes the difficulties associated with soft R-FMLLR. An analysis of the Jacobian matrix leads to the conclusion that the transformation is most likely to be non-invertible, in which case ML estimation adversely affects performance. A new transformation, hard R-FMLLR, is presented. It is shown that the proposed method performs better than soft R-FMLLR and is computationally more efficient.

Keywords

speaker recognition, speaker adaptation, feature-space transformation, speech recognition

URL
http://www.fit.vutbr.cz/research/groups/speech/publi/2013/rath_interspeech2013_IS130146.pdf
Year
2013
Pages
1228–1232
Journal
Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013), no. 8, ISSN 2308-457X
Proceedings
Proceedings of Interspeech 2013
Conference
Interspeech Conference
ISBN
978-1-62993-443-3
Publisher
International Speech Communication Association
Place
Lyon
BibTeX
@inproceedings{BUT103551,
  author="Shakti Prasad {Rath} and Lukáš {Burget} and Martin {Karafiát} and Ondřej {Glembek} and Jan {Černocký}",
  title="A Region-specific Feature-space Transformation for Speaker Adaptation and Singularity Analysis of Jacobian Matrix",
  booktitle="Proceedings of Interspeech 2013",
  year="2013",
  journal="Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013)",
  number="8",
  pages="1228--1232",
  publisher="International Speech Communication Association",
  address="Lyon",
  isbn="978-1-62993-443-3",
  issn="2308-457X",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2013/rath_interspeech2013_IS130146.pdf"
}
Projects
Discriminative training of speaker-normalized models for automatic speech recognition, EU, Seventh Research Framework Programme, SIGA890, start: 2011-01-07, end: 2013-01-07, completed
Speech processing technologies for efficient human-computer communication, TAČR, ALFA Programme of Applied Research and Experimental Development, TA01011328, start: 2011-01-01, end: 2014-12-31, completed