Publication Details

The AMIDA 2009 Meeting Transcription System

HAIN Thomas, BURGET Lukáš, DINES John, GARNER Phillip N., EL Hannani Asmaa, HUIJBREGTS Marijn, KARAFIÁT Martin, LINCOLN Mike and WAN Vincent. The AMIDA 2009 Meeting Transcription System. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, pp. 358-361. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Czech title
AMIDA 2009 systém pro rozpoznávání meetingů
Type
conference paper
Language
english
Authors
Hain Thomas (USF)
Burget Lukáš, doc. Ing., Ph.D. (DCGM FIT BUT)
Dines John (IDIAP)
Garner Phillip N. (IDIAP)
El Hannani Asmaa (USF)
Huijbregts Marijn (UTWENTE)
Karafiát Martin, Ing., Ph.D. (DCGM FIT BUT)
Lincoln Mike (IDIAP)
Wan Vincent (USF)
URL
Keywords

speech recognition, meeting transcription

Abstract

The paper is on systems for close-taking, far field and speaker attributed STT conditions. The system was used at participation in the NIST RT'2009 STT evaluations.

Annotation

We present the AMIDA 2009 system for participation in the NIST RT'2009 STT evaluations. Systems for close-talking, far field and speaker attributed STT conditions are described. Improvements to our previous systems are: segmentation and diarisation; stacked bottle-neck posterior feature extraction; fMPE training of acoustic models; adaptation on complete meetings; improvements to WFST decoding; automatic optimisation of decoders and system graphs. Overall these changes gave a 6-13% relative reduction in word error rate while at the same time reducing the real-time factor by a factor of five and using considerably less data for acoustic model training.

Published
2010
Pages
358-361
Journal
Proceedings of Interspeech, vol. 2010, no. 9, ISSN 1990-9772
Proceedings
Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010)
Conference
Interspeech 2010, Tokyo, JP
ISBN
978-1-61782-123-3
Publisher
International Speech Communication Association
Place
Makuhari, Chiba, JP
BibTeX
@INPROCEEDINGS{FITPUB9365,
   author = "Thomas Hain and Luk\'{a}\v{s} Burget and John Dines and N. Phillip Garner and Asmaa Hannani El and Marijn Huijbregts and Martin Karafi\'{a}t and Mike Lincoln and Vincent Wan",
   title = "The AMIDA 2009 Meeting Transcription System",
   pages = "358--361",
   booktitle = "Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010)",
   journal = "Proceedings of Interspeech",
   volume = 2010,
   number = 9,
   year = 2010,
   location = "Makuhari, Chiba, JP",
   publisher = "International Speech Communication Association",
   ISBN = "978-1-61782-123-3",
   ISSN = "1990-9772",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/9365"
}
Back to top