Faculty of Information Technology, BUT

Publication Details

The AMI System for the Transcription of Speech in Meetings

HAIN Thomas, WAN Vincent, BURGET Lukáš, KARAFIÁT Martin, DINES John, VEPA Jithendra, GARAU Giulia and LINCOLN Mike. The AMI System for the Transcription of Speech in Meetings. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Hononulu: IEEE Signal Processing Society, 2007, pp. 357-360. ISBN 1-4244-0728-1.
Czech title
AMI systém pro přepis řeči v meetinzích
Type
conference paper
Language
english
Authors
Hain Thomas (USF)
Wan Vincent (USF)
Burget Lukáš, doc. Ing., Ph.D. (DCGM FIT BUT)
Karafiát Martin, Ing., Ph.D. (DCGM FIT BUT)
Dines John (IDIAP)
Vepa Jithendra (IDIAP)
Garau Giulia (UEDIN)
Lincoln Mike (IDIAP)
URL
Keywords
speech recognition
Abstract
The paper is about AMI System for the Transcription of Speech in Meetings
Annotation
This paper describes the AMI transcription system for speech in meetings developed in collaboration by five research groups. The system includes generic techniques such as discriminative and speaker adaptive training, vocal tract length normalisation, heteroscedastic linear discriminant analysis, maximum likelihood linear regression, and phone posterior based features, as well as techniques specifically designed for meeting data. These include segmentation and cross-talk suppression, beam-forming, domain adaptation, Web-data collection, and channel adaptive training. The system was improved by more than 20% relative in word error rate compared to our previous system and was used in the NIST RT106 evaluations where it was found to yield competitive performance
Published
2007
Pages
357-360
Proceedings
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007)
Conference
32nd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Honolulu, US
ISBN
1-4244-0728-1
Publisher
IEEE Signal Processing Society
Place
Hononulu, US
BibTeX
@INPROCEEDINGS{FITPUB8463,
   author = "Thomas Hain and Vincent Wan and Luk\'{a}\v{s} Burget and Martin Karafi\'{a}t and John Dines and Jithendra Vepa and Giulia Garau and Mike Lincoln",
   title = "The AMI System for the Transcription of Speech in Meetings",
   pages = "357--360",
   booktitle = "Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007)",
   year = 2007,
   location = "Hononulu, US",
   publisher = "IEEE Signal Processing Society",
   ISBN = "1-4244-0728-1",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/8463"
}
Back to top