Result Details

The AMI System for the Transcription of Speech in Meetings

HAIN, T.; WAN, V.; BURGET, L.; KARAFIÁT, M.; DINES, J.; VEPA, J.; GARAU, G.; LINCOLN, M. The AMI System for the Transcription of Speech in Meetings. Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Hononulu: IEEE Signal Processing Society, 2007. p. 357-360. ISBN: 1-4244-0728-1.

Type

conference paper

Language

English

Authors

Hain Thomas
Wan Vincent
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Karafiát Martin, Ing., Ph.D., FIT (FIT), DCGM (FIT)
Dines John
Vepa Jithendra
Garau Giulia
Lincoln Mike

Abstract

The paper is about AMI System for the Transcription of Speech in Meetings

Keywords

speech recognition

URL

https://www.fit.vut.cz/research/group/speech/public/publi/2007/hain_icassp07…

Annotation

This paper describes the AMI transcription system for speech in meetings developed in collaboration by five research groups. The system includes generic techniques such as discriminative and speaker adaptive training, vocal tract length normalisation, heteroscedastic linear discriminant analysis, maximum likelihood linear regression, and phone posterior based features, as well as techniques specifically designed for meeting data. These include segmentation and cross-talk suppression, beam-forming, domain adaptation, Web-data collection, and channel adaptive training. The system was improved by more than 20% relative in word error rate compared to our previous system and was used in the NIST RT106 evaluations where it was found to yield competitive performance

Published

2007

Pages

357–360

Proceedings

Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007)

Conference

32nd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

ISBN

1-4244-0728-1

Publisher

IEEE Signal Processing Society

Place

Hononulu

BibTeX

@inproceedings{BUT25337,
  author="Thomas {Hain} and Vincent {Wan} and Lukáš {Burget} and Martin {Karafiát} and John {Dines} and Jithendra {Vepa} and Giulia {Garau} and Mike {Lincoln}",
  title="The AMI System for the Transcription of Speech in Meetings",
  booktitle="Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007)",
  year="2007",
  pages="357--360",
  publisher="IEEE Signal Processing Society",
  address="Hononulu",
  isbn="1-4244-0728-1",
  url="http://www.fit.vutbr.cz/research/groups/speech/publi/2007/hain_icassp07.pdf"
}

Projects

Interactive Keyword Detector, GACR, Postdoktorandské granty, GP102/06/P383, start: 2006-01-01, end: 2008-12-31, completed
Security-Oriented Research in Information Technology, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running

Research groups

Speech Data Mining Research Group BUT Speech@FIT (RG SPEECH)

Departments

Department of Computer Graphics and Multimedia (DCGM)