Publication Details

Transcribing Meetings with the AMIDA System

HAIN, T.; BURGET, L.; DINES, J.; GARNER, P.; GRÉZL, F.; EL HANNANI, A.; HUIJBREGTS, M.; KARAFIÁT, M.; LINCOLN, M.; WAN, V. Transcribing Meetings with the AMIDA System. IEEE Transactions on Audio, Speech, and Language Processing, 2012, vol. 20, no. 2, p. 486-498. ISSN: 1558-7916.

Czech title

Rozpoznávání meetingů se systémy AMIDA

Type

journal article

Language

English

Authors

Hain Thomas
Burget Lukáš, doc. Ing., Ph.D. (DCGM)
Dines John
Garner Phillip
Grézl František, Ing., Ph.D. (DCGM)
El Hannani Asmaa
Huijbregts Marijn
Karafiát Martin, Ing., Ph.D. (DCGM)
Lincoln Mike
Wan Vincent

Keywords

AMI corpus, Juicer, meeting transcription, multiple distant microphone, resource optimisation, rich text

Abstract

This paper describes the AMIDA systems for transcription of conference and lecture room meetings, which were developed for participation in the Rich Transcription (RT) evaluations conducted by NIST in 2007 and 2009.

Annotation

In this paper, we give an overview of the AMIDA systems for transcription of conference and lecture room meetings. The systems were developed for participation in the Rich Transcription evaluations conducted by the National Institute of Standards and Technology in 2007 and 2009, and they can process both close-talking and far-field microphone recordings. The paper first discusses fundamental properties of meeting data, with special focus on the AMI/AMIDA corpora. This is followed by a description and analysis of improved processing and modeling, with focus on techniques that specifically address meeting transcription issues such as multi-room recordings or domain variability. Two different system-building strategies were followed in 2007 and 2009. While in 2007 we used our traditional system design based on cross adaptation, the 2009 systems were constructed semi-automatically, supported by improved decoders and a new method for system representation. Overall, these changes gave a 6%-13% relative reduction in word error rate compared to our 2007 results, while at the same time requiring less training material and reducing the real-time factor by a factor of five. The meeting transcription systems are available at www.webasr.org.

Published

2012

Pages

486–498

Journal

IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 2, ISSN 1558-7916

Book

IEEE Transactions on Audio, Speech, and Language Processing

Publisher

IEEE Signal Processing Society

Place

New York

DOI

10.1109/TASL.2011.2163395

UT WoS

000299525800013

EID Scopus

2-s2.0-85008520364

BibTeX

@article{BUT91486,
  author="Thomas {Hain} and Lukáš {Burget} and John {Dines} and Phillip {Garner} and František {Grézl} and Asmaa {El Hannani} and Marijn {Huijbregts} and Martin {Karafiát} and Mike {Lincoln} and Vincent {Wan}",
  title="Transcribing Meetings with the AMIDA System",
  journal="IEEE Transactions on Audio, Speech, and Language Processing",
  year="2012",
  volume="20",
  number="2",
  pages="486--498",
  doi="10.1109/TASL.2011.2163395",
  issn="1558-7916",
  url="http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5983475"
}