Result Details

Search Engine for Information Retrieval from Speech Records

FAPŠO, M.; SCHWARZ, P.; SZŐKE, I.; SMRŽ, P.; SCHWARZ, M.; ČERNOCKÝ, J.; KARAFIÁT, M.; BURGET, L. Search Engine for Information Retrieval from Speech Records. Proceedings of the Third International Seminar on Computer Treatment of Slavic and East European Languages. Bratislava: 2006. p. 100-101.
Type
conference paper
Language
English
Authors
Fapšo Michal, Ing., Ph.D.
Schwarz Petr, Ing., Ph.D., DCGM (FIT)
Szőke Igor, Ing., Ph.D., DCGM (FIT)
Smrž Pavel, doc. RNDr., Ph.D., DCGM (FIT)
Schwarz Milan, DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Karafiát Martin, Ing., Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Abstract

This paper describes a designed and implemented system for efficient storage,
indexing and search in collections of spoken documents that takes advantage
of  automatic speech recognition. As the quality of current speech
recognizers is not sufficient for a great deal of applications, it is
necessary to index the ambiguous output of the recognition, i.\,e. the
acyclic graphs of word hypotheses --- recognition lattices. Then, it is not
possible to directly apply the standard methods known from text-based systems.
The paper discusses an optimized indexing system for efficient search in the
complex and large data structure that has been developed by our group. The
search engine works as a server. The meeting browser JFerret, developed withing
the European AMI project,  is used as a client to browse search results.

Keywords

multimedia information retrieval, speech databases

URL
Published
2006
Pages
100–101
Proceedings
Proceedings of the Third International Seminar on Computer Treatment of Slavic and East European Languages
Conference
International Seminar on Computer Treatment of Slavic and East European Languages
Place
Bratislava
BibTeX
@inproceedings{BUT22170,
  author="Michal {Fapšo} and Petr {Schwarz} and Igor {Szőke} and Pavel {Smrž} and Milan {Schwarz} and Jan {Černocký} and Martin {Karafiát} and Lukáš {Burget}",
  title="Search Engine for Information Retrieval from Speech Records",
  booktitle="Proceedings of the Third International Seminar on Computer Treatment of Slavic and East European Languages",
  year="2006",
  pages="100--101",
  address="Bratislava",
  url="http://www.fit.vutbr.cz/~smrz/pdf/slovko2005_xfapso00_et_al.pdf"
}
Projects
Augmented Multi-party Interaction, EU, Sixth Framework programme, 506811-AMI, start: 2004-01-01, end: 2006-12-31, completed
New trends in research and application of voice technology, GACR, Standardní projekty, GA102/05/0278, start: 2005-01-01, end: 2007-12-31, completed
Optická síť národního výzkumu a její nové aplikace, MŠMT, Výzkumná centra (2000-2004), MSM6383917201, start: 2004-01-01, end: 2010-12-31, completed
Voice technologies for support of information society, GACR, Standardní projekty, GA102/02/0124, start: 2002-01-01, end: 2004-12-31, completed
Research groups
Departments
Back to top