Result Details
Information Retrieval from Spoken Documents
Smrž Pavel, doc. RNDr., Ph.D., DCGM (FIT)
Schwarz Petr, Ing., Ph.D., DCGM (FIT)
Szőke Igor, Ing., Ph.D., DCGM (FIT)
Schwarz Milan
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
Karafiát Martin, Ing., Ph.D., DCGM (FIT)
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
This paper describes a designed and implemented system for efficient storage,
indexing and search in collections of spoken documents that takes advantage
of automatic speech recognition. As the quality of current speech
recognizers is not sufficient for a great deal of applications, it is
necessary to index the ambiguous output of the recognition, i.\,e. the
acyclic graphs of word hypotheses --- recognition lattices. Then, it is not
possible to directly apply the standard methods known from text-based systems.
The paper discusses an optimized indexing system for efficient search in the
complex and large data structure that has been developed by our group. The
search engine works as a server. The meeting browser JFerret, developed withing
the European AMI project, is used as a client to browse search results.
multimedia information retrieval, speech databases
@inproceedings{BUT22168,
author="Michal {Fapšo} and Pavel {Smrž} and Petr {Schwarz} and Igor {Szőke} and Milan {Schwarz} and Jan {Černocký} and Martin {Karafiát} and Lukáš {Burget}",
title="Information Retrieval from Spoken Documents",
booktitle="Proceedings of the Seventh International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2006)",
year="2006",
pages="410--416",
publisher="Springer Verlag",
address="Mexico City",
isbn="3-540-32205-1",
url="http://www.fit.vutbr.cz/~smrz/pdf/cicling2006_xfapso00_et_al.pdf"
}
New trends in research and application of voice technology, GACR, Standardní projekty, GA102/05/0278, start: 2005-01-01, end: 2007-12-31, completed
Optická síť národního výzkumu a její nové aplikace, MŠMT, Výzkumná centra (2000-2004), MSM6383917201, start: 2004-01-01, end: 2010-12-31, completed
Speech Data Mining Research Group BUT Speech@FIT (RG SPEECH)