Result Details

Resources and Benchmarks for Keyword Search in Spoken Audio From Low-Resource Indian Languages

NADIMPALLI, V.; KESIRAJU, S.; BANKA, R.; KETHIREDDY, R.; GANGASHETTY, S. Resources and Benchmarks for Keyword Search in Spoken Audio From Low-Resource Indian Languages. IEEE Access, 2022, vol. 10, no. 2022, p. 34789-34799. ISSN: 2169-3536.
Type
journal article
Language
English
Authors
NADIMPALLI, V.
Kesiraju Santosh, Ph.D., DCGM (FIT)
BANKA, R.
KETHIREDDY, R.
Gangashetty Suryakanth V, FIT (FIT)
Abstract

This paper presents the resources and benchmarks developed for keyword search (KWS)in spoken audio from six low-resource Indian languages (from two families), namely Gujarati, Hindi,Marathi, Odia, Tamil, and Telugu. The current work on constructing keywords and building benchmarkKWS systems is inspired by the popular IARPA Babel program and the subsequent works on low-resourceKWS. The keywords are constructed by taking into account their properties i.e., occurrence, length, andaverage confusability; and their effects on the evaluation metric - the term-weighted value (TWV).We makeuse of freely available speech datasets, and reprocess them to create resources for KWS, thereby addingvalue to the existing speech resources. Four ASR-based KWS systems are built, and their performance isanalyzed across the three keyword properties on all the six languages. The prepared keywords and otherrelated resources to replicate our experiments are made available for the public.We believe that the analysisand guidelines provided in this paper will not only help the research community, but also practitioners andengineers to easily create KWS resources for newer languages, datasets, and scenarios.

Keywords

Keyword search, low-resource languages, term-weighted value (TWV)

URL
Published
2022
Pages
34789–34799
Journal
IEEE Access, vol. 10, no. 2022, ISSN 2169-3536
DOI
UT WoS
000778878900001
EID Scopus
BibTeX
@article{BUT182528,
  author="NADIMPALLI, V. and KESIRAJU, S. and BANKA, R. and KETHIREDDY, R. and GANGASHETTY, S.",
  title="Resources and Benchmarks for Keyword Search in Spoken Audio From Low-Resource Indian Languages",
  journal="IEEE Access",
  year="2022",
  volume="10",
  number="2022",
  pages="34789--34799",
  doi="10.1109/ACCESS.2022.3162854",
  issn="2169-3536",
  url="https://ieeexplore.ieee.org/document/9743904"
}
Files
Projects
Neural Representations in multi-modal and multi-lingual modeling, GACR, Grantové projekty exelence v základním výzkumu EXPRO - 2019, GX19-26934X, start: 2019-01-01, end: 2023-12-31, completed
Research groups
Departments
Back to top