Result Details

Deriving Spectro-temporal Properties of Hearing from Speech Data

ONDEL YANG, L.; LI, R.; SELL, G.; HEŘMANSKÝ, H. Deriving Spectro-temporal Properties of Hearing from Speech Data. In Proceedings of ICASSP. Brighton: IEEE Signal Processing Society, 2019. p. 411-415. ISBN: 978-1-5386-4658-8.

Type

conference paper

Language

English

Authors

ONDEL YANG, L.
Li Ruizhi
SELL, G.
Heřmanský Hynek, prof. Ing., Dr. Eng., DCGM (FIT)

Abstract

Human hearing and human speech are intrinsically tied together, asthe properties of speech almost certainly developed in order to beheard by human ears. As a result of this connection, it has beenshown that certain properties of human hearing are mimicked withindata-driven systems that are trained to understand human speech.In this paper, we further explore this phenomenon by measuring thespectro-temporal responses of data-derived filters in a front-end convolutionallayer of a deep network trained to classify the phonemesof clean speech. The analyses show that the filters do indeed exhibitspectro-temporal responses similar to those measured in mammals,and also that the filters exhibit an additional level of frequency selectivity,similar to the processing pipeline assumed within the ArticulationIndex.

Keywords

perception, spectro-temporal, auditory, deeplearning

URL

Published

2019

Pages

411–415

Proceedings

Proceedings of ICASSP

Conference

2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

ISBN

978-1-5386-4658-8

Publisher

IEEE Signal Processing Society

Place

Brighton

DOI

10.1109/ICASSP.2019.8682787

UT WoS

000482554000083

EID Scopus

2-s2.0-85068988824

BibTeX

@inproceedings{BUT160004,
  author="ONDEL YANG, L. and LI, R. and SELL, G. and HEŘMANSKÝ, H.",
  title="Deriving Spectro-temporal Properties of Hearing from Speech Data",
  booktitle="Proceedings of ICASSP",
  year="2019",
  pages="411--415",
  publisher="IEEE Signal Processing Society",
  address="Brighton",
  doi="10.1109/ICASSP.2019.8682787",
  isbn="978-1-5386-4658-8",
  url="https://ieeexplore.ieee.org/document/8682787"
}

Files

pdf ondel_icassp2019_08682787.pdf 2 MB

Projects

IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed
Zpracování, zobrazování a analýza multimediálních a 3D dat, BUT, Vnitřní projekty VUT, FIT-S-17-3984, start: 2017-03-01, end: 2020-02-29, completed

Research groups

Výzkumná skupina dolování dat z řeči BUT Speech@FIT (RG SPEECH)

Departments

Ústav počítačové grafiky a multimédií (DCGM)