Result Details

Convolutional Neural Networks and X-Vector Embedding for DCASE2018 Acoustic Scene Classification Challenge

ZEINALI, H.; BURGET, L.; ČERNOCKÝ, J. Convolutional Neural Networks and X-Vector Embedding for DCASE2018 Acoustic Scene Classification Challenge. Proceedings of DCASE 2018 Workshop. Surrey: Tampere University of Technology, 2018. p. 1-5. ISBN: 978-952-15-4262-6.
Type
conference paper
Language
English
Authors
Hossein Zeinali, Lukáš Burget, Jan Černocký
Abstract

In this paper, the Brno University of Technology (BUT) team submissions for Task 1 (Acoustic Scene Classification, ASC) of the DCASE-2018 challenge are described. An analysis of different methods on the leaderboard set is also provided. The proposed approach is a fusion of two different Convolutional Neural Network (CNN) topologies. The first is the common two-dimensional CNN, which is mainly used in image classification. The second is a one-dimensional CNN for extracting fixed-length audio segment embeddings, so-called x-vectors, which have also been used in speech processing, especially for speaker recognition. In addition to the different topologies, two types of features were tested: log mel-spectrogram and CQT features. Finally, in the best-performing system, the outputs of the different systems are fused using simple output averaging. Our submissions ranked third among 24 teams in the ASC sub-task A (task 1a).
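
As a rough illustration of the output-averaging fusion mentioned in the abstract, the following is a minimal Python sketch. The function names, the three example systems, and the score values are all hypothetical. This record does not state what kind of scores the systems produce or how many systems were fused, so the sketch simply assumes that each system yields one score per scene class.

```python
def average_fusion(system_outputs):
    """Average per-class scores from several systems for one audio segment.

    system_outputs: list of score lists, one list per system, all of the
    same length (one score per scene class). Assumed layout, not taken
    from the paper.
    """
    n_systems = len(system_outputs)
    n_classes = len(system_outputs[0])
    if any(len(scores) != n_classes for scores in system_outputs):
        raise ValueError("all systems must score the same set of classes")
    return [
        sum(scores[c] for scores in system_outputs) / n_systems
        for c in range(n_classes)
    ]


def predict(system_outputs):
    """Return the index of the class with the highest averaged score."""
    fused = average_fusion(system_outputs)
    return max(range(len(fused)), key=fused.__getitem__)


# Hypothetical scores over three scene classes from three systems.
cnn_2d_mel = [0.5, 0.25, 0.25]
xvector_mel = [0.25, 0.5, 0.25]
cnn_2d_cqt = [0.5, 0.25, 0.25]

outputs = [cnn_2d_mel, xvector_mel, cnn_2d_cqt]
fused = average_fusion(outputs)
best_class = predict(outputs)
```

With equal weights, a class favoured by most systems wins even when one system disagrees, which is why such averaging is a common baseline for combining classifiers.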

Keywords

Audio scene classification, Convolutional neural networks, Deep learning, x-vectors, Regularized LDA

URL
http://dcase.community/documents/workshop2018/proceedings/DCASE2018Workshop_Zeinali_149.pdf
Published
2018
Pages
1–5
Proceedings
Proceedings of DCASE 2018 Workshop
Conference
Detection and Classification of Acoustic Scenes and Events
ISBN
978-952-15-4262-6
Publisher
Tampere University of Technology
Place
Surrey
BibTeX
@inproceedings{BUT155111,
  author="Hossein {Zeinali} and Lukáš {Burget} and Jan {Černocký}",
  title="Convolutional Neural Networks and X-Vector Embedding for DCASE2018 Acoustic Scene Classification Challenge",
  booktitle="Proceedings of DCASE 2018 Workshop",
  year="2018",
  pages="1--5",
  publisher="Tampere University of Technology",
  address="Surrey",
  isbn="978-952-15-4262-6",
  url="http://dcase.community/documents/workshop2018/proceedings/DCASE2018Workshop_Zeinali_149.pdf"
}
Projects
Information mining in speech acquired by distant microphones, MV, Bezpečnostní výzkum České republiky 2015-2020, VI20152020025, start: 2015-10-01, end: 2020-09-30, completed
International mobility of researchers at the Brno University of Technology, EU, OPVVV PO2 Mezinárodní mobilita výzkumných pracovníků, EF16_027/0008371, CZ.02.2.69/0.0/0.0/16_027/0008371, start: 2018-01-01, end: 2022-09-30, running
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed