Result Details
Convolutional Neural Networks and X-Vector Embedding for DCASE2018 Acoustic Scene Classification Challenge
Burget Lukáš, doc. Ing., Ph.D., DCGM (FIT)
Černocký Jan, prof. Dr. Ing., DCGM (FIT)
In this paper, the Brno University of Technology (BUT) team submissionsfor Task 1 (Acoustic Scene Classification, ASC) of theDCASE-2018 challenge are described. Also, the analysis of differentmethods on the leaderboard set is provided. The proposedapproach is a fusion of two different Convolutional Neural Network(CNN) topologies. The first one is the common two-dimensionalCNNs which is mainly used in image classification. The second oneis a one-dimensional CNN for extracting fixed-length audio segmentembeddings, so called x-vectors, which has also been used inspeech processing, especially for speaker recognition. In additionto the different topologies, two types of features were tested: logmel-spectrogram and CQT features. Finally, the outputs of differentsystems are fused using a simple output averaging in the bestperforming system. Our submissions ranked third among 24 teamsin the ASC sub-task A (task 1a).
Audio scene classification, Convolutional neuralnetworks, Deep learning, x-vectors, Regularized LDA
@inproceedings{BUT155111,
author="Hossein {Zeinali} and Lukáš {Burget} and Jan {Černocký}",
title="Convolutional Neural Networks and X-Vector Embedding for DCASE2018 Acoustic Scene Classification Challenge",
booktitle="Proceedings of DCASE 2018 Workshop",
year="2018",
pages="1--5",
publisher="Tampere University of Technology",
address="Surrey",
isbn="978-952-15-4262-6",
url="http://dcase.community/documents/workshop2018/proceedings/DCASE2018Workshop_Zeinali_149.pdf"
}
International mobility of researchers at the Brno University of Technology, EU, OPVVV PO2 Mezinárodní mobilita výzkumných pracovníků, EF16_027/0008371, CZ.02.2.69/0.0/0.0/16_027/0008371, start: 2018-01-01, end: 2022-09-30, running
IT4Innovations excellence in science, MŠMT, Národní program udržitelnosti II, LQ1602, start: 2016-01-01, end: 2020-12-31, completed