Thesis Details

Adversarial Augmentation for Robust Speech Separation

Master's Thesis Student: Pavlus Ján Academic Year: 2021/2022 Supervisor: Žmolíková Kateřina, Ing., Ph.D.
Czech title
Adversariální augmentace pro robustní separaci řeči
Language
English
Abstract

Speech separation is the task of separating single signals from the given mixture of multiple speakers. Neural networks trained for speech separation usually work well on artificial data but they often fail on real-world examples. To improve their behavior on real-world mixtures it is possible to use training data augmentations such as noise addition. Nevertheless, the power of these augmentations is limited as they have to be manually designed.     In this work, the modified version of the generative adversarial networks (GAN) model could improve this process by generating augmentations depending on the separation performance on these data. Speech separation could be then made more robust with each generator and separator training step. This system was subjected to experimentation. During these experiments, the parameters have been tuned to find the best setting that will successfully train the GAN model without collapsing. This setting was found and the most robust model from the training was selected and evaluated. Results show that the separator model trained by the GAN model does not achieve any significant improvement from the original separator model pretrained on the WSJ0-2mix dataset during the testing on the WHAM dataset. Nevertheless, another evaluation shows that the separator model trained by the GAN model is significantly more robust than the original one towards the generated noises.

Keywords

speech separation, GAN, adversarial augmentations, robust neural network

Department
Degree Programme
Files
Status
defended, grade A
Date
21 June 2022
Reviewer
Committee
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT), předseda
Burget Lukáš, doc. Ing., Ph.D. (DCGM FIT BUT), člen
Češka Milan, doc. RNDr., Ph.D. (DITS FIT BUT), člen
Hradiš Michal, Ing., Ph.D. (DCGM FIT BUT), člen
Rozman Jaroslav, Ing., Ph.D. (DITS FIT BUT), člen
Zbořil František V., doc. Ing., CSc. (DITS FIT BUT), člen
Citation
PAVLUS, Ján. Adversarial Augmentation for Robust Speech Separation. Brno, 2022. Master's Thesis. Brno University of Technology, Faculty of Information Technology. 2022-06-21. Supervised by Žmolíková Kateřina. Available from: https://www.fit.vut.cz/study/thesis/25172/
BibTeX
@mastersthesis{FITMT25172,
    author = "J\'{a}n Pavlus",
    type = "Master's thesis",
    title = "Adversarial Augmentation for Robust Speech Separation",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2022,
    location = "Brno, CZ",
    language = "english",
    url = "https://www.fit.vut.cz/study/thesis/25172/"
}
Back to top