Thesis Details

Subspace Modeling of Prosodic Features for Speaker Verification

Ph.D. Thesis Student: Kockmann Marcel Academic Year: 2011/2012 Supervisor: Černocký Jan, prof. Dr. Ing.

Czech title

Modelování prozodických příznaků pro ověřování mluvčího v pod-prostorech

Language

English

Abstract

The thesis investigates into speaker verification by means of prosodic features. This includes an appropriate representation of speech by measurements of pitch, energy and duration of speech sounds. Two diverse parameterization methods are investigated: the first leads to a low-dimensional well-defined set, the second to a large-scale set of heterogeneous prosodic features. The first part of this work concentrates on the development of so called prosodic contour features. Different modeling techniques are developed and investigated, with a special focus on subspace modeling. The second part focuses on a novel subspace modeling technique for the heterogeneous large-scale prosodic features. The model is theoretically derived and experimentally evaluated on official NIST Speaker Recognition Evaluation tasks. Huge improvements over the current state-of-the-art in prosodic speaker verification were obtained. Eventually, a novel fusion method is presented to elegantly combine the two diverse prosodic systems. This technique can also be used to fuse the higher-level systems with a high-performing cepstral system, leading to further significant improvements.

Keywords

Department

Department of Computer Graphics and Multimedia FIT BUT

Degree Programme

Computer Science and Engineering, Field of Study Computer Science and Engineering

Files

Status

defended

Date

21 May 2012

Citation

KOCKMANN, Marcel. Subspace Modeling of Prosodic Features for Speaker Verification. Brno, 2011. Ph.D. Thesis. Brno University of Technology, Faculty of Information Technology. 2012-05-21. Supervised by Černocký Jan. Available from: https://www.fit.vut.cz/study/phd-thesis/228/

BibTeX

@phdthesis{FITPT228,
    author = "Marcel Kockmann",
    type = "Ph.D. thesis",
    title = "Subspace Modeling of Prosodic Features for Speaker Verification",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2012,
    location = "Brno, CZ",
    language = "english",
    url = "https://www.fit.vut.cz/study/phd-thesis/228/"
}

Theses