Thesis Details

Detection of Pre-Recorded Messages in Speech

Bachelor's Thesis Student: Boboš Dominik Academic Year: 2020/2021 Supervisor: Černocký Jan, prof. Dr. Ing.

Czech title

Detekce přednahraných úseků v řeči

Language

English

Abstract

Recognition of pre-recorded messages in speech is useful for any follow-up speech data mining. This thesis summarises the theory of searching similar utterances in speech and efficient approaches to compare two sequences. To investigate identification of redundant information in audio, it is necessary to have a large amount of data with the exact phrases repeated multiple times. We generated a dataset by mixing pre-recorded messages into phone calls with variations in speed, volume and repetitions. Our system tackles known messages and unknown messages'' scenarios by using approaches like clustering or detection in chunks. Dynamic time warping, approximate string matching and recurrent quantification analysis are compared, and finally, all mentioned techniques are combined to obtain a precise and efficient system.

Keywords

detection of re-occurring sequences in audio, segmental dynamic time warping, recurrence quantification analysis, fuzzy string matching, bottleneck features, phoneme posteriors, Mel-frequency cepstral coefficients features

Department

Department of Computer Graphics and Multimedia FIT BUT

Degree Programme

Information Technology

Files

Status

defended, grade A

Date

16 June 2021

Reviewer

Matějka Pavel, Ing., Ph.D.

Committee

Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT), předseda
Češka Milan, doc. RNDr., Ph.D. (DITS FIT BUT), člen
Jaroš Jiří, doc. Ing., Ph.D. (DCSY FIT BUT), člen
Orság Filip, Ing., Ph.D. (DITS FIT BUT), člen
Rychlý Marek, RNDr., Ph.D. (DIFS FIT BUT), člen

Citation

BOBOŠ, Dominik. Detection of Pre-Recorded Messages in Speech. Brno, 2021. Bachelor's Thesis. Brno University of Technology, Faculty of Information Technology. 2021-06-16. Supervised by Černocký Jan. Available from: https://www.fit.vut.cz/study/thesis/22504/

BibTeX

@bachelorsthesis{FITBT22504,
    author = "Dominik Bobo\v{s}",
    type = "Bachelor's thesis",
    title = "Detection of Pre-Recorded Messages in Speech",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2021,
    location = "Brno, CZ",
    language = "english",
    url = "https://www.fit.vut.cz/study/thesis/22504/"
}

Theses