Thesis Details

Detection of Pre-Recorded Messages in Speech

Bachelor's Thesis Student: Boboš Dominik Academic Year: 2020/2021 Supervisor: Černocký Jan, prof. Dr. Ing.
Czech title
Detekce přednahraných úseků v řeči
Language
English
Abstract

Recognition of pre-recorded messages in speech is useful for any follow-up speech data mining. This thesis summarises the theory of searching similar utterances in speech and efficient approaches to compare two sequences. To investigate identification of redundant information in audio, it is necessary to have a large amount of data with the exact phrases repeated multiple times. We generated a dataset by mixing pre-recorded messages into phone calls with variations in speed, volume and repetitions. Our system tackles known messages and unknown messages'' scenarios by using approaches like clustering or detection in chunks. Dynamic time warping, approximate string matching and recurrent quantification analysis are compared, and finally, all mentioned techniques are combined to obtain a precise and efficient system.

Keywords

detection of re-occurring sequences in audio, segmental dynamic time warping, recurrence quantification analysis, fuzzy string matching, bottleneck features, phoneme posteriors, Mel-frequency cepstral coefficients features

Department
Degree Programme
Information Technology
Files
Status
defended, grade A
Date
16 June 2021
Reviewer
Committee
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT), předseda
Češka Milan, doc. RNDr., Ph.D. (DITS FIT BUT), člen
Jaroš Jiří, doc. Ing., Ph.D. (DCSY FIT BUT), člen
Orság Filip, Ing., Ph.D. (DITS FIT BUT), člen
Rychlý Marek, RNDr., Ph.D. (DIFS FIT BUT), člen
Citation
BOBOŠ, Dominik. Detection of Pre-Recorded Messages in Speech. Brno, 2021. Bachelor's Thesis. Brno University of Technology, Faculty of Information Technology. 2021-06-16. Supervised by Černocký Jan. Available from: https://www.fit.vut.cz/study/thesis/22504/
BibTeX
@bachelorsthesis{FITBT22504,
    author = "Dominik Bobo\v{s}",
    type = "Bachelor's thesis",
    title = "Detection of Pre-Recorded Messages in Speech",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2021,
    location = "Brno, CZ",
    language = "english",
    url = "https://www.fit.vut.cz/study/thesis/22504/"
}
Back to top