Thesis Details
Modeling Prosody Dynamics for Speaker Recognition (Modelování dynamiky prosodie pro rozpoznávání řečníka)
Most current automatic speaker recognition systems extract speaker-dependent features from short-term spectral information. This approach ignores long-term information. I explored an approach that uses the fundamental frequency and energy trajectories of each speaker. It models prosody dynamics over single phonemes or syllables. It is known from the literature that prosodic systems do not perform as well as acoustic ones, but that they improve overall performance when the two are fused. I verified this assumption by fusing my results with the state-of-the-art acoustic system from BUT. Data from standard evaluation campaigns organized by the National Institute of Standards and Technology are used for all experiments.
prosody, pitch, energy, speaker identification, speaker validation, speaker recognition, language model, bigram, n-gram
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT), member
Fučík Otto, doc. Dr. Ing. (DCSY FIT BUT), member
Kršek Přemysl, doc. Ing., Ph.D. (DCGM FIT BUT), member
Sochor Jiří, prof. Ing., CSc. (FI MUNI), member
Zemčík Pavel, prof. Dr. Ing. (DCGM FIT BUT), member
@mastersthesis{FITMT6977,
  author = "Zden\v{e}k Jan\v{c}\'{i}k",
  type = "Master's thesis",
  title = "Modelov\'{a}n\'{i} dynamiky prosodie pro rozpozn\'{a}v\'{a}n\'{i} \v{r}e\v{c}n\'{i}ka",
  school = "Brno University of Technology, Faculty of Information Technology",
  year = 2008,
  location = "Brno, CZ",
  language = "czech",
  url = "https://www.fit.vut.cz/study/thesis/6977/"
}