Thesis Details
Optimization of Gaussian Mixture Subspace Models and Related Scoring Algorithms in Speaker Verification
This thesis deals with Gaussian Mixture Subspace Modeling in automatic speaker recognition. The thesis consists of three parts. In the first part, Joint Factor Analysis (JFA) scoring methods are studied. The methods differ mainly in how they deal with the channel of the tested utterance. The general JFA likelihood function is investigated and the methods are compared both in terms of accuracy and speed. It was found that linear approximation of the log-likelihood function gives comparable results to the full log-likelihood evaluation while simplyfing the formula and dramatically reducing the computation speed.
Speaker Recognition, Gaussian Mixture Model, Subspace Modeling, i-vector, Joint Factor Analysis, Discriminative Training
@phdthesis{FITPT209, author = "Ond\v{r}ej Glembek", type = "Ph.D. thesis", title = "Optimization of Gaussian Mixture Subspace Models and Related Scoring Algorithms in Speaker Verification", school = "Brno University of Technology, Faculty of Information Technology", year = 2012, location = "Brno, CZ", language = "english", url = "https://www.fit.vut.cz/study/phd-thesis/209/" }