Faculty of Information Technology, BUT

Course details

Speech Signal Processing

ZRE Acad. year 2018/2019 Summer semester 5 credits

Applications of speech processing, digital processing of speech signals, production and perception of speech, introduction to phonetics, pre-processing and basic parameters of speech, linear-predictive model, cepstrum, fundamental frequency estimation, coding - time domain and vocoders, recognition - DTW and HMM, synthesis. Software and libraries for speech processing.

Guarantor

Language of instruction

Czech

Completion

Examination (written)

Time span

26 hrs lectures, 2 hrs exercises, 12 hrs pc labs, 12 hrs projects

Assessment points

51 exam, 14 half-term test, 6 labs, 29 projects

Department

Lecturer

Instructor

Subject specific learning outcomes and competences

The students will get familiar with basic characteristics of speech signal in relation to production and hearing of speech by humans. They will understand basic algorithms of speech analysis common to many applications. They will be given an overview of applications (recognition, synthesis, coding) and be informed about practical aspects of speech algorithms implementation. The students will be able to design a simple system for speech processing (speech activity detector, recognizer of limited number of isolated words), including its implementation into application programs.

Learning objectives

To provide students with the knowledge of basic characteristics of speech signal in relation to production and hearing of speech by humans. To describe basic algorithms of speech analysis common to many applications. To give an overview of applications (recognition, synthesis, coding) and to inform about practical aspects of speech algorithms implementation.

Study literature

  • Gold, B., Morgan, N.: Speech and Audio Signal Processing, John Wiley & Sons, 2000, ISBN 0-471-35154-7

Fundamental literature

  • Psutka, J.: Komunikace s počítačem mluvenou řečí. Academia, Praha, 1995, ISBN  80-200-0203-0
  • Gold, B., Morgan, N.: Speech and Audio Signal Processing, John Wiley & Sons, 2000, ISBN 0-471-35154-7
  • Rabiner, L., Juang, B.H.: Fundamentals of Speech Recognition, Signal Processing, Prentice Hall, Engelwood Cliffs, NJ, 1993, ISBN 0-13-015157-2

Syllabus of lectures

  1. Introduction, applications of speech processing. 
  2. Digital processing of speech signals.
  3. Speech production and its signal processing model. 
  4. Pre-processing and basic parameters of speech, cepstrum.
  5. Linear-predictive model. 
  6. Fundamental frequency estimation.
  7. Speech coding - basics
  8. CELP Speech coding. 
  9. Speech recognition - basics, DTW. 
  10. Hidden Markov models HMM. 
  11. Large vocabulary continuous speech recognition (LVCSR) systems. 
  12. Speaker and language recognition. Neural networks in speech processing. 
  13. Text to speech synthesis. 

Syllabus of numerical exercises

  1. Parameterization, DTW, HMM.

Progress assessment

  • mid-term test 14 pts
  • project 29 pts
  • presentation of results in computer labs 6 pts

Schedule

DayTypeWeeksRoomStartEndLect.grpGroupsInfo
Moncomp.labeven week N103 09:0010:50 1MIT
Tuecomp.labeven week O204 17:0018:50 1MIT
Wedexam2019-06-05 E104 09:0010:50 1MIT 2MIT 2. oprava
Wedlecturelectures E104 09:0010:50 1MIT 2MIT MGM xx
Wedexercise2019-04-24 E105 11:0012:50 1MIT 2MIT
Thuexam2019-05-16 E105 15:0017:50 1MIT 2MIT řádná
Thuexam2019-05-30 E104 15:0016:50 1MIT 2MIT 1. oprava

Course inclusion in study plans

  • Programme IT-MSC-2, field MBI, any year of study, Compulsory-Elective group S
  • Programme IT-MSC-2, field MBS, MIS, MMM, any year of study, Elective
  • Programme IT-MSC-2, field MGM, 1st year of study, Compulsory
  • Programme IT-MSC-2, field MIN, any year of study, Compulsory-Elective group C
  • Programme IT-MSC-2, field MPV, any year of study, Compulsory-Elective group G
  • Programme IT-MSC-2, field MSK, 2nd year of study, Compulsory-Elective group B
Back to top