Course details

Natural Language Processing (in English)

ZPJa Acad. year 2024/2025 Winter semester 5 credits

Foundations of natural language processing, historical perspective, statistical NLP and the modern era dominated by machine learning and, specifically, deep neural networks. Meaning of individual words, lexicology and lexicography, word senses and neural architectures for computing word embeddings, word sense classification and inference. Constituency and dependency parsing, syntactic ambiguity, neural dependency parsers. Language modeling and its applications in general architectures. Machine translation, historical perspective on the statistical approach, neural translation and evaluation scores. End-to-end models, attention mechanisms, limits of current seq2seq models. Question answering based on neural models, information extraction components, text understanding challenges, learning by reading and machine comprehension. Text classification and its modern applications, convolutional neural networks for sentence classification. Language-independent representations, non-standard texts from social networks, representing parts of words, subword models. Contextual representations and pretraining for context-dependent language modules. Transformers and self-attention for generative models. Communication agents and natural language generation. Coreference resolution and its interconnection to other text understanding components.

Guarantor

Smrž Pavel, doc. RNDr., Ph.D. (DCGM)

Course coordinator

Dočekal Martin, Ing. (DCGM)
Kesiraju Santosh, Ph.D. (DCGM)

Language of instruction

English

Completion

Examination (written)

Time span

26 hrs lectures
26 hrs projects

Assessment points

51 pts final exam (written part)
9 pts mid-term test (written part)
40 pts projects

Department

Department of Computer Graphics and Multimedia (UPGM)

Lecturer

Dočekal Martin, Ing. (DCGM)
Fajčík Martin, Ing., Ph.D. (DCGM)
Kesiraju Santosh, Ph.D. (DCGM)

Learning objectives

To understand natural language processing and to learn how to apply modern machine learning methods in this field. To get acquainted with advanced deep learning architectures that have proved successful in various NLP tasks. To understand the use of neural networks for sequential language modelling, to understand their use as conditional language models for transduction tasks, and to get acquainted with approaches employing these techniques in combination with other mechanisms for advanced applications.

The students will get acquainted with natural language processing and will understand a range of neural network models that are commonly applied in the field. They will also grasp the basics of neural implementations of attention mechanisms and sequence embedding models, and how these modular components can be combined to build state-of-the-art NLP systems. They will be able to implement and evaluate common neural network models for various NLP applications.
Students will improve their programming skills and gain knowledge of and practical experience with deep learning tools and with general processing of textual data.

Prerequisite knowledge and skills

Knowledge of Python programming and fundamental elements of calculus.

Study literature

Géron, Aurélien. Hands-On Machine Learning with Scikit-Learn and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems. O'Reilly Media, Inc., 2017.

Syllabus of lectures

Introduction, history of NLP, and modern approaches based on deep learning
Word senses and word vectors
Dependency parsing
Language models
Machine translation
Seq2seq models and attention
Question answering
Convolutional neural networks for sentence classification
Information from parts of words: Subword models
Modeling contexts of use: Contextual representations and pretraining
Transformers and self-attention for generative models
Natural language generation
Coreference resolution

Syllabus - others, projects and individual work of students

Individually assigned project

Progress assessment

Mid-term test - up to 9 points
Individual project - up to 40 points
Written final exam - up to 51 points

The evaluation includes a mid-term test, an individual project, and the final exam. The mid-term test has no correction option; the final exam has two possible correction runs.

Schedule

Day	Type	Weeks	Room	Start	End	Capacity	Lect.grp	Groups	Info
Wed	exam	2025-01-22	E104	11:00	12:50				2nd exam date
Thu	lecture	1., 2., 3., 4., 5., 6., 7., 8., 9., 10., 11., 12. of lectures	A112	08:00	09:50	64	1EIT 1MIT 2EIT 2MIT INTE	NSPE xx	Fajčík
Thu	lecture	2024-12-12	E112	14:00	15:50	64	1EIT 1MIT 2EIT 2MIT INTE	NSPE xx	Fajčík
Fri	exam	2025-01-31	A112	11:00	12:50				3rd exam date
Fri	exam	2025-01-10	E104	13:00	14:50				1st exam date

Course inclusion in study plans

Programme MIT-EN (in English), any year of study, Elective
Programme MITAI, field NADE, NBIO, NCPS, NEMB, NEMB, NGRI, NHPC, NIDE, NISD, NISY, NISY up to 2020/21, NMAL, NMAT, NNET, NSEC, NSEN, NVER, NVIZ, any year of study, Elective
Programme MITAI, field NSPE, any year of study, Compulsory