Thesis Details

Určování typů a atributů entit napříč jazyky

Bachelor's Thesis Student: Švub Daniel Academic Year: 2018/2019 Supervisor: Smrž Pavel, doc. RNDr., Ph.D.
English title
Identifying Entity Types and Attributes Across Languages
Language
Czech
Abstract

The target of this thesis is to analyze articles on the Wikipedia internet encyclopedia and to convert their text written in natural language into a structured database of persons, places and other entities. The essence of the implemented program is the determination of the type of entity based on its typical characteristics, and the extraction of the most important attributes of this entity in the Czech and Slovak languages. The result of this task is a knowledge base allowing simple searching and sorting of information. Thanks to its easy extensibility, it is possible to add identification of other types of entities and other features to the program, as well as a support of other languages.

Keywords

Wikipedia, information extraction, text mining, entity atributes

Department
Degree Programme
Information Technology
Files
Status
defended, grade E
Date
10 June 2019
Reviewer
Committee
Smrž Pavel, doc. RNDr., Ph.D. (DCGM FIT BUT), předseda
Fučík Otto, doc. Dr. Ing. (DCSY FIT BUT), člen
Holík Lukáš, doc. Mgr., Ph.D. (DITS FIT BUT), člen
Szőke Igor, Ing., Ph.D. (DCGM FIT BUT), člen
Veselý Vladimír, Ing., Ph.D. (DIFS FIT BUT), člen
Citation
ŠVUB, Daniel. Určování typů a atributů entit napříč jazyky. Brno, 2019. Bachelor's Thesis. Brno University of Technology, Faculty of Information Technology. 2019-06-10. Supervised by Smrž Pavel. Available from: https://www.fit.vut.cz/study/thesis/21926/
BibTeX
@bachelorsthesis{FITBT21926,
    author = "Daniel \v{S}vub",
    type = "Bachelor's thesis",
    title = "Ur\v{c}ov\'{a}n\'{i} typ\r{u} a atribut\r{u} entit nap\v{r}\'{i}\v{c} jazyky",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2019,
    location = "Brno, CZ",
    language = "czech",
    url = "https://www.fit.vut.cz/study/thesis/21926/"
}
Back to top