Thesis Details

Určování typů a atributů entit napříč jazyky

Bachelor's Thesis Student: Švub Daniel Academic Year: 2018/2019 Supervisor: Smrž Pavel, doc. RNDr., Ph.D.

English title

Identifying Entity Types and Attributes Across Languages

Language

Czech

Abstract

The target of this thesis is to analyze articles on the Wikipedia internet encyclopedia and to convert their text written in natural language into a structured database of persons, places and other entities. The essence of the implemented program is the determination of the type of entity based on its typical characteristics, and the extraction of the most important attributes of this entity in the Czech and Slovak languages. The result of this task is a knowledge base allowing simple searching and sorting of information. Thanks to its easy extensibility, it is possible to add identification of other types of entities and other features to the program, as well as a support of other languages.

Keywords

Wikipedia, information extraction, text mining, entity atributes

Department

Department of Computer Graphics and Multimedia FIT BUT

Degree Programme

Information Technology

Files

Status

defended, grade E

Date

10 June 2019

Reviewer

Otrusina Lubomír, Ing.

Committee

Smrž Pavel, doc. RNDr., Ph.D. (DCGM FIT BUT), předseda
Fučík Otto, doc. Dr. Ing. (DCSY FIT BUT), člen
Holík Lukáš, doc. Mgr., Ph.D. (DITS FIT BUT), člen
Szőke Igor, Ing., Ph.D. (DCGM FIT BUT), člen
Veselý Vladimír, Ing., Ph.D. (DIFS FIT BUT), člen

Citation

ŠVUB, Daniel. Určování typů a atributů entit napříč jazyky. Brno, 2019. Bachelor's Thesis. Brno University of Technology, Faculty of Information Technology. 2019-06-10. Supervised by Smrž Pavel. Available from: https://www.fit.vut.cz/study/thesis/21926/

BibTeX

@bachelorsthesis{FITBT21926,
    author = "Daniel \v{S}vub",
    type = "Bachelor's thesis",
    title = "Ur\v{c}ov\'{a}n\'{i} typ\r{u} a atribut\r{u} entit nap\v{r}\'{i}\v{c} jazyky",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2019,
    location = "Brno, CZ",
    language = "czech",
    url = "https://www.fit.vut.cz/study/thesis/21926/"
}

Theses