Thesis Details

Spojování záznamů v genealogických datech

Bachelor's Thesis
Student: Šorm Jan
Academic Year: 2018/2019
Supervisor: Zbořil František, doc. Ing., Ph.D.
English title
Record Linkage in Genealogical Data
Language
Czech
Abstract

The main aim of this thesis is to study genealogical data, identify the problems that can arise when merging them, and implement methods for this merging. The thesis examines the problem of classifying similar names into common classes, chiefly because people's first names and surnames play the most important role in every registry entry. It analyzes several metrics for calculating the distance between two strings and runs several experiments with these metrics to classify names into classes with as few errors as possible. Based on these results, record linkage experiments are performed.

Keywords

genealogy, register, records, merging, strings, distances, classes, C++

Department
Degree Programme
Information Technology
Status
defended, grade B
Date
10 June 2019
Reviewer
Committee
Smrž Pavel, doc. RNDr., Ph.D. (DCGM FIT BUT), chair
Fučík Otto, doc. Dr. Ing. (DCSY FIT BUT), member
Holík Lukáš, doc. Mgr., Ph.D. (DITS FIT BUT), member
Szőke Igor, Ing., Ph.D. (DCGM FIT BUT), member
Veselý Vladimír, Ing., Ph.D. (DIFS FIT BUT), member
Citation
ŠORM, Jan. Spojování záznamů v genealogických datech. Brno, 2019. Bachelor's Thesis. Brno University of Technology, Faculty of Information Technology. 2019-06-10. Supervised by Zbořil František. Available from: https://www.fit.vut.cz/study/thesis/22057/
BibTeX
@bachelorsthesis{FITBT22057,
    author = "Jan \v{S}orm",
    type = "Bachelor's thesis",
    title = "Spojov\'{a}n\'{i} z\'{a}znam\r{u} v genealogick\'{y}ch datech",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2019,
    location = "Brno, CZ",
    language = "czech",
    url = "https://www.fit.vut.cz/study/thesis/22057/"
}