Thesis Details

Metody pro získávání asociačních pravidel z dat

Master's Thesis
Student: Uhlíř Martin
Academic Year: 2006/2007
Supervisor: Bartík Vladimír, Ing., Ph.D.

English title

Methods for Mining Association Rules from Data

Language

Czech

Abstract

The aim of this thesis is to implement the Multipass-Apriori method for mining association rules from text data. After an introduction to the field of knowledge discovery, the specific aspects of text mining are discussed. Preprocessing is a very important step in the mining process; in this case it requires stemming and a stop-word dictionary. The next part of the thesis deals with the meaning, usage, and generation of association rules. The main part describes the Multipass-Apriori method, which was implemented. Based on the tests performed, the best way of dividing partitions and the best way of sorting the itemsets were determined. As part of the testing, the Multipass-Apriori method was compared with the Apriori method.

Keywords

frequent itemset, association rules, Apriori, Multipass-Apriori, stemming, stop words, text data preprocessing

Department

Department of Information Systems FIT BUT

Degree Programme

Information Technology, Field of Study Information Systems

Files

Thesis text 743 kB

Status

defended, grade A

Date

18 June 2007

Reviewer

Burget Radek, doc. Ing., Ph.D.

Committee

Meduna Alexander, prof. RNDr., CSc. (DIFS FIT BUT), chair
Hanáček Petr, doc. Dr. Ing. (DITS FIT BUT), member
Krejčíček Jaromír, prof. Ing., CSc. (UNOB), member
Křena Bohuslav, Ing., Ph.D. (DITS FIT BUT), member
Sumec Stanislav, Ing., Ph.D. (DCGM FIT BUT), member
Zbořil František, doc. Ing., Ph.D. (DITS FIT BUT), member

Citation

UHLÍŘ, Martin. Metody pro získávání asociačních pravidel z dat. Brno, 2007. Master's Thesis. Brno University of Technology, Faculty of Information Technology. 2007-06-18. Supervised by Bartík Vladimír. Available from: https://www.fit.vut.cz/study/thesis/4771/

BibTeX

@mastersthesis{FITMT4771,
    author = "Martin Uhl\'{i}\v{r}",
    type = "Master's thesis",
    title = "Metody pro z\'{i}sk\'{a}v\'{a}n\'{i} asocia\v{c}n\'{i}ch pravidel z dat",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2007,
    location = "Brno, CZ",
    language = "czech",
    url = "https://www.fit.vut.cz/study/thesis/4771/"
}
