Thesis Details

Aplikace metody učení bez učitele na hledání podobných grafů

Master's Thesis Student: Sabo Jozef Academic Year: 2020/2021 Supervisor: Křivka Zbyněk, Ing., Ph.D.
English title
Application of Unsupervised Learning Methods in Graph Similarity Search
Language
Czech
Abstract

Goal of this master's thesis was in cooperation with the company Avast to design a system, which can extract knowledge from a database of graphs. Graphs, used for data mining, describe behaviour of computer systems and they are anonymously inserted into the company's database from systems of the company's products users. Each graph in the database can be assigned with one of two labels: clean or malware (malicious) graph. The task of the proposed self-learning system is to find clusters of graphs in the graph database, in which the classes of graphs do not mix. Graph clusters with only one class of graphs can be interpreted as different types of clean or malware graphs and they are a useful source of further analysis on the graphs. To evaluate the quality of the clusters, a custom metric, named as monochromaticity, was designed. The metric evaluates the quality of the clusters based on how much clean and malware graphs are mixed in the clusters. The best results of the metric were obtained when vector representations of graphs were created by a deep learning model (variational  graph autoencoder with two relation graph convolution operators) and the parameterless method MeanShift was used for clustering over vectors.

Keywords

graph, Avast, unsupervised learning, clustering, communities detection, node embeddings, graph embeddings, graph neural networks

Department
Degree Programme
Information Technology and Artificial Intelligence, Specialization Application Development
Files
Status
defended, grade C
Date
23 June 2021
Reviewer
Committee
Zemčík Pavel, prof. Dr. Ing. (DCGM FIT BUT), předseda
Bartík Vladimír, Ing., Ph.D. (DIFS FIT BUT), člen
Holík Lukáš, doc. Mgr., Ph.D. (DITS FIT BUT), člen
Kreslíková Jitka, doc. RNDr., CSc. (DIFS FIT BUT), člen
Orság Filip, Ing., Ph.D. (DITS FIT BUT), člen
Polčák Libor, Ing., Ph.D. (DIFS FIT BUT), člen
Citation
SABO, Jozef. Aplikace metody učení bez učitele na hledání podobných grafů. Brno, 2021. Master's Thesis. Brno University of Technology, Faculty of Information Technology. 2021-06-23. Supervised by Křivka Zbyněk. Available from: https://www.fit.vut.cz/study/thesis/23791/
BibTeX
@mastersthesis{FITMT23791,
    author = "Jozef Sabo",
    type = "Master's thesis",
    title = "Aplikace metody u\v{c}en\'{i} bez u\v{c}itele na hled\'{a}n\'{i} podobn\'{y}ch graf\r{u}",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2021,
    location = "Brno, CZ",
    language = "czech",
    url = "https://www.fit.vut.cz/study/thesis/23791/"
}
Back to top