Thesis Details

Detekce podobnosti zdrojových souborů v jazyce C

Bachelor's Thesis Student: Rek Petr Academic Year: 2015/2016 Supervisor: Matula Peter, Ing.
English title
C Language Source Files Similarity Detection

This thesis deals with design, implementation and testing of the csim tool, which compares two C source files by their similarity. The primary purpose of this tool is testing of a decompiler developed by AVG Technologies s.r.o. Testing is based on comparing abstract syntax trees of the original and decompiled source files. The reader is introduced to the basics of reverse engineering, especially reverse engineering of a binary file into a high-level programming language source file. The process of compiling followed by decompiling of a file is described along with its effect on reverse engineering. The LLVM project and the Clang compiler is introduced to the reader, since its libraries are the foundation upon which the csim tool is built.


Reverse engineering, decompiler, Clang, LLVM, source file similarity, abstract syntax tree similarity, malware detection, C language

Degree Programme
Information Technology
defended, grade A
15 June 2016
Meduna Alexander, prof. RNDr., CSc. (DIFS FIT BUT), předseda
Burget Lukáš, doc. Ing., Ph.D. (DCGM FIT BUT), člen
Holík Lukáš, doc. Mgr., Ph.D. (DITS FIT BUT), člen
Jaroš Jiří, doc. Ing., Ph.D. (DCSY FIT BUT), člen
Křivka Zbyněk, Ing., Ph.D. (DIFS FIT BUT), člen
REK, Petr. Detekce podobnosti zdrojových souborů v jazyce C. Brno, 2016. Bachelor's Thesis. Brno University of Technology, Faculty of Information Technology. 2016-06-15. Supervised by Matula Peter. Available from:
    author = "Petr Rek",
    type = "Bachelor's thesis",
    title = "Detekce podobnosti zdrojov\'{y}ch soubor\r{u} v jazyce C",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2016,
    location = "Brno, CZ",
    language = "czech",
    url = ""
Back to top