Thesis Details
Distribuované zpracování dat o IP tocích
This thesis deals with the subject of distributed processing of IP flow. Main goal is to provide animplementation of a software collector which allows storing and processing huge amount ofa network data in particular. There was studied an open-source implementation of a framework forthe distributed processing of large data sets called Hadoop, which is based on MapReduce paradigm.There were made some experiments with this system which provided the comparison with the currentsystems and shown weaknesses of this framework. Based on this knowledge there was createda specification and scheme for an extension of current software collector within this work. In terms ofthe created scheme there was created an implementation of query framework for formed collector,which is considered as most critical in the field of distributed processing of IP flow data. Results ofexperiments with created implementation show significant performance growth and ability of linearscalability with some types of queries.
Distribution, computation, storage, database, MapReduce, Hadoop, Nfdump, IPFIX
Balík Miroslav, Ing., Ph.D. (FIT CTU), člen
Burget Radek, doc. Ing., Ph.D. (DIFS FIT BUT), člen
Drábek Vladimír, doc. Ing., CSc. (DCSY FIT BUT), člen
Holík Lukáš, doc. Mgr., Ph.D. (DITS FIT BUT), člen
Matoušek Petr, doc. Ing., Ph.D., M.A. (DIFS FIT BUT), člen
@mastersthesis{FITMT17592, author = "Pavel Krobot", type = "Master's thesis", title = "Distribuovan\'{e} zpracov\'{a}n\'{i} dat o IP toc\'{i}ch", school = "Brno University of Technology, Faculty of Information Technology", year = 2015, location = "Brno, CZ", language = "czech", url = "https://www.fit.vut.cz/study/thesis/17592/" }