Publication Details

Big Data Network Flow Processing Using Apache Spark

JEŘÁBEK Kamil and RYŠAVÝ Ondřej. Big Data Network Flow Processing Using Apache Spark. In: Proceedings of the 6th Conference on the Engineering of Computer Based Systems (ECBS 2019), 2019. Bucharest: Association for Computing Machinery, 2019, pp. 1-9. ISBN 978-1-4503-7636-5.
Czech title
Big data zpracování síťových toků pomocí Apache Spark
Type
conference paper
Language
English
Authors
Kamil Jeřábek, Ondřej Ryšavý
Keywords

Big Data, Network flows, Apache Spark, Cassandra, Apache Ignite

Abstract

The increasing number of traffic flows captured as part of network monitoring makes their analysis more complicated. One goal of network traffic analysis is to identify malicious communication. In this paper, we present a new system for big data network flow classification and clustering. The proposed system is based on popular big data engines such as Apache Spark and Apache Ignite. The experiments conducted demonstrate the feasibility of the proposed approach and indicate its scalability.

Published
2019
Pages
1-9
Proceedings
Proceedings of the 6th Conference on the Engineering of Computer Based Systems (ECBS 2019), 2019
Conference
6th Conference on the Engineering of Computer Based Systems, Bucharest, RO
ISBN
978-1-4503-7636-5
Publisher
Association for Computing Machinery
Place
Bucharest, RO
DOI
10.1145/3352700.3352709
UT WoS
000525376600009
EID Scopus
BibTeX
@INPROCEEDINGS{FITPUB11977,
   author = "Kamil Je\v{r}\'{a}bek and Ond\v{r}ej Ry\v{s}av\'{y}",
   title = "Big Data Network Flow Processing Using Apache Spark",
   pages = "1--9",
   booktitle = "Proceedings of the 6th Conference on the Engineering of Computer Based Systems (ECBS 2019), 2019",
   year = 2019,
   location = "Bukure\v{s}\v{t}, RO",
   publisher = "Association for Computing Machinery",
   ISBN = "978-1-4503-7636-5",
   doi = "10.1145/3352700.3352709",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/11977"
}