Publication Details

Network Intrusion Datasets: A Survey, Limitations, and Recommendations

GOLDSCHMIDT, P.; CHUDÁ, D. Network Intrusion Datasets: A Survey, Limitations, and Recommendations. COMPUTERS & SECURITY, 2025, vol. 156, p. 104510-104542. ISSN: 0167-4048.
Czech title
Datové sady pro detekci útoků na počítačových sítích: Přehled, Limitace, a Doporučení
Type
journal article
Language
English
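Authors
Patrik Goldschmidt, Daniela Chudá
URL
https://www.sciencedirect.com/science/article/pii/S0167404825001993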
Keywords

Network intrusion detection, NIDS, Data, Systematic Literature Review (SLR), Machine learning for intrusion detection, Cybersecurity, Best practices, Recommendations, Dataset popularity analysis, Domain limitations

Abstract

Data-driven cyberthreat detection has become a crucial defense technique
in modern cybersecurity. Network defense, supported by Network
Intrusion Detection Systems (NIDSs), has also increasingly adopted
data-driven approaches, leading to greater reliance on data. Despite the
importance of data, its scarcity has long been recognized as a major
obstacle in NIDS research. In response, the community has published many
new datasets recently. However, many of them remain largely unknown and
unanalyzed, leaving researchers uncertain about their suitability for
specific use cases.

In this paper, we aim to address this knowledge gap by performing a
systematic literature review (SLR) of 89 public datasets for NIDS
research. Each dataset is comparatively analyzed across 13 key
properties, and its potential applications are outlined. Beyond the
review, we also discuss domain-specific challenges and common data
limitations to facilitate a critical view on data quality. To aid in
data selection, we conduct a dataset popularity analysis in contemporary
state-of-the-art NIDS research. Furthermore, the paper presents best
practices for dataset selection, generation, and usage. By providing a
comprehensive overview of the domain and its data, this work aims to
guide future research toward improving data quality and the robustness
of NIDS solutions.

Published
2025 (in print)
Pages
104510–104542
Journal
COMPUTERS & SECURITY, vol. 156, ISSN 0167-4048
Book
Computers & Security
Publisher
Elsevier Science
DOI
10.1016/j.cose.2025.104510
BibTeX
@article{BUT194021,
  author="Patrik {Goldschmidt} and Daniela {Chudá}",
  title="Network Intrusion Datasets: A Survey, Limitations, and Recommendations",
  journal="COMPUTERS \& SECURITY",
  year="2025",
  volume="156",
  pages="104510--104542",
  doi="10.1016/j.cose.2025.104510",
  issn="0167-4048",
  url="https://www.sciencedirect.com/science/article/pii/S0167404825001993"
}