Result Details
Generator of Synthetic Datasets for Hierarchical Sequential Pattern Mining Evaluation
Zendulka Jaroslav, doc. Ing., CSc., DIFS (FIT)
Evaluation is an important part of algorithm design. Algorithms are typically evaluated on real-world and synthetic datasets. Real-world datasets are appropriate for evaluation of algorithm properties in practice but it is difficult to change the dataset to have some particular statistics, e.g. number of input items. In contrast, generated synthetic dataset simply allows changing any of statistic property of the dataset with keeping all other statistic properties. In the paper, we present a procedure for generation of sequence databases with taxonomies for an evaluation of hierarchical sequential pattern mining algorithms.
Sequence pattern mining, synthetic dataset generators, taxonomy
Evaluation is an important part of algorithm design. Algorithms are typically evaluated on real-world and synthetic datasets. Real-world datasets are appropriate for evaluation of algorithm properties in practice but it is difficult to change the dataset to have some particular statistics, e.g. number of input items. In contrast, generated synthetic dataset simply allows changing any of statistic property of the dataset with keeping all other statistic properties. In the paper, we present a procedure for generation of sequence databases with taxonomies for an evaluation of hierarchical sequential pattern mining algorithms.
@inproceedings{BUT103555,
author="Michal {Šebek} and Jaroslav {Zendulka}",
title="Generator of Synthetic Datasets for Hierarchical Sequential Pattern Mining Evaluation",
booktitle="Proceedings of the Twelfth International Conference on Informatics 2013",
year="2013",
pages="289--292",
publisher="The University of Technology Košice",
address="Košice",
isbn="978-80-8143-127-2",
url="https://www.fit.vut.cz/research/publication/10435/"
}
Centrum excelence IT4Innovations, MŠMT, Operační program Výzkum a vývoj pro inovace, ED1.1.00/02.0070, start: 2011-01-01, end: 2015-12-31, completed
Improving Security of the Internet by Using System for Analyzing of Malicious Code Spreading, TAČR, Program aplikovaného výzkumu a experimentálního vývoje ALFA, TA01010858, start: 2011-01-01, end: 2013-12-31, completed
Security-Oriented Research in Information Technology, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running