Result Details

Generator of Synthetic Datasets for Hierarchical Sequential Pattern Mining Evaluation

ŠEBEK, M.; ZENDULKA, J. Generator of Synthetic Datasets for Hierarchical Sequential Pattern Mining Evaluation. Proceedings of the Twelfth International Conference on Informatics 2013. Košice: The University of Technology Košice, 2013. p. 289-292. ISBN: 978-80-8143-127-2.
Type
conference paper
Language
English
Authors
Šebek Michal, Ing., Ph.D., DIFS (FIT)
Zendulka Jaroslav, doc. Ing., CSc., DIFS (FIT)
Abstract

Evaluation is an important part of algorithm design. Algorithms are typically evaluated on real-world and synthetic datasets. Real-world datasets are appropriate for evaluation of algorithm properties in practice but it is difficult to change the dataset to have some particular statistics, e.g. number of input items. In contrast, generated synthetic dataset simply allows changing any of statistic property of the dataset with keeping all other statistic properties. In the paper, we present a procedure for generation of sequence databases with taxonomies for an evaluation of hierarchical sequential pattern mining algorithms.

Keywords

Sequence pattern mining, synthetic dataset generators, taxonomy

Annotation

Evaluation is an important part of algorithm design. Algorithms are typically evaluated on real-world and synthetic datasets. Real-world datasets are appropriate for evaluation of algorithm properties in practice but it is difficult to change the dataset to have some particular statistics, e.g. number of input items. In contrast, generated synthetic dataset simply allows changing any of statistic property of the dataset with keeping all other statistic properties. In the paper, we present a procedure for generation of sequence databases with taxonomies for an evaluation of hierarchical sequential pattern mining algorithms.

Published
2013
Pages
289–292
Proceedings
Proceedings of the Twelfth International Conference on Informatics 2013
Conference
Informatics 2013 - 12th International Scientific Conference on Informatics
ISBN
978-80-8143-127-2
Publisher
The University of Technology Košice
Place
Košice
BibTeX
@inproceedings{BUT103555,
  author="Michal {Šebek} and Jaroslav {Zendulka}",
  title="Generator of Synthetic Datasets for Hierarchical Sequential Pattern Mining Evaluation",
  booktitle="Proceedings of the Twelfth International Conference on Informatics 2013",
  year="2013",
  pages="289--292",
  publisher="The University of Technology Košice",
  address="Košice",
  isbn="978-80-8143-127-2",
  url="https://www.fit.vut.cz/research/publication/10435/"
}
Files
Projects
Advanced recognition and presentation of multimedia data, BUT, Vnitřní projekty VUT, FIT-S-11-2, start: 2011-01-01, end: 2013-12-31, completed
Centrum excelence IT4Innovations, MŠMT, Operační program Výzkum a vývoj pro inovace, ED1.1.00/02.0070, start: 2011-01-01, end: 2015-12-31, completed
Improving Security of the Internet by Using System for Analyzing of Malicious Code Spreading, TAČR, Program aplikovaného výzkumu a experimentálního vývoje ALFA, TA01010858, start: 2011-01-01, end: 2013-12-31, completed
Security-Oriented Research in Information Technology, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, start: 2007-01-01, end: 2013-12-31, running
Departments
Back to top