Detail výsledku

Multi-level Sequence Mining Based on GSP

ŠEBEK, M.; HLOSTA, M.; KUPČÍK, J.; ZENDULKA, J.; HRUŠKA, T. Multi-level Sequence Mining Based on GSP. Proceedings of the Eleventh International Conference on Informatics INFORMATICS'2011. 1. Košice: Faculty of Electrical Engineering and Informatics, University of Technology Košice, 2011. p. 185-190. ISBN: 978-80-89284-94-8.
Typ
článek ve sborníku konference
Jazyk
anglicky
Autoři
Šebek Michal, Ing., Ph.D., FIT (FIT), UIFS (FIT)
Hlosta Martin, Ing., Ph.D., FIT (FIT), UIFS (FIT)
Kupčík Jan, Ing., FIT (FIT), UIFS (FIT)
Zendulka Jaroslav, doc. Ing., CSc., UIFS (FIT)
Hruška Tomáš, prof. Ing., CSc., CIS ‒ Kancelář ředitele (CIS), UIFS (FIT)
Abstrakt

Mining sequential patterns is an important problem in the field of data mining and many algorithms and optimization techniques have been published to deal with that problem. An GSP algorithm, which is one of them, can be used for mining sequential patterns with some additional constraints, like gaps between items.

Taxonomies can exist upon the items in sequences. It can be applied to mine sequential patterns with items on several hierarchical levels of the taxonomy. If a more general item appears in a pattern, the pattern has higher or at least the same support as the one containing the corresponding specific item. This allows us to mine more patterns with the same minimal support parameter and to reveal new potentially useful patterns. This paper presents a method for mining multi-level sequential patterns. The method is based on the GSP algorithm and generalization of more specific sequences based on the information theory.

Klíčová slova

Sequence pattern mining, GSP, taxonomy

Rok
2011
Strany
185–190
Sborník
Proceedings of the Eleventh International Conference on Informatics INFORMATICS'2011
Řada
1
Konference
Informatics 2011 - 11th International Scientific Conference on Informatics
ISBN
978-80-89284-94-8
Vydavatel
Faculty of Electrical Engineering and Informatics, University of Technology Košice
Místo
Košice
BibTeX
@inproceedings{BUT76372,
  author="Michal {Šebek} and Martin {Hlosta} and Jan {Kupčík} and Jaroslav {Zendulka} and Tomáš {Hruška}",
  title="Multi-level Sequence Mining Based on GSP",
  booktitle="Proceedings of the Eleventh International Conference on Informatics INFORMATICS'2011",
  year="2011",
  series="1",
  pages="185--190",
  publisher="Faculty of Electrical Engineering and Informatics, University of Technology Košice",
  address="Košice",
  isbn="978-80-89284-94-8"
}
Projekty
Pokročilé rozpoznávání a prezentace multimediálních dat, VUT, Vnitřní projekty VUT, FIT-S-11-2, zahájení: 2011-01-01, ukončení: 2013-12-31, ukončen
Systém pro zvýšení bezpečnosti v prostředí Internetu analýzou šíření škodlivého kódu, TAČR, Program aplikovaného výzkumu a experimentálního vývoje ALFA, TA01010858, zahájení: 2011-01-01, ukončení: 2013-12-31, ukončen
Výzkum informačních technologií z hlediska bezpečnosti, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, zahájení: 2007-01-01, ukončení: 2013-12-31, řešení
Pracoviště
Nahoru