Detail výsledku

Combined Density- and Grid- Based Method for Clustering of Protein Substructures

BURGETOVÁ, I. Combined Density- and Grid- Based Method for Clustering of Protein Substructures. ZNALOSTI 2009, Proceedings of the 8th annual conference. Brno: Vydavateľstvo STU, 2009. p. 201-212. ISBN: 978-80-227-3015-0.
Typ
článek ve sborníku konference
Jazyk
anglicky
Autoři
Abstrakt

Data mining techniques may reveal interesting knowledge in various datasets. The biological databases are enormously large and therefore, data mining techniques could be extremely helpful to extract the knowledge from them. In our study, we focused on data mining in PDB - Protein Data Bank. We used cluster analysis to identify the sequences that occur in limited number of structural conformations (sequence-structure fragments). This knowledge about protein fragments can be used in protein structure predictions. In this paper, we present a combined density- and grid-based method that we developed for clustering of protein structures. We also compare this method with a simple density-based clustering method that we used in the first part of our study to prove the existence of protein sequences that occur in more than one structural conformation but the number of its structural conformations is limited.

Klíčová slova

Cluster analysis, data mining, PDB, sequence-structure fragments, protein structure prediction

Rok
2009
Strany
201–212
Sborník
ZNALOSTI 2009, Proceedings of the 8th annual conference
Konference
Znalosti 2009
ISBN
978-80-227-3015-0
Vydavatel
Vydavateľstvo STU
Místo
Brno
BibTeX
@inproceedings{BUT30201,
  author="Ivana {Burgetová}",
  title="Combined Density- and Grid- Based Method for Clustering of Protein Substructures",
  booktitle="ZNALOSTI 2009, Proceedings of the 8th annual conference",
  year="2009",
  pages="201--212",
  publisher="Vydavateľstvo STU",
  address="Brno",
  isbn="978-80-227-3015-0"
}
Projekty
Výzkum informačních technologií z hlediska bezpečnosti, MŠMT, Institucionální prostředky SR ČR (např. VZ, VC), MSM0021630528, zahájení: 2007-01-01, ukončení: 2013-12-31, řešení
Výzkumné skupiny
Pracoviště
Nahoru