Result Details

Decoding the Hidden Secrets of SNP Data: Revealing Ancestral Origins, Genomic Predictions, and Polygenic Risk Score

SCHWARZEROVÁ, J.; HURTA, M.; WECKWERTH, W.; WALTHER, D. Decoding the Hidden Secrets of SNP Data: Revealing Ancestral Origins, Genomic Predictions, and Polygenic Risk Score. Germany: 2023. 1 p.
Type
abstract
Language
English
Authors
Schwarzerová Jana, Ing. et Ing., MSc, UBMI (FEEC)
Hurta Martin, Ing., DCSY (FIT)
Weckwerth Wolfram, Prof., Dr. rer. nat.
Walther Dirk
Abstract

The study of Single Nucleotide Polymorphism (SNPs) data provides a gateway to uncovering profound insights into ancestral origins, enomic predictions, and polygenic genetic scores.
We embarked on an exploration of SNP data using two distinct datasets available for the plant Arabidopsis thaliana. Firstly, we employed a constructed artificial dataset to simulate and analyse mutation increments over multiple generations. This artificial dataset allowed us to investigate the dynamics of genetic variation and its implications for ancestral lineage tracing by Principal Component Analysis (PCA). Secondly, we incorporated real data comprising 27,081 non-redundant SNPs. Leveraging this extensive dataset, our investigations aimed to explore the intricate genetic landscape of Arabidopsis thaliana and reveal crucial details about population structure, genetic diversity, and the potential functional implications of identified SNPs in relation to various metabolites.
We developed a Polygenic Risk Score (PRS) tool implemented in Python - PGine: Py/Bioconda software package for the calculation of polygenic risk scores in plants that may be useful for new breeding strategies. Subsequently, we integrated diverse computational approaches to achieve Genomic Prediction models.
Our study reveals the hidden information embedded in SNP data, thereby improving our understanding of general genetic variation and its implications for different fields, including predictive modelling, population genetics and evolutionary biology.

Keywords

Arabidopsis thaliana, Single Nucleotide Polymorphisms (SNPs), Machine learning, Genetic Variation

Published
2023
Pages
1
Place
Germany
BibTeX
@misc{BUT184936,
  author="Jana {Schwarzerová} and Martin {Hurta} and Wolfram {Weckwerth} and Dirk {Walther}",
  title="Decoding the Hidden Secrets of SNP Data: Revealing Ancestral Origins, Genomic Predictions, and Polygenic Risk Score",
  year="2023",
  pages="1",
  address="Germany",
  note="Abstract"
}
Projects
PGine: Py/Bioconda software package for calculation of polygenic risk score in plants, BUT, Vnitřní projekty VUT, FEKT/FIT-J-23-8274, start: 2023-03-01, end: 2024-02-28, running
Departments
Back to top