Publication Details

SoluProt: Prediction of Protein Solubility

HON Jiří, MARUŠIAK Martin, MARTÍNEK Tomáš, ZENDULKA Jaroslav, BEDNÁŘ David and DAMBORSKÝ Jiří. SoluProt: Prediction of Protein Solubility. In: DAZ & WIKT 2018 Proceedings. Brno: Brno University of Technology, 2018, pp. 261-265. ISBN 978-80-214-5679-2.
Czech title
SoluProt: predikce rozpustnosti proteinů
Type
conference paper
Language
english
Authors
Hon Jiří, Ing., Ph.D. (DIFS FIT BUT)
Marušiak Martin, Ing. (FIT BUT)
Martínek Tomáš, doc. Ing., Ph.D. (DCSY FIT BUT)
Zendulka Jaroslav, doc. Ing., CSc. (DIFS FIT BUT)
Bednář David, Mgr. (LL)
Damborský Jiří, prof. Mgr., Dr. (LL)
Keywords

protein, solubility, prediction, machine-learning

Abstract

Protein solubility poses a major bottleneck in production of many therapeutic and industrially attractive proteins. Experimental solubilization attempts are plagued by relatively low success rates and often lead to the loss of biological activity. Therefore, any advance in computational prediction of protein solubility may reduce the cost of experimental studies significantly. Here, we propose a novel software tool SoluProt for prediction of solubility from protein sequence based on machine learning and TargetTrack database. SoluProt achieved the best accuracy 58.2% and AUC 0.61 of all available tools at an independent balanced test set derived from NESG database. While the absolute prediction performance is rather low, SoluProt can still help to reduce costs of experimental studies significantly by efficient prioritization of protein sequences. The main SoluProt contribution lies in improved preprocessing of noisy training data and sensible selection of sequence features included in the prediction model.

Published
2018
Pages
261-265
Proceedings
DAZ & WIKT 2018 Proceedings
Conference
Data a znalosti 2018, Brno, CZ
ISBN
978-80-214-5679-2
Publisher
Brno University of Technology
Place
Brno, CZ
BibTeX
@INPROCEEDINGS{FITPUB11808,
   author = "Ji\v{r}\'{i} Hon and Martin Maru\v{s}iak and Tom\'{a}\v{s} Mart\'{i}nek and Jaroslav Zendulka and David Bedn\'{a}\v{r} and Ji\v{r}\'{i} Damborsk\'{y}",
   title = "SoluProt: Prediction of Protein Solubility",
   pages = "261--265",
   booktitle = "DAZ \& WIKT 2018 Proceedings",
   year = 2018,
   location = "Brno, CZ",
   publisher = "Brno University of Technology",
   ISBN = "978-80-214-5679-2",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/11808"
}
Back to top