Result Details

SoluProt: prediction of soluble protein expression in Escherichia coli

Created: 2021
Type
software
Language
English
Authors
Hon Jiří, Ing., Ph.D.
Marušiak Martin, Ing.
Martínek Tomáš, doc. Ing., Ph.D., DCSY (FIT)
Kunka Antonín, Mgr., Ph.D.
Zendulka Jaroslav, doc. Ing., CSc., DIFS (FIT)
Bednář David, FIT (FIT)
Damborský Jiří, prof. Mgr., Dr., FIT (FIT), UMEL (FEEC)
Description

A new tool for sequence-based prediction of soluble protein expression in Escherichia coli, SoluProt, was created using the gradient boosting machine technique with the TargetTrack database as a training set. When evaluated against a balanced independent test set derived from the NESG database, SoluProt's accuracy of 58.4% and AUC of 0.60 exceeded those of a suite of alternative solubility prediction tools. There is also evidence that it could significantly increase the success rate of experimental protein studies. SoluProt is freely available as a standalone program and a user-friendly webserver at https://loschmidt.chemi.muni.cz/soluprot/.
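
The two metrics quoted above, accuracy and AUC, can be illustrated with a minimal sketch. This is not SoluProt's code: the function names, the 0.5 decision threshold, and the toy labels and scores are all assumptions made for illustration, and the numbers are not taken from the NESG test set. On a balanced test set, a predictor that guesses at random is expected to reach about 50% accuracy and an AUC of about 0.5, which is the baseline the reported figures should be read against.

```python
# Illustrative sketch only: hypothetical helpers, not part of SoluProt.

def accuracy(labels, scores, threshold=0.5):
    """Fraction of sequences whose thresholded score matches the label.

    labels: 1 = soluble, 0 = insoluble (assumed encoding).
    scores: predicted solubility scores in [0, 1] (assumed range).
    """
    predictions = [1 if s >= threshold else 0 for s in scores]
    correct = sum(1 for y, p in zip(labels, predictions) if y == p)
    return correct / len(labels)


def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen soluble sequence
    scores higher than a randomly chosen insoluble one (ties count 0.5).
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one example of each class")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Toy, made-up data: three soluble and three insoluble sequences.
toy_labels = [1, 1, 1, 0, 0, 0]
toy_scores = [0.9, 0.6, 0.4, 0.7, 0.3, 0.2]

print(f"accuracy = {accuracy(toy_labels, toy_scores):.3f}")
print(f"AUC      = {auc(toy_labels, toy_scores):.3f}")
```

Accuracy depends on the chosen threshold, while AUC summarizes ranking quality across all thresholds, which is why the two are usually reported together.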

Keywords

protein solubility, machine learning

URL
License
Use of the result by another entity is possible without acquiring a license in some cases
License Fee
The licensor does not require a license fee for the result
Projects
AI methods for cyberspace security and control systems (Metody AI pro zabezpečení kybernetického prostoru a řídicí systémy), BUT, BUT internal projects, FIT-S-20-6293, start: 2020-03-01, end: 2023-02-28, completed
Departments