Reducing Ensembles of Protein Tertiary Structures Generated De Novo via Clustering
by: Ahmed Bin Zaman, Parastoo Kamranfar, Carlotta Domeniconi, Amarda Shehu
| Format: | Article |
|---|---|
| Published: | MDPI AG 2020-05-01 |
Description
Controlling the quality of tertiary structures computed for a protein molecule remains a central challenge in de novo protein structure prediction. The rule of thumb is to generate as many structures as can be afforded, effectively acknowledging that having more structures increases the likelihood that some will reside near the sought biologically active structure. A major drawback of this approach is that computing a large number of structures imposes time and space costs. In this paper, we propose a novel clustering-based approach which we demonstrate to significantly reduce an ensemble of generated structures without sacrificing quality. Evaluations are reported on both benchmark and CASP target proteins. Structure ensembles subjected to the proposed approach and the source code of the proposed approach are publicly available at the links provided in Section 1.