Best Program/Tool/Server To Predict Fold Types/Structural Family For Unannotated Pdb Structures.
3
2
Entering edit mode
13.0 years ago
Pals ★ 1.3k

I am working with large number of proteins. About 80% of those structures have their folds/families/superfamilies classified by CATH and SCOP (I am using CATH). Therefore, I want to predict folds for the remaining 20% of the structures also. I would like to know if there are any prediction tools that can classify those proteins into one of the CATH olds/architecture/superfamily.

Thanks!

Kisun

pdb • 3.9k views
ADD COMMENT
1
Entering edit mode

For the 20% that do not have families assigned, do you have structures or sequences only?

ADD REPLY
0
Entering edit mode

Structure always comes with a sequence :)

ADD REPLY
3
Entering edit mode
13.0 years ago

Have you looked at HMM-based classification tools?

edit: You can also do a BLAST search with sequences on the CATH database here that gives you the most similar fold.

ADD COMMENT
0
Entering edit mode

Yes, I tried but these programs just assign my protein to one of the available folds/superfamilies from SCOP/CATH/Pfam. In fact I want the prediction programs.

ADD REPLY
2
Entering edit mode
13.0 years ago
Jan Kosinski ★ 1.6k

If you could live with SCOP, you could use fastSCOP: http://fastscop.life.nctu.edu.tw/intro.php

From the website: "fastSCOP is a web server that rapidly identifies the structural domains and determines the evolutionary superfamilies of a query protein structure. This server uses 3D-BLAST to scan quickly a large structural classification database (SCOP1.71 with <95% identity with each other) and the top ten hit domains, which have different superfamily classifications, are obtained from the hit lists. MAMMOTH, a detailed structural alignment tool, is adopted to align these top ten structures to refine domain boundaries and to identify evolutionary superfamilies."

ADD COMMENT
2
Entering edit mode
13.0 years ago

"I want to predict folds for the remaining 20% of the structures also."

IMHO, you cannot assign an existing SCOP class to these entries, because these 20% are novel .

It is true that approximately 20% of proteins in PDB is not associated with any known SCOP classification. SCOP curators are well aware of these group of structures. This 20% structures are not assigned a classification due to the sequence-structure diversity of those structures with rest of the members in PDB. For example the structures my contain additional embellishments in structure that will disqualify them from being a member of a particular fold/class. Some of these may get merged into existing classes in the future or depending upon the number of new members with similar features, it may add to a novel fold. As large part of SCOP is manually curate, you may need to wait until SCOP curators assign a fold details to these entries.

ADD COMMENT
0
Entering edit mode

Thanks a lot for the insight.

ADD REPLY

Login before adding your answer.

Traffic: 2025 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6