Furthermore this is a goal-driven assessment. It provides an objective ranking for the servers that aim at the prediction of protein fold and probable evolutionary relationship as classified in SCOP. Note that many servers do not necessarily have the same goal but instead aim at the recognition of a large fragment of similar structure in a protein of known structure regardless of its SCOP classification. I think that for such servers this ranking will be largerly irrelevant.
Also available is the same evaluation of LiveBench results.
SCOP assessment of templates for the 56 targets in CAFASP 3.
The sensitivity score is the total number of true positives. The specificity score is the average of the 10 columns. Each column shows the number of true positivies before the 'n'th false positive, where 'n' is the column number. The first column shows the number of targets which template information was available for.
The full data is available in a flat file here. Also alternative tables which classify at the fold or superfamily level, and can include the targets split into domains.
| Level | CASP only | include split domains |
| superfamily | table | table |
| fold | table | table |
| Sub sets | CASP only | include split domains |
| HM | table | table |
| FR | table | table |
A single template is assumed for every model. The template used for each model will be the first one listed in the 'PARENT' field of the submitted model (raw submission). All SCOP domains in the template and the target will be compared. If any pair of domains in the target and the template belong to the same SCOP superfamily, the model is judged as 'true'. When judging at the superfamily level, if any pair of domains in the target and the template belong to the same SCOP fold, but no pair belongs to the same superfamily, the model is judged as ambiguous and is neither 'true' nor 'false'. If no pair of domains in the target and the template belong to the same SCOP fold, the model is judged as 'false'. There are documented exceptions to the classification in SCOP, and these are taken into consideration wherever humanly possible. The major ones are listed in the following three points in the list. If any pair of domains in the target and the template belong to the TIM-barrel fold the model is judged as 'true'. If any pair of domains in the target and the template belong to any of the Rossmann folds (NAD(P), FAD/NAD(P), or Nucleotide binding domains) the model is judged as 'true'. The families in the Membrane all-alpha superfamily are more like superfamilies and are treated accordingly. If the template and the target belong to different families within the Membrane all-alpha superfamily, the model is judged as 'false'.
The classification of the targets into SCOP is provisional and generously provided with the caveat that it may change before the next SCOP release. There is a small chance that an incorrect model will be judged 'true' when built from a template which by chance contains a SCOP domain in the same superfamily as the target, although the model does not use that part of the template. As this is unlikely to happen often by chance, it is not expected to affect the results. Models which are built from more than one template will only have one template (the first listed) considered. Since a binary (true/false) decision is being made, by judging only the first listed template, the result is not affected unless there are other templates listed which would be 'true' when the first is 'false'. Methods using models which make modifications to the backbone of the template will not be as relevant for this comparison. It is not possible in all cases to extract a template at all. This is reflected in the numbers in the first column of the tables. Templates using theoretical models are ignored. Templates which have no record in PDB are ignored. Templates which are old or have been replaced are kept and the classification obtained from old SCOP releases.