Table I: Results Pos. where was the
structure found in the list of hits; NA number of aligned residues
between matched structures; r.m.s.d. root mean square deviation. Pdb ID if
present reflect the structure that was found instead of structure mentioned in
second column (might be connected with information about sequence identity with
the structure from second column). Pride is not covered since it didnt
succeed in any single case.
Query |
Non-trivialsimilarity |
Dali
Pos., NA, r.m.s.d. () |
Vast
Pos., NA, r.m.s.d. () |
Matras
|
Dejavu
Pos., NA, r.m.s.d.
()
|
SSMPos.,
Match
|
TopPos., NA |
TopsPos. |
Lock
Pos., NA, r.m.s.d. () |
TopscanPos.,
|
CE
Pos., NA, r.m.s.d. () |
|
1fxi:a |
1ubq |
11., 64, 3,2 |
6., 51, 2,3 |
1ndd(57%) |
16., 52, 1,794 |
45., 83 % |
No |
Yes, score 2 |
3., 50, 1,774 |
no |
no |
|
1ten: |
3hhr:b |
10., 86, 2,1 |
101., 60, 1,6 |
1axiB |
no |
No |
no |
Yes |
no |
19. |
51., 87, 1,9 |
|
3hla:b |
2rhe |
13, 86, 2,1 |
no |
No |
no |
No |
8fab, 31., 74 |
no |
7., 53, 1,769 |
no |
Far from the beg, 84, 3,4 |
|
2aza:a |
1paz |
16., 124, 2,6 |
22., 69, 2,2 |
1bqk(66%) |
21.,74, 1,750 |
No |
no |
no |
6.,67, 1,769 |
Yes , |
Far, 85, 2,9 |
|
1cew:I |
1mol:a |
4., 81, 2,3 |
12., 66, 2,0 |
3 |
1., 74, 1,655 |
No |
Yes |
3., |
3., 57, 1,752 |
no |
7., 74, 2,1 |
|
1cid |
2rhe |
12., 93, 2,8 |
No with subset |
No |
No |
No |
no |
no |
17., 49, 1,864 |
Yes? |
f., 96, 3,4 |
|
1crl |
1ede |
18, 210, 3,5 |
30., 186, 3,4 |
1b6q |
No |
No |
no |
NO??? |
4., 114, 2,082 |
57., score 29,333 ?? |
f., 213, 4.1 |
|
2sim |
1nsb:a |
9., 3,8, 287 |
7., 291, 3,8 |
1nsc |
41., 130, 2,170 |
No |
no |
Yes |
2., 208, 1.996 |
9., score 55,0 |
f., |
|
1bge:b |
2gmf:a |
no |
8., 68, 2,1 |
18 |
No |
No |
99., |
no |
9., 74, 2,00 |
no |
>200, 107, 4,1 |
|
1tie |
4fgf |
10., 109, 2,9 |
24., 76, 1,9 |
1bfg |
26., 93, 1,636 |
No |
11., 76 |
no |
3., 97, 1,577 |
no |
28., 128, 3,2 |
|
1g61 |
1bwd |
5., 182, 3,6 |
5, 161, 3,9 |
5 |
8., 116, 2,137 |
No |
Yes, |
1.,4jdw |
no |
1., 4jdw |
2.,
142, |
Notes to results:
Dali found to 3hla:b only 8fab, which is structural representative for 2rhe a
values in the table are for 8fab.
Values for top to 3hla:b are to 8fab.
Vast values for aza:2 are actually for structure 1pmy and lock values are for
structure 1pzc
Tops found 1ede as a result for 1crl on 15th position between
irrelevant results (score 45)
Results (values) for 2aza with Dali server are for 1bqk structure, which has
65%identity with 1paz and is structure representative in FSSP for 1paz.
Dali results to 1cid are for structure 8fab:A. Dali results to 2sim are for
structure 2qwc, which is only 30 identical to 1nsb.
Dali didnt find the result for 1bge:b, although it found it three years ago.
CE found the result for 1bge:b only in the whole database and not in the
representatives, but the 2gmf is in the set of representatives.
Lock values for 1tie are to structure 2fbf, Vast values are to 1bff and top
values to 1bfg, what is everything similar protein.
Vast did not find result for 1cid in subset of PDB database, but found it in
all PDB database on position 267, 85 aligned residues with rmsd 1,6
Table
II: Processing times - given values refer to period of time among sending the
query and obtaining the results. I have used pre-computed results in the case
of CE server, therefore I dont have any values for CE processing times (more
in paper).
|
|
Dali |
Vast |
Dejavu |
Matras |
SSM |
Top |
Tops |
Lock |
Topscan |
|
1fxi:a |
3 min |
10 min |
73 min |
6 min |
Sec |
65 min |
2 min |
7 min |
sec |
|
1ten: |
9 min |
10 min |
88 min |
18 min |
1 min |
62 min |
2 min |
12 min |
Sec |
|
3hla:b |
8 min |
8 min |
17 min |
34 min |
1 min |
7 min |
6 min |
10 min |
Sec |
|
2aza:a |
ND |
15 min |
28 min |
27 min |
2 min |
10 min |
3min |
8 min |
sec |
|
1cew:I |
5min |
10 min |
94 min |
15 min |
Sec |
8 min |
3 min |
10 min |
Sec |
|
1cid: |
ND |
8 min |
94 min |
17 min |
1 min |
10 min |
5 min |
11 min |
Sec |
|
1g61 |
6 min |
27 min |
38 min |
23 |
5 min |
21 min |
12 min |
24 min |
sec |
|
1crl |
12 min |
85 min |
170 min |
34 min |
4 min |
16 min |
7 min |
26 min |
1 min |
|
2sim |
6 min |
ND |
115 min |
53 min |
2 min |
10 min |
20 min |
5O min |
40 sec |
|
1bge:b |
4 min |
6 min |
28 min |
12 min |
Sec |
28 min |
1 min |
7 min |
sec |
|
1tie |
150 min |
10 min |
48 min |
30 min |
2 min |
15 min |
6 min |
12 min |
sec |
Table III: Summary of results
|
method |
Dali |
Vast |
Dejavu |
Matras |
Pride |
SSM |
Top |
Tops |
Topscan |
Lock |
CE |
|
Correct results |
10/11 |
9/11 |
6/11 |
9/11 |
0/11 |
1/11 |
5/11 |
5/11 |
6/11 |
9/11 |
10/11 |
Ten difficult cases other way
round:
I have tried one of the ten difficult cases (1cew) also other way round: that
means I took the structure, which I was supposed to find in first search and
used it as a query. I looked if I can find the previous query among the results
and I checked also the given values for the quality of match (scores, rmsds)
I found out that the results and even the scores were as expected with for all the servers except of
Vast,Top, Topscan, Dejavu and Pride.
Vast, Top and Topscan found the structure, but with different scores and
Dejavu and Pride failed in this
test.
Vast : 1mol as a query found 73 aa overlap and rmsd 1,8 for structure 1 cew,
while 1cew as a query found overlap of only 66 amino acids and rmsd 2,0 !
Top: 1cew as a query: values 4,74 structural diversity, 63 aligned residues,
18, 7 topological diversity and 5 aligned SSE.
1mol as a query gives 3,54 structural diversity, 64 aligned residues, 14, 5
topological diversity and 4 aligned SSE
Topscan found 1dim (another structure of the same protein as 2sim) with
1nsb:A as the query structure with score 64,15, while 2sim as a query found
1nsb with score 55,0
Table IV: Structures summary this list shows only structures that were used
as a query
|
|
Resolution () |
lenght |
SSE content (%) |
|
1fxi:a |
2,2 |
96 |
38 |
|
1ten: |
1,8 |
90 |
53 |
|
3hla:b |
2,6 |
99 |
50 |
|
2aza:a |
1,8 |
129 |
49 |
|
1cew:I |
2,0 |
108 |
69 |
|
1g61:A |
1,3 |
225 |
58 |
|
1cid: |
2,8 |
177 |
58 |
|
1crl |
2,06 |
534 |
50 |
|
2sim |
1,6 |
381 |
50 |
|
1bge:b |
2,2 |
159 |
72 |
|
1mol:A |
1,7 |
94 |
71 |
|
1tie |
2,5 |
166 |
43 |
I mentioned that 1cid has two identical domains in one chain, which fact can
biased the results in some of the programs. I used in my test whole chain
according to recipe (Fischer et al.,1996).