a) In each cell, the predicted initiation codon is shown at the upper left, the termination codon at the lower left, the number of amino acids at the upper right and the number of nucleotides at the lower right. Start codons other than ATG, and stop codons other than TAA and TAG, are shown in bold and underlined. Dashes (---) indicate that the gene has not been fully sequenced (S. malayensis only). Trematodes - Sja: Schistosoma japonicum (Schistosomatidae, GenBank accession AF215860); Sme: S. mekongi (Schistosomatidae, AF217449); Sml: S. malayensis (Schistosomatidae, AF295106); Smn: S. mansoni (Schistosomatidae, AF216698); Fhe: Fasciola hepatica (Fasciolidae, AF216697); Pwe: Paragonimus westermani (Paragonimidae, AF219379). Cestodes - Tcr: Taenia crassiceps (Taeniidae, AF216699); Tso: Taenia solium (Taeniidae, AB086256); Emu: Echinococcus multilocularis (Taeniidae, AB018440); Egr: E. granulosus (Taeniidae; G1: genotype 1 (sheep-dog strain), AF297617; G4: genotype 4 (horse-dog strain), AF346403); Hdi: Hymenolepis diminuta (Hymenolepididae, AF314223).
Species | Base composition a): T (%) | C (%) | A (%) | G (%) | T+A (%) | Total bp usage c) | Total codon no. d) | Codon ending with a): T, All | T, FFR | C, All | C, FFR | A, All | A, FFR | G, All | G, FFR |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Trematodes | | | | | | | | | | | | | | | |
S. mansoni | 45.6 | 8.2 | 23.3 | 23.0 | 68.9 | 10344 | 3448 | 47.6 | 51.8 | 4.2 | 4.3 | 27.1 | 24.1 | 21.1 | 19.7 |
S. japonicum | 48.3 | 8.0 | 23.0 | 20.7 | 71.3 | 10218 | 3406 | 54.8 | 64.3 | 3.4 | 3.7 | 23.9 | 19.6 | 17.9 | 12.5 |
S. mekongi | 48.4 | 6.7 | 24.3 | 20.6 | 72.7 | 10296 | 3432 | 56.0 | 65.9 | 0.9 | 1.2 | 25.5 | 21.0 | 17.5 | 11.9 |
S. malayensis b) | 48.8 | 6.6 | 23.6 | 20.9 | 72.4 | 8145 b) | 2714 | 56.1 | 67.9 | 1.0 | 1.0 | 25.2 | 20.7 | 17.6 | 10.4 |
F. hepatica | 49.4 | 9.6 | 14.2 | 26.8 | 63.6 | 10104 | 3368 | 57.2 | 67.8 | 4.6 | 5.5 | 9.9 | 6.7 | 28.3 | 20.0 |
P. westermani | 38.3 | 17.9 | 13.2 | 30.6 | 51.5 | 10086 | 3362 | 34.1 | 35.4 | 21.7 | 19.4 | 7.1 | 6.7 | 37.1 | 38.5 |
Cestodes | | | | | | | | | | | | | | | |
T. solium | 48.6 | 7.9 | 23.5 | 20.0 | 72.1 | 10092 | 3364 | 53.4 | 56.7 | 3.7 | 3.8 | 26.3 | 25.3 | 16.7 | 14.2 |
T. crassiceps | 50.6 | 7.2 | 23.4 | 18.8 | 74.0 | 10095 | 3365 | 56.6 | 63.6 | 2.2 | 1.3 | 25.1 | 20.7 | 16.1 | 14.5 |
E. multilocularis | 50.6 | 7.1 | 18.3 | 24.0 | 68.9 | 10098 | 3366 | 57.8 | 61.8 | 2.3 | 2.1 | 16.0 | 15.4 | 23.8 | 20.8 |
E. granulosus (G1) | 49.9 | 7.6 | 16.8 | 25.7 | 66.7 | 10092 | 3364 | 56.3 | 58.7 | 3.4 | 4.0 | 12.6 | 11.1 | 27.7 | 26.2 |
E. granulosus (G4) | 50.1 | 7.4 | 17.7 | 24.8 | 67.8 | 10065 | 3355 | 56.8 | 60.4 | 2.9 | 2.8 | 14.1 | 12.8 | 26.2 | 24.0 |
H. diminuta | 47.4 | 9.5 | 23.7 | 19.4 | 71.1 | 10074 | 3358 | 50.7 | 52.1 | 6.0 | 5.1 | 26.7 | 25.8 | 16.6 | 16.9 |
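
The columns of the table above are linked by simple arithmetic: the T+A value is the sum of the T and A percentages, and the total codon number is the total bp divided by three. The following is a minimal Python sketch of that consistency check, using values copied from a few rows. The names `ROWS` and `check_row` and the rounding tolerance are illustrative assumptions and are not part of the original analysis.

```python
# Values copied from the table above:
# species -> (T %, A %, reported T+A %, total bp, total codons)
ROWS = {
    "S. mansoni": (45.6, 23.3, 68.9, 10344, 3448),
    "F. hepatica": (49.4, 14.2, 63.6, 10104, 3368),
    "P. westermani": (38.3, 13.2, 51.5, 10086, 3362),
    "T. crassiceps": (50.6, 23.4, 74.0, 10095, 3365),
}


def check_row(t, a, ta_reported, bp, codons, tol=0.1):
    """Return (ta_ok, codons_ok) for one table row.

    ta_ok: T % + A % matches the reported T+A % within `tol`
           (the tolerance is an assumed allowance for rounding).
    codons_ok: total bp is exactly three times the codon count.
    """
    ta_ok = abs((t + a) - ta_reported) <= tol
    codons_ok = bp == 3 * codons
    return ta_ok, codons_ok


if __name__ == "__main__":
    for species, row in ROWS.items():
        print(species, check_row(*row))
```

Every complete genome in the table passes both checks. The partial S. malayensis entry (8145 bp, 2714 codons) is the one row in which the bp total is not exactly three times the codon count.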
Species | A | C | D | E | F | G | H | I | K | L | M | N | P | Q | R | S | T | V | W | Y | Total b) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Sja | 85 | 95 | 87 | 64 | 367 | 234 | 48 | 330 | 56 | 493 | 111 | 119 | 81 | 27 | 58 | 348 | 92 | 351 | 104 | 232 | 3382 |
Sme | 78 | 102 | 82 | 68 | 342 | 234 | 55 | 352 | 66 | 496 | 117 | 123 | 82 | 24 | 55 | 354 | 87 | 348 | 100 | 243 | 3408 |
Smn | 84 | 102 | 66 | 74 | 327 | 249 | 53 | 304 | 60 | 536 | 115 | 97 | 75 | 31 | 64 | 379 | 67 | 418 | 108 | 215 | 3424 |
Fhe | 116 | 121 | 70 | 75 | 372 | 300 | 51 | 166 | 44 | 562 | 88 | 70 | 95 | 26 | 64 | 333 | 80 | 434 | 118 | 159 | 3344 |
Pwe | 151 | 110 | 66 | 81 | 339 | 320 | 60 | 124 | 50 | 577 | 77 | 70 | 91 | 28 | 71 | 360 | 84 | 417 | 111 | 149 | 3336 |
EgrG1 | 80 | 149 | 80 | 65 | 398 | 243 | 50 | 199 | 42 | 507 | 82 | 97 | 69 | 25 | 50 | 338 | 89 | 464 | 97 | 216 | 3340 |
EgrG4 | 79 | 140 | 82 | 66 | 407 | 228 | 52 | 207 | 44 | 499 | 87 | 100 | 71 | 25 | 53 | 333 | 91 | 455 | 97 | 215 | 3331 |
Emu | 82 | 148 | 76 | 63 | 418 | 236 | 49 | 221 | 43 | 499 | 82 | 106 | 71 | 24 | 51 | 344 | 88 | 436 | 93 | 212 | 3342 |
Tso | 81 | 138 | 84 | 68 | 413 | 193 | 56 | 314 | 48 | 494 | 86 | 126 | 71 | 23 | 49 | 371 | 95 | 343 | 90 | 197 | 3340 |
Tcr | 63 | 135 | 84 | 57 | 433 | 186 | 52 | 302 | 49 | 514 | 93 | 153 | 72 | 21 | 48 | 358 | 93 | 325 | 86 | 217 | 3341 |
Hdi | 100 | 131 | 67 | 73 | 417 | 190 | 53 | 297 | 47 | 507 | 83 | 136 | 80 | 22 | 52 | 365 | 110 | 298 | 88 | 217 | 3333 |
a) Start codons omitted. Schistosoma malayensis was omitted because a full sequence was not available. Names of species as for Table 1.
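
The amino acid counts can be turned into relative frequencies, which makes rows with different totals easier to compare. The following is a minimal Python sketch using the Sja row; the names `AMINO_ACIDS`, `SJA_COUNTS` and `aa_percentages` are illustrative assumptions rather than anything taken from the original analysis.

```python
# One-letter amino acid codes in the column order of the table above.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Counts copied from the Sja (Schistosoma japonicum) row.
SJA_COUNTS = [85, 95, 87, 64, 367, 234, 48, 330, 56, 493,
              111, 119, 81, 27, 58, 348, 92, 351, 104, 232]


def aa_percentages(counts):
    """Return (total, {amino acid: percentage of total}) for one row."""
    total = sum(counts)
    percentages = {aa: 100.0 * n / total for aa, n in zip(AMINO_ACIDS, counts)}
    return total, percentages


if __name__ == "__main__":
    total, pct = aa_percentages(SJA_COUNTS)
    most_common = max(pct, key=pct.get)
    print(total, most_common, round(pct[most_common], 1))
```

For the Sja row the summed counts reproduce the Total column (3382), and leucine (L) is the most frequent residue.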