Professional Documents
Culture Documents
1
There are two general categories of methods for calculating
phylogenetic trees: distance-based and character-based. The distance-
based methods compute a matrix of pair wise distances between
sequences in an alignment, and then construct a tree based entirely on
the distance computations. Neighbor-Joining (9), WEIGHBOR (10),
BIONJ (11), FASTME (12) and a latest approach considering maximum-
likelihood estimated triplets of sequences (13) belong to this category.
The disadvantages of distance-based methods include the inevitable
loss of evolutionary information when a sequence alignment is
converted to pair wise alignment and bad performance on large
datasets.
2
Phylogenetic Tree:
Branch: defines the relationship between the taxa in terms of descent & ancestry
Node: a node represents a taxonomic unit. This can be a taxon (an existing
genus) or an unknown ancestor of 2 or more species.
Branch length: represents the number of changes that have occurred in that
branch.
3
Figure 1. Tree Structure & topology.
[http://users.ugent.be/~avierstr/principles/phylogeny.html
4
likelihood methods. The resulting tree is called a cladogram. Cladistic
methods use each alignment position as evolutionary information to
build a tree.
Sequences
Sequence A ACGCGTTGGGCGATGGCAAC
Sequence B ACGCGTTGGGCGACGGTAAT
Sequence C ACGCATTGAA TGATGATAAT
Sequence D ACACATTGAG TGATAATAAT
A B C D
A __ 3 7 6
5
B __ __ 6 7
C __ __ __ 3
D __ __ __ __
Clustering methods
Neighbor Joining: this method tries to correct the UPGMA method for
its assumption that the rate of evolution is the same in all taxa.
For each position in the alignment, all possible trees are evaluated and
are given a score based on the number of evolutionary changes
needed to produce the observed sequence changes. The most
parsimonious tree is the one with the fewest evolutionary changes for
all sequences to derive from a common ancestor. This is a more time-
consuming method than the distance methods.
6
evolutionary changes). The likelihood's for each aligned position are
then multiplied to provide likelihood for each tree. The tree with the
maximum likelihood is the most probable tree. This is the slowest
method of all but seems to give the best result and the most
information about the tree.
Ubiquitin
7
Random coil Beta sheet Alpha Helix
http://en.wikipedia.org/wiki/Ubiquitin
8
The ubiquitination system functions in a wide variety of cellular
processes, such as, Antigen processing, Apoptosis, Biogenesis of
organelles, Cell cycle and division, DNA transcription and repair,
Differentiation and development, and Immune response and
inflammation (23)
Reaction
9
Mn-SOD –Mitochondria, and many bacteria contain Mn-SOD. The
ligands of manganese ions are 3 histidine side chains, an aspartate
side chain and a water molecule or hydroxy ligand depending on the
Mn oxidation state (respectively II and III). (26)
No Alpha Helix
http://en.wikipedia.org/wiki/Superoxide_dismutase
Superoxide is one of the main reactive oxygen species in the cell and
serves a key antioxidant role. Defects in SODs results in the severe
pathologies. SOD knockout mice lacking SOD2 die several days after
birth, amidst massive oxidative stress. Mice lacking SOD1 develop
hepatocellular carcinoma, an acceleration of age-related muscle mass
loss, an earlier incidence of cataracts and a reduced lifespan. Mice
lacking SOD3 do not show any obvious defects and exhibit a normal
lifespan, but are more sensitive to hyperoxic injury. Knockout mice of
10
any SOD enzyme are more sensitive to the lethal effects of superoxide
generating drugs, such as paraquat and diquat. (28)
Calmodulin
Calmodulin (CaM = CALcium MODULated proteIN) binds calcium,
regulates a number of protein targets, thereby affecting a variety of
cellular functions
No Beta sheet
http://en.wikipedia.org/wiki/Calmodulin
Structure
Function
11
long-term memory, nerve growth and the immune response. CaM is
expressed in many cell types and can have different subcellular
locations, including the cytoplasm, within organelles, or associated
with the plasma or organelle membranes. Many of the proteins that
CaM binds are unable to bind calcium themselves, and as such use
CaM as a calcium sensor and signal transducer. CaM can also make
use of the calcium stores in the endoplasmic reticulum, and the
sarcoplasmic reticulum. CaM undergoes a conformational change upon
binding to calcium, which enables it to bind to specific proteins for a
specific response. CaM can bind up to four calcium ions, and can
undergo post-translational modifications, such as phosphorylation,
acetylation, methylation and proteolytic cleavage, each of which can
potentially modulate its actions. Calmodulin can also bind to edema
factor toxin from the anthrax bacteria. (30)
Mechanism
Calcium is bound via the use of the EF hand motif, which supplies an
electronegative environment for ion coordination. After calcium
binding, hydrophobic methyl groups from methionine residues become
exposed on the protein via conformational change. This presents
hydrophobic surfaces, which can in turn bind to Basic Amphiphilic
Helices (BAA helices) on the target protein. These helices contain
complementary hydrophobic regions. The flexibility of Calmodulin's
hinged region allows the molecule to "wrap around" its target. This
property allows it to tightly bind to a wide range of different target
proteins.(31)
Actin
12
Thus, actin participates in muscle contraction, cell motility, cell division
and cytokinesis, vesicle and organelle movement, cell signaling, and
the establishment and maintenance of cell shape and cell junctions. In
most of these processes actin interacts with cellular membranes. In
vertebrates, three main groups of actin isoforms, alpha, beta, and
gamma have been identified. The alpha actins are found in muscle
tissues and are a major constituent of the contractile apparatus. The
beta and gamma actins co-exist in most cell types as components of
the cytoskeleton, and as mediators of internal cell motility. (32)
Actin
http://en.wikipedia.org/wiki/Actin
Actin exhibits four main functions. It forms the most dynamic one of
the three subclasses of the cytoskeleton, which gives mechanical
support to cells, and hardwires the cytoplasm with the surroundings to
support signal transduction. It allows cell motility (see Actoclampin
molecular motors), phagocytosis of bacteria by macrophages. In
muscle cells, Actin forms the scaffold on which myosin proteins
generate force to support muscle contraction.
13
Myosin V walks towards the barbed end of actin filaments, while
myosin VI walks toward the pointed end. Most actin filaments are
arranged with the barbed end toward the cellular membrane and the
pointed end toward the cellular interior. This arrangement allows
myosin V to be an effective motor for export of cargos and myosin VI
to be an effective motor for import. (33)
Microfilament
14
Molecular structure of a tubulin dimer. The α-tubulin subunit is on top,
indicating a microtubule polarity with the (-) end towards the top of the
page. The two GTP subunits are drawn as space filling models, and the
paclitaxel molecule is attached to the β-tubulin subunit and drawn as a
ball and stick model.
15
while dimers bound to GDP tend to fall apart; thus, this GTP cycle is
essential for the dynamic instability of the microtubule.
16
The amino acid sequences were submitted to J-PRED ( J-PRED is a
secondary structure prediction program that folds given polypeptide in
their respective secondary structure) (44) to fold into different
secondary structures. From each folded sequence for given specie,
chains for α-helix, β-sheet and the random coil (RC) were dissected in
silico and regions of each conformation were concatenated in the same
order as they appear within the polypeptide. Concatenation is to link
together consecutive series of characters, the direction of
concatenation is from amino terminal to carboxy terminal. The
resultant concatenates of α helix, β sheet and RC were subjected to
Multiple sequence alignment by T-coffee to build NJ trees in Phylip and
visualized in MEGA.
Seq- MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDG
Subject to jpred
Seq MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDG
Sec structure -EEEEEE----EEEEEE----HHHHHHHHHHHH--------EEEE--------
E-Beta sheet, H- Alpha Helix, --- Random coil
17
Concatenation of Beta sheet
MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDG
-EEEEEE----EEEEEE----HHHHHHHHHHHH--------EEEE--------
19
Submit the sequences to Jpred (Secondary Structure
Prediction server).
21
and from these predicts secondary structure using a neural network
called Jnet. The server accepts two inputs types, a family of aligned
protein sequences or a single protein sequence. If a single protein
sequence is submitted a automatic process creates a multiple
sequence alignment, prior to prediction( 47).
MEGA
www.megasoftware.net/
Molecular evolutionary genetics analysis (MEGA) software with its focus
on facilitating the exploration and analysis of the DNA and protein
sequence variation from an evolutionary perspective. Currently in its
third major release MEGA4.0 contains facilities for automatic and
manual sequence alignment, web based mining of databases,
inference of the phylogenetic trees, estimation of evolutionary
distances and testing evolutionary hypothesis.
It is designed for comparative analysis of homologous gene sequences
either from multigene families or from different species with a special
emphasis on inferring evolutionary relationships and patterns of DNA
and protein evolution. MEGA provides many convenient facilities for
the assembly of sequence data sets from files or web-based
repositories, and it includes tools for visual presentation of the results
obtained in the form of interactive phylogenetic trees and evolutionary
distance matrices.(48)
PHYLIP
http://evolution.genetics.washington.edu/phylip.html
22
the Internet. The programs are controlled through a menu, which asks
the users which options they want to set, and allows them to start the
computation. Most of the programs look for the data in a file called
"infile" -- if they do not find this file they then ask the user to type in
the file name of the data file.Output is written onto special files with
names like "outfile" and "outtree". Trees written onto "outtree" are in
the Newick format. There is no mouse-windows interface for PHYLIP
(49)
Tree view
http://taxonomy.zoology.gla.ac.uk/rod/treeview.html
TreeView is a simple program for displaying phylogenies. It runs on
both Apple Macintosh and Windows PCs, using almost identical
interfaces. It can read many different tree file formats (including
NEXUS, PHYLIP, Hennig86, NONA, MEGA, and ClustalW/X).(50)
Fidelity Algorithm
The Fidelity Algorithm compares all positions on the taxonomic
dendrograms as benchmark to the observed clades or clusters in a
phylogenetic trees based on quantifiable parameters, such as MSA of
nucleic acidss and polypeptides (Milner and Modak, 2009).
23
proteins Alpha tubulin and Cu/znSOD shows that beta sheet as most
conserved secondary structure. In none of the proteins alpha helix has
highest score, its scores is between random coil and beta sheet. It shows
that the extent of conservation of alpha helix is intermediate between
random coil and alpha helix for the proteins under study. It appears that
in most cases the sequences in both α-helix and random coil are the
most conserved while β-sheet sequences are least conserved.
I conclude that the stretches of amino acids in the alpha helix
and random coil have a significant role to play in defining the
phylogenetic status. It also appears that beta sheets play the least
significant role in deciding the evolutionary status of a species.
24
CONCLUSION
• From the study conducted on 6 proteins, I conclude that random
coil is most conserved sequence in a way that it doesn’t take
part in evolution.
• Beta sheet has significant role in evolution
• It also appears that beta sheets play the least significant role in
deciding the evolutionary status of a species.
• Alpha helix is intermediate between random coil and beta sheet
for its contribution to evolution.
BIBLIOGRAPHY
1. Futuyma, Douglas J. (2005). Evolution. Sunderland,
Massachusetts: Sinauer Associates.
2. Worth CL, Gong S, Blundell TL, Structural and functional
constraints in the evolution of protein families
Structural and functional constraints in the evolution of protein families
Worth CL, Gong S, Blundell TL. Biochemistry Department, University
of Cambridge, UK.
Structural and functional restraints in the evolution of protein families
and superfamilies.Gong S, Worth CL, Bickerton GR, Lee S,
25
Tanramluk D, Blundell TL. Department of Biochemistry, University
of Cambridge, Cambridge CB2 1GA, UK.
Hubbard, T. J., and T. L. Blundell. 1987. Comparison of solvent-
inaccessible cores of homologous proteins: definitions useful for
protein modelling. Protein Eng 159-171.
3. Pauling L, Corey RB, Branson HR (1951). "The structure of
proteins; two hydrogen-bonded helical configurations of the
polypeptide chain". Proc Natl Acad Sci USA 37 (4): 205–211.
4. Neurath, H (1940). "Intramolecular folding of polypeptide chains
in relation to protein structure". Journal of Physical Chemistry
44: 296–305.
5. Bull, J.J. and H.A. Wichman. (2001) Applied evolution. Ann.
Rev. Ecol. System. 16:183-217.
6. Neighbour Joining Method (Saitou and Nei, 1987).
7. Bruno WJ, Socci ND, Halpern AL . Weighted neighbor joining: a
likelihood-based approach to distance-based phylogeny
reconstruction. Mol Biol Evol. 2000 Jan;17(1):189-97.
8. Olivier Gascuel* and Mike Steel Neighbor-Joining
RevealedMolecular Biology and Evolution 2006 23(11):1997-
2000; doi:10.1093/molbev/msl072
9. Desper, Richard, Gascuel, Olivier Distance-based Phylogeny
Reconstruction (Optimal Radius), **1999, Atteson; 2005, Elias,
Lagergren (2008)
10. Ranwez, Vincent Gascuel, Olivier Improvement of Distance-
Based Phylogenetic Methods by a Local Maximum Likelihood
Approach Using... (2002)
11. Camin, J. H., and R. R. Sokal. 1965. A method for deducing
branching sequences in phylogeny. Evolution 19: 311-326.
12. Felsenstein, J. 1981a. Evolutionary trees from DNA
sequences: a maximum likelihood approach. J. Molecular
Evolution 17: 368-376.
13. John P. Huelsenbeck and Fredrik Ronquist MRBAYES: Bayesian
inference of phylogenetic trees Bioinformatics Vol. 17 no. 8
2001Pages 754-755.
14. David L. Swofford, Florida State University PAUP*:
Phylogenetic Analysis Using Parsimony (and Other Methods) 4.0
Beta.
15. Gonzalo Giribet TNT: Tree Analysis Using New Technology
Systematic Biology 2005 54(1):176-178.
16. R.R Sokal Phenetic Taxonomy; Theory and Methods,
Annual Review of Ecology and systematics.
17. Harvey P.H., and M.D. Pagel. 1991. The Comparative
Method in Evolutionary Biology. Oxford University Press, Oxford
and New York. 239 pp.
18. http://en.wikipedia.org/wiki/Ubiquitin
19. Case Study: Ubiquitin Eduardo Cruz-Chu and JC Gumbart.
26
20. Essays in Biochemistry, Volume 41 (2005): The Ubiquitin-
Proteasome System (Portland Press).
21. McCord JM, Fridovich I (1988). "Superoxide dismutase: the first
twenty years (1968-1988)". Free Radic. Biol. Med. 5 (5-6): 363–
9.
22. http://en.wikipedia.org/wiki/Superoxide_dismutase
23. Borgstahl GE, Parge HE, Hickey MJ, Beyer WF Jr, Hallewell RA,
Tainer JA (1992). "The structure of human mitochondrial
manganese superoxide dismutase reveals a novel tetrameric
interface of two 4-helix bundles.". Cell 71 (1): 107–18.
24. Barondeau DP, Kassmann CJ, Bruns CK, Tainer JA, Getzoff ED
(2004). "Nickel superoxide dismutase structure and
mechanism". Biochemistry (25): 8038–47.
25. Tainer JA, Getzoff ED, Richardson JS, Richardson DC (1983).
"Structure and mechanism of copper, zinc superoxide
dismutase.". Nature (5940): 284–7.
26. http://en.wikipedia.org/wiki/Calmodulin.
27. Stevens FC (1983). "Calmodulin: an introduction". Can. J.
Biochem. Cell Biol. 61 (8): 906–10.
28. Chin D, Means AR (2000). "Calmodulin: a prototypical calcium
sensor". Trends Cell Biol. 10 (8): 322–8
29. http://en.wikipedia.org/wiki/Actin
30. Kabsch W, Vandekerckhove J Structure and function of
actin. Annu Rev Biophys Biomol Struct. 1992;21:49-76.
31. Tobacman LS, Korn ED. The kinetics of actin nucleation and
polymerization. J Biol Chem. 1983 Mar 10;258(5):3207-14.
32. Higaki T, Sano T, Hasezawa S. Actin microfilament
dynamics and actin side-binding proteins in plants. Curr Opin
Plant Biol. 2007 Dec;10(6):549-56.
33. Nogales E, Downing KH, Amos LA, Löwe J (June 1998). "Tubulin
and FtsZ form a distinct family of GTPases". Nat Struct Biol 5
(6): 451–8.
34. Heald R, Nogales E (January 2002). "Microtubule dynamics". J
Cell Sci 115 (Pt 1)
35. McNally FJ, Vale RD (November 1993). "Identification of katanin,
an ATPase that severs and disassembles stable microtubules".
Cell 75 (3): 419–29.
36. Notredame C, Higgins DG, Heringa J. T-Coffee: A novel method
for fast and accurate multiple sequence alignment. J Mol Biol.
2000 Sep 8;302(1):205-17
37. http://www.bioinformatics.uthscsa.edu/www/phylip/
38. Readseq by D.G. Gilbert, 2.1.26 (18-Oct-2007) Software at
http://iubio.bio.indiana.edu/soft/molbio/readseq/java/
39. Saitou and Nei, 1987 Neighbour Joining Method
27
40. Tamura K, Dudley J, Nei M, Kumar S.MEGA4: Molecular
Evolutionary Genetics Analysis (MEGA) software version 4.0.Mol
Biol Evol. 2007 Aug;24(8):1596-9. Epub 2007 May 7.
41. Cole C, Barber JD & Barton GJ. Nucleic Acids Res. 2008
42. Feng DF, Doolittle RF. Progressive sequence alignment as a
prerequisite to correct phylogenetic trees. J Mol Evol.
1987;25(4):351-60.
43. Christian Cole, Jonathan D. Barber and Geoffrey J. Barton*
The Jpred 3 secondary structure prediction server
44. Cuff JA, Barton GJ. Application of multiple sequence alignment
profiles to improve protein secondary structure prediction.
Proteins. 2000 Aug 15;40(3):502-11.
45. . Kumar S, Nei M, Dudley J, Tamura K, MEGA: a biologist-centric
software for evolutionary analysis of DNA and protein
sequences. Brief Bioinform. 2008 Jul;9(4):299-306. Epub 2008
Apr 16.
46. Joseph Felsenstein (August, 2003). Inferring Phylogenies.
47. Page RD. TreeView: an application to display phylogenetic trees
on personal computers. Comput Appl Biosci. 1996
Aug;12(4):357-8.
48. Milner (2008) Ph.D. Thesis, University of Karnataka,
Dharwad, in preparation.
APPENDIX
Phylogenetic Trees of various proteins are listed in the appendix. These
phylogenetic trees were constructed using NJ method.
Figure 1 Ubiquitin Full seq
28
CULQ U
BO TFB
TRYCR
APLCA
CA NA L
NEO FI
PA RBR
YEA ST
PICG U
SCHPO
NEUCR
CHA GB
KLULA
ARA TH
PEA
HELA N
M AIZE
ORY SJ
SO LLC
SO Y BN
W HEA T
CA EEL
DICDI
DA NRE
SHEEP
M OUSE
RA BIT
XENLA
BO VIN
CA VPO
CHICK
DRO M E
ONCM Y
SA LSA
HUM AN
PA NTR
RA T
AEDA E
29
CULQU
BOTFB
CAEEL
WHEAT
ORYSJ
MAIZE
HELAN
ARATH
SOYBN
SOLLC
PEA
TRYCR
PICGU
NEUCR
PARBR
KLULA
NEOFI
DICDI
XENLA
RAT
RABIT
MOUSE
HUMAN
DROME
DANRE
CAVPO
SHEEP
CHICK
PANTR
SALSA
ONCMY
BOVIN
SCHPO
YEAST
CANAL
CHAGB
APLCA
AEDAE
30
CULQU
BOTFB
TRYCR
MOUSE
CAVPO
ONCMY
HUMAN
SALSA
RABIT
RAT
SHEEP
BOVIN
DICDI
CHICK
DROME
PANTR
XENLA
CAEEL
APLCA
ARATH
HELAN
MAIZE
ORYSJ
PEA
WHEAT
CANAL
PICGU
KLULA
PARBR
YEAST
SCHPO
CHAGB
NEOFI
NEUCR
SOLLC
SOYBN
DANRE
AEDAE
31
CHAGB
SALSA
HUMAN
ORYSJ
SHEEP
CHICK
PICGU
MOUSE
RABIT
DICDI
NEOFI
MAIZE
PEA
SOYBN
XENLA
APLCA
PARBR
HELAN
NEUCR
RAT
WHEAT
BOTFB
CULQU
BOVIN
CAEEL
CANAL
CAVPO
DROME
ARATH
ONCMY
SCHPO
YEAST
DANRE
TRYCR
KLULA
PANTR
SOLLC
AEDAE
32
CULQU
APLCA
DROME
CAEEL
ARATH
HELAN
PEA
ORYSJ
MAIZE
WHEAT
SOYBN
SOLLC
TRYCR
BOTFB
NEOFI
PARBR
CHAGB
NEUCR
CANAL
PICGU
KLULA
YEAST
SCHPO
DICDI
PANTR
BOVIN
MOUSE
RAT
SHEEP
CAVPO
SALSA
CHICK
DANRE
HUMAN
ONCMY
RABBIT
XENLA
AEDAE
33
CULQU
DROME
APLCA
CAEEL
ARATH
PEA
MAIZE
ORYSJ
HELAN
WHEAT
SOLLC
BOTFB
PARBR
NEOFI
CHAGB
NEUCR
CANAL
PICGU
KLULA
YEAST
SCHPO
TRYCR
DICDI
SOYBN
ONCMY
SALSA
PANTR
CHICK
DANRE
BOVIN
MOUSE
RABBIT
RAT
SHEEP
CAVPO
HUMAN
XENLA
AEDAE
34
CAEEL
CULQU
APLCA
DROME
ARATH
SOYBN
HELAN
PEA
ORYSJ
MAIZE
WHEAT
SOLLC
TRYCR
BOTFB
CHAGB
NEUCR
NEOFI
PARBR
CANAL
PICGU
KLULA
YEAST
SCHPO
DICDI
BOVIN
DANRE
MOUSE
RAT
CAVPO
CHICK
SALSA
HUMAN
ONCMY
SHEEP
PANTR
RABBIT
XENLA
AEDAE
35
CULQU
DROME
APLCA
ARATH
WHEAT
HELAN
SOLLC
PEA
SOYBN
ORYSJ
MAIZE
CAEEL
DANRE
ONCMY
SALSA
BOVIN
SHEEP
HUMAN
PANTR
RABBIT
CAVPO
MOUSE
RAT
PICGU
TRYCR
XENLA
CHICK
BOTFB
NEOFI
CHAGB
NEUCR
PARBR
CANAL
KLULA
YEAST
SCHPO
DICDI
AEDAE
36
CULQU
DROME
CHICK
CAEEL
APLCA
ARATH
HELAN
ORYSJ
SOYBN
SOLLC
MAIZE
PEA
WHEAT
BOTFB
YEAST
CANAL
NEOFI
PARBR
CHAGB
NEUCR
KLULA
SCHPO
DICDI
PICGU
TRYCR
BOVIN
SHEEP
HUMAN
PANTR
RABBIT
CAVPO
MOUSE
RAT
DANRE
ONCMY
SALSA
XENLA
AEDAE
37
CULQU
DROME
TRYCR
APLCA
ARATH
WHEAT
DICDI
BOTFB
NEOFI
CHAGB
NEUCR
KLULA
PARBR
CANAL
SCHPO
YEAST
CAEEL
PICGU
HELAN
PEA
SOYBN
SOLLC
MAIZE
ORYSJ
DANRE
ONCMY
SALSA
XENLA
CHICK
HUMAN
PANTR
BOVIN
SHEEP
MOUSE
RAT
CAVPO
RABBIT
AEDAE
38
TRYCR
LEIMA
EUG GR
GIALA
PLAYO
PLAF7
TO XGO
KARM I
ARATH
DAUCA
HORVU
W HEAT
ORYSJ
PEA
GOSHI
MAIZE
CHLRE
ASPFU
BOTFB
NEO FI
NEUCR
CHAG B
CANAL
YEAS7
YEAST
SCHPO
PNECA
DICDI
AEDAE
BOM M O
CAEEL
DANRE
SALSA
XENLA
XENTR
CHICK
HUM AN
MACFA
MO USE
PANTR
PIG
RAT
9TRYP
39
TRYCR
LEIMA
GIALA
PLAYO
EUGGR
PLAF7
KARM I
TOXGO
CHLRE
ARATH
DAUCA
PEA
GOSHI
HORVU
W HEAT
MAIZE
ORYSJ
ASPFU
BOTFB
NEOFI
NEUCR
SCHPO
CHAGB
CANAL
YEAS7
YEAST
PNECA
AEDAE
BOM MO
CAEEL
DANRE
DICDI
SALSA
XENLA
XENTR
CHICK
HUM AN
MACFA
MOUSE
PANTR
PIG
RAT
9TRYP
40
TRYCR
LEIM A
GIALA
PLAYO
EUGGR
KARM I
TOXGO
PLAF7
ASPFU
NEUCR
BOTFB
NEOFI
CHAGB
CANAL
YEAS7
YEAST
CHLRE
ARATH
HORVU
W HEAT
M AIZE
PEA
ORYSJ
GOSHI
BOM MO
DAUCA
DICDI
CAEEL
DANRE
AEDAE
XENTR
SALSA
XENLA
PNECA
SCHPO
CHICK
HUM AN
M ACFA
M OUSE
PANTR
PIG
RAT
9TRYP
41
LEIMA
TRYCR
EUGGR
GIALA
CHLRE
ARATH
MAIZE
DAUCA
PEA
HORVU
W HEAT
ORYSJ
GOSHI
PLAF7
TOXGO
ASPFU
PLAYO
CANAL
BOTFB
NEUCR
NEOFI
CHAGB
YEAS7
YEAST
SCHPO
PNECA
BOM M O
KARM I
CAEEL
SALSA
XENLA
AEDAE
DANRE
DICDI
XENTR
CHICK
HUM AN
MACFA
MOUSE
PANTR
PIG
RAT
9TRYP
42
TRYCR
LEIMA
EUGGR
ARATH
HORVU
MAIZE
ORYSJ
DAUCA
GOSHI
PEA
W HEAT
CHLRE
KARMI
PLAF7
PLAYO
TOXGO
GIALA
ASPFU
NEOFI
BOTFB
CHAGB
SCHPO
PNECA
CANAL
YEAS7
YEAST
NEUCR
DICDI
CAEEL
SALSA
XENLA
CHICK
MACFA
XENTR
AEDAE
BOMMO
DANRE
HUMAN
MOUSE
PANTR
PIG
RAT
9TRYP
43
TRYCR
LEIM A
EUG GR
ARATH
HO RVU
MA IZE
ORYSJ
GO SHI
PEA
W HEAT
DA CAU
PLA F7
PLA YO
CHLRE
KA RM I
TO XG O
GIA LA
ASPFU
NEO FI
BO TFB
CHA GB
SCHPO
CA NAL
YEA S7
YEA ST
PNECA
NEUCR
DICDI
SA LSA
CA EEL
XENLA
AEDA E
BO M M O
CHICK
MA CFA
XENTR
DA NRE
PIG
HUM AN
MO USE
PA NTR
RA T
9TRYP
44
TRYCR
LEIMA
CHLRE
EUGGR
KARM I
TOXGO
ARATH
HORVU
MAIZE
ORYSJ
DACAU
W HEAT
PEA
GOSHI
DICDI
PLAF7
PLAYO
GIALA
ASPFU
NEOFI
BOTFB
CHAGB
NEUCR
PNECA
CANAL
YEAST
YEAS7
SCHPO
CAEEL
XENLA
MACFA
XENTR
DANRE
AEDAE
BOMM O
SALSA
CHICK
PIG
HUMAN
MOUSE
PANTR
RAT
9TRYP
45
LEIMA
TRYCR
ARATH
EUGGR
GOSHI
HORVU
ORYSJ
MAIZE
CHLRE
PEA
W HEAT
DAUCA
DICDI
KARMI
PLAF7
PLAYO
TOXGO
ASPFU
NEOFI
CHAGB
BOTFB
CANAL
YEAS7
YEAST
GIALA
NEUCR
SCHPO
CAEEL
PNECA
XENLA
XENTR
AEDAE
BOMMO
DANRE
HUMAN
MOUSE
PANTR
RAT
PIG
CHICK
MACFA
SALSA
9TRYP
46
TRYCR
LEIM A
EUG GR
KARM I
PLAF7
PLAYO
TO XG O
ARATH
DAUCA
GO SHI
PEA
HO RVU
M AIZE
W HEAT
ORYSJ
CHLRE
ASPFU
NEO FI
BO TFB
CHAG B
NEUCR
CANAL
YEAS7
YEAST
SCHPO
PNECA
DICDI
GIALA
BO M M O
CAEEL
CHICK
M ACFA
AEDAE
PIG
DANRE
SALSA
XENLA
XENTR
HUM AN
M OUSE
PANTR
RAT
9TRYP
47
TRYCR
LEIM A
EUG GR
KARM I
PLAF7
PLAYO
TO XG O
CHLRE
GIALA
ARATH
DAUCA
PEA
GO SHI
M AIZE
W HEAT
HO RVU
ORYSJ
ASPFU
NEO FI
BO TFB
CHAG B
NEUCR
CANAL
YEAS7
YEAST
SCHPO
PNECA
DICDI
AEDAE
BO M M O
CAEEL
CHICK
M ACFA
PIG
DANRE
SALSA
XENLA
XENTR
M OUSE
HUM AN
PANTR
RAT
9TRYP
48
TRYCR
LEIM A
EUG GR
ARA TH
DA UCA
M AIZE
W HEA T
GO SHI
HO RVU
PEA
ORYSJ
CHLRE
PLA F7
PLA YO
TO XG O
KA RM I
ASPFU
NEO FI
BO TFB
NEUCR
CHA GB
PNECA
CA NA L
YEA S7
YEA ST
SCHPO
DICDI
GIA LA
AEDA E
BO M M O
CHICK
M ACFA
CA EEL
PIG
DA NRE
SA LSA
XENLA
XENTR
PA NTR
HUM A N
M OUSE
RA T
9TRYP
49
TRYCR
LEIMA
DICDI
ARA TH
GO SHI
W HEAT
HO RVU
PEA
MAIZE
ORYSJ
DAUCA
EUG GR
CHLRE
KARM I
PLAF7
PLAYO
TO XGO
GIALA
CHICK
MACFA
AEDAE
ASPFU
NEO FI
BO TFB
CHA GB
NEUCR
CANAL
YEAS7
YEAST
PNECA
BO MM O
CAEEL
SCHPO
PIG
DANRE
XENLA
XENTR
SALSA
HUM A N
MO USE
PANTR
RAT
9TRYP
50
Polypeptide Full seq Alpha Helix Random coil Beta sheet
Ubiquitin 59.5 49.45 59.9 41.58
Calmodulin 57.87 57.51 58.22 --------
Cu/znSOD 65.03 ------- 62.16 63.22
Actin 64.88 63.35 65.22 61.33
Alpha 63.61 60.08 58.72 65.47
tubulin
Beta tubulin 66.80 64.67 67.52 62.22
OBJECTIVES
51
52