Professional Documents
Culture Documents
Protein
The properties of a protein are largely determined by its three-dimensional structure.
In the 1930s and 1940s, Linus Pauling and Robert Corey determined the X-ray structures of
several amino acids and dipeptides in an effort to elucidate the structural constraints on the
conformations of a polypeptide chain.
These studies indicated that the peptide group has a rigid, planar structure, which, Pauling
pointed out, is a consequence of resonance interactions that give the peptide bond an 40%
double-bond character:
The structure of each amino acid in a polypeptide can be adjusted by rotation about two
single bonds. (A) Phi () is the angle of rotation about the bond between the nitrogen and the
a-carbon atoms, whereas psi () is the angle of rotation about the bond between the a-
carbon and the carbonyl carbon atoms. (B) A view down the bond between the nitrogen and
the a-carbon atoms, showing how is measured. (C) A view down the bond between the a-
carbon and the carbonyl carbon atoms, showing how y is measured.
Polypeptide Backbone Conformations May Be Described by Their Torsion Angles.
It is important because they indicate that the backbone of a protein is a linked
sequence of rigid planar peptide groups.
We can therefore specify a polypeptides backbone conformation by the torsion
angles (rotation angles or dihedral angles) about Ca -N bond () and Ca - C bond
() of each of its amino acid residues.
These angles, and , are both defined as 180 when the polypeptide chain is in its
planar, fully extended (all-trans) conformation and increase for a clockwise rotation
when viewed from Ca.
Pauling and Corey predicted the structure of the helix 6 years before it was actually seen
in the x-ray reconstruction of the structure of myoglobin.
The a-helical content of proteins ranges widely, from none to almost 100%. For
example, about 75% of the residues in ferritin, a protein that helps store iron, are in
helices. Indeed, about 25% of all soluble proteins are composed of helices connected by
loops and turns of the polypeptide chain. Single a helices are usually less than 45
long. Many proteins that span biological membranes also contain helices.
pleated sheet
In 1951, the year that they proposed the a helix, Pauling and Corey also postulated the
existence of a different polypeptide secondary structure, the pleated sheet.
Structure of a strand.
The side chains (green) are
alternately above and below
the plane of the strand.
b Pleated sheets come in two varieties:
The antiparallel pleated sheet, in which neighboring hydrogen bonded polypeptide
chains run in opposite directions.
The parallel pleated sheet, in which the hydrogen bonded chains extend in the same
direction.
The distance between adjacent amino acids along a b strand is approximately 3.5 , in
contrast to a distance of 1.5 along an a helix.
The side chains of adjacent amino acids point in opposite directions. A b sheet is formed
by linking two or more b strands lying next to one another through hydrogen bonds.
Adjacent chains in a b sheet can run in opposite directions (antiparallel sheet) or in the
same direction (parallel sheet).
In the antiparallel arrangement, the NH group and the CO group of each amino acid are
respectively hydrogen bonded to the CO group and the NH group of a partner on the
adjacent chain. In the parallel arrangement, the hydrogen-bonding scheme is slightly more
complicated.
For each amino acid, the NH
group is hydrogen bonded to the
CO group of one amino acid on
the adjacent strand, whereas the
CO group is hydrogen bonded to
the NH group on the amino acid
two residues farther along the
chain.
Many strands, typically 4 or 5 but
as many as 10 or more, can come
together in b sheets. Such sheets
can be purely antiparallel, purely
parallel, or mixed.
The conformations in which these b structures are
optimally hydrogen bonded vary somewhat from that of a
fully extended polypeptide ( = = 180).
They therefore have a rippled or pleated edge-on
appearance (Fig.), which accounts for the designation
pleated sheet. In this conformation, successive side
chains of a polypeptide chain extend to opposite sides of
the pleated sheet with a two-residue repeat distance of
7.0 .
b Sheets are common structural motifs in proteins. In
globular proteins, they consist of from 2 to as many as 15
polypeptide strands, the average being 6 strands, which
have an aggregate width of 25.
The polypeptide chains in a sheet are known to be up to
15 residues long, with the average being 6 residues that
have a length of 21.
Parallel b sheets of less than five strands are rare. This
observation suggests that parallel b sheets are less stable
than antiparallel b sheets, possibly because the hydrogen
bonds of parallel sheets are distorted in comparison to
those of the antiparallel sheets.
Mixed parallel antiparallel b sheets are common but, nevertheless, only 20% of
the strands in b sheets have parallel bonding on one side and antiparallel bonding
on the other (vs an expected 50% for the random mixing of strand directions).
The b pleated sheets in globular proteins invariably exhibit a pronounced right-
handed twist when viewed along their polypeptide strands. Such twisted b sheets
are important architectural features of globular proteins since b sheets often form
their central cores.
Nonrepetitive Structures
Regular secondary structureshelices and b sheets comprise around half of the
average globular protein.
The proteins remaining polypeptide segments are said to have a coil or loop
conformation. That is not to say, however, that these nonrepetitive secondary
structures are any less ordered than are helices or b sheets; they are simply irregular
and hence more difficult to describe.
Globular proteins consist largely of approximately straight runs of secondary structure
joined by stretches of polypeptide that abruptly change direction. Such reverse turns
or b bends (so named because they often connect successive strands of antiparallel b
sheets) almost always occur at protein surfaces; indeed, they partially define these
surfaces.
Most reverse turns involve four successive amino acid residues more or less arranged
in one of two ways, Type I and Type II, that differ by a flip of the peptide unit linking
residues 2 and 3.
Both types of b bends contain a hydrogen bond, although deviations from these ideal
conformations often disrupt this hydrogen bond.
Type I b bends may be considered to be distorted sections of 310 helix. In Type II b
bends, the oxygen atom of residue 2 crowds the Cb atom of residue 3, which is
therefore usually Gly. Residue 2 of either type of b bend is often Pro since it can
facilely assume the required conformation.
Almost all proteins 60 of residues contain one or more loops of 6 to 16 residues that
are not components of helices or b sheets and whose end-to-end distances are 10.
Such loops (so named because they have the necked-in shape of the Greek
uppercase letter omega), which may contain reverse turns, are compact globular entities
because their side chains tend to fill in their internal cavities.
Since loops are almost invariably located on the protein surface, they may have
important roles in biological recognition processes.
Protein Architecture
Tertiary Structure of Small Globular Proteins:
Myoglobin
John Kendrew and Max Perutz shared the 1962
Nobel Prize in Chemistry for determining the
structure of myoglobin and other globular
proteins.
The first breakthrough in understanding the three-dimensional structure of a globular
protein came from x-ray diffraction studies of myoglobin carried out by John Kendrew
and his colleagues in the 1950s.
Myoglobin contains a single polypeptide chain of 153 amino acid residues of known
sequence and a single iron protoporphyrin, or heme, group.
The same heme group that is found in myoglobin is found in hemoglobin, the oxygen-
binding protein of erythrocytes, and is responsible for the deep red-brown color of both
myoglobin and hemoglobin.
(PDB ID 1MBO)
The longest a helix has 23 amino acid residues
and the shortest only 7; all helices are right-
handed. More than 70% of the residues in
myoglobin are in these - a helical regions. X-ray
analysis has revealed the precise position of each
of the R groups, which occupy nearly all the
space within the folded chain.
The positioning of amino acid side chains reflects a structure that derives much of its
stability from hydrophobic interactions. Most of the hydrophobic R groups are in the
interior of the molecule, hidden from exposure to water.
All but two of the polar R groups are located on the outer surface of the molecule, and
all are hydrated. The myoglobin molecule is so compact that its interior has room for
only four molecules of water.
This dense hydrophobic core is typical of globular proteins. The fraction of space
occupied by atoms in an organic liquid is 0.4 to 0.6. In a globular protein the fraction is
about 0.75, comparable to that in a crystal (in a typical crystal the fraction is 0.70 to
0.78, near the theoretical maximum).
In this packed environment, weak interactions strengthen and reinforce each other. For
example, the nonpolar side chains in the core are so close together that short-range van
der Waals interactions make a significant contribution to stabilizing hydrophobic
interactions.
Three of the four Pro residues are found at bends. The fourth Pro residue occurs within an a helix,
where it creates a kink necessary for tight helix packing.
The flat heme group rests in a crevice, or pocket, in the myoglobin molecule. The iron atom in the
center of the heme group has two bonding (coordination) positions perpendicular to the plane of
the heme.
One of these is bound to the R group of the His residue at position 93; the other is the site at
which an O2 molecule binds. Within this pocket, the accessibility of the heme group to solvent is
highly restricted.
This is important for function, because free heme groups in an oxygenated solution are rapidly
oxidized from the ferrous (Fe2+) form, which is active in the reversible binding of O2, to the ferric
(Fe3+) form, which does not bind O2.
We need to analyze its folding patterns
Motif, also called a supersecondary structure or fold. A motif is simply a recognizable
folding pattern involving two or more elements of secondary structure and the connection(s)
between them.
A motif can be very simple, such as two elements of secondary structure folded against each other,
and represent only a small part of a protein for example a -a- loop.
A motif can also be a very elaborate structure involving scores of protein segments folded together,
such as the b barrel. In some cases, a single large motif may comprise the entire protein. The term
encompasses any advantageous folding pattern and is useful for describing such patterns. The
segment defined as a motif may or may not be independently stable.
Note that a motif is not a hierarchical structural element falling between secondary and tertiary
structure.
The protein portion, the globin, consists of two a chains (141 residues each) and two
chains (146 residues each).
The subunits of hemoglobin are arranged in symmetric pairs, each pair having one a and one
subunit. Hemoglobin can therefore be described either as a tetramer or as a dimer of
a protomers.
1. Identical subunits of multimeric proteins are generally arranged in one or a limited set of
symmetric patterns. Oligomers can have either rotational symmetry or helical symmetry;
that is, individual subunits can be superimposed on others by rotation about one or more
rotational axes or by a helical rotation.
2. There are several forms of rotational symmetry. The simplest is cyclic symmetry, involving
rotation about a single axis. If subunits can be superimposed by rotation about a single axis, the
protein has a symmetry defined as Cn (C for cyclic, n for the number of subunits related by the
axis). The axis itself is described as an n fold rotational axis. The a protomers of hemoglobin
are related by C2 symmetry.
A somewhat more complicated rotational symmetry is dihedral symmetry, in which a
twofold rotational axis intersects an n-fold axis at right angles; this symmetry is defined as