Professional Documents
Culture Documents
USA
Vol. 84, pp. 6340-6344, September 1987
Biochemistry
ABSTRACT
A complementary DNA (cDNA) clone from a
human placenta cDNA library encoding extracellular superoxide dismutase (EC-SOD; superoxide:superoxide oxidoreductase, EC 1.15.1.1) has been isolated and the nucleotide sequence determined. The cDNA has a very high G+C content.
EC-SOD is synthesized with a putative 18-amino acid signal
peptide, preceding the 222 amino acids in the mature enzyme,
indicating that the enzyme is a secretory protein. The first 95
amino acids of the mature enzyme show no sequence homology
with other sequenced proteins and there is one possible
N-glycosylation site (Asn-89). The amino acid sequence from
residues 96-193 shows strong homology (:50%) with the rinal
two-thirds of the sequences of all known eukaryotic CuZn
SODs, whereas the homology with the P. leiognathi CuZn SOD
is clearly lower. The ligands to Cu and Zn, the cysteines
forming the intrasubunit disulfide bridge in the CuZn SODs,
and the arginine found in all CuZn SODs in the entrance to the
active site can all be identified in EC-SOD. A comparison with
bovine CuZn SOD, the three-dimensional structure of which is
known, reveals that the homologies occur in the active site and
the divergencies are in the part constituting the subunit contact
area in CuZn SOD. Amino acid sequence 194-222 in the
carboxyl-terminal end of EC-SOD is strongly hydrophilic and
contains nine amino acids with a positive charge. This sequence
probably confers the affnity of EC-SOD for heparin and
heparan sulfate. An analysis of the amino acid sequence
homologies with CuZn SODs from various species indicates
that the EC-SODs may have evolved from the CuZn SODs
before the evolution of fungi and plants.
The publication costs of this article were defrayed in part by page charge
payment. This article must therefore be hereby marked "advertisement"
in accordance with 18 U.S.C. 1734 solely to indicate this fact.
6340
6341
-10
MetLeuAlaLeuLeuCysSerCysLeuLeuLeuAlaAlaGlyAlaSerAsp
CTGGGTGCAGCTCTCTTTTCAGGAGAGAAAGCTCTCTTGGAGGAGCTGGAAAGGTGCCCGACTCCAGCCATGCTGGCGCTACTGTGTTCCTGCCTGCTCCTGGCAGCCGGTGCCTCGGAC
1
120
-1 +1
10
20
30
AlaTrpThrGlyGluAspSerAlaGluProAsnSerAspSerAlaGluTrpIleArgAspMetTyrAlaLysValThrGluIleTrpGlnGluValMetGlnArgArgAspAspAspGly
GCCTGGACGGGCGAGGACTCGGCGGAGCCCAACTCTGACTCGGCGGAGTGGATCCGAGACATGTACGCCAAGGTCACGGAGATCTGGCAGGAGGTCATGCAGCGGCGGGACGACGACGGC
40
240
50
60
70
ThrLeu~isAlaAlaCysGlnValGlnProSerAlaThrLeuAspAlaAlaGlnProArgValThrGlyValValLeuPheArgGlnLeuAlaProArgAlaLysLeuAspAlaPhePhe
ACG CT CCACGCCGC
80
CTGCCAGGTGCAG CCGT CGGC CACG CTGGACGCCGCGCAG C CCCGGGTGACCGGCGTCGTC CTCTT CCGG CAGCTTGCGCC CCG CGC CAAG CT CGACGCCTTCTTC
360
90
100
110
AlaLeuGluGlyPheProThrGluProAsnSerSerSerArgAlaIleHisValHisGlnPheGlyAspLeuSerGlaGlyCysGluSerThrGlyProHisTyrAsnProLeuAlaVa1
GCCCTGGAGGGCTTCCCGACCGAGCCGAACAGCTCCAGCCGCGCCATCCACGTGCACCAGTTCGGGGACCTGAGCCAGGGCTGCGAGTCCACCGGGCCCCACTACAACCCGCTGGCCGTG
480
120
130
140
150
ProHisProGlnHisProGlyAspPheGlyAsnPheAlavalArgAspGlySerLeuTrpArgTyrArgAlaGlyLeuAlaAlaSerLeuAlaGlyProHisSerIleValGlyArgAla
CCGCACCCGCAGCACCCGGGCGACTTCGGCAACTTCGCGGTCCGCGACGGCAGCCTCTGGAGGTACCGCGCCGGCCTGGCCGCCTCGCTCGCGGGCCCGCACTCCATCGTGGGCCGGGCC
600
160
170
180
190
ValValValHisAlaGlyGluAspAspLeuGlyArgGlyGlyAsnGlnAlaSerM'alGiuAsnGlyAsnAlaGlyArgArgLeuAlaCysCysValValGlyValCysGlyProGlyLeu
GTGGTCGTCCACGCTGGCGAGGACGACCTGGGCCGCGGCGGCAACCAGGCCAGCGTGGAGAACGGGAACGCGGGCCGGCGGCTGGCCTGCTGCGTGGTGGGCGTGTGCGGGCCCGGGCTC
720
200
210
220
TrpGluArgGlnAlaArgGluHisSerGluArgLysLysArgArgArgGluSerGluCysLysAlaAla***
TGGGAGCGCCAGGCGCGGGAGCACTCAGAGCGCAAGAAGCGGCGGCGCGAGAGCGAGTGCAAGGCCGCCTGAGCGCGGCCCCCACCCGGCGGCGGCCAGGGACCCCCGAGGCCCCCCTCT
840
GCCTTTGAGCTTCTCCTCTGCTCCAACAGACACCTTCCACTCTGAGGTCTCACCTTCGCCTCTGCTGAAGTCTCCCCGCAGCCCTCTCCACCCAGAGGTCTCCCTATACCGAGACCCACC
960
ATC CTT CCAT CCTGAGGAC CGC C CCAAC CCT CGGAGCC CCCCACTCAGTAGGTCTGAAGGC CTCCATTTGTACCGAAACAkCC CCG CTCACGCTGACAGC CTCCTAGG CT CCCTGAGGTAC
1080
CTTTCCACCCAGACCCTCCTTCCCCACCCCATAAGCCCTGAGACTCCCGCCTTTGACCTGACGATCTTCCCCCTTCCCGCCTTCAGGTTCCTCCTAGGCGCTCAGAGGCCGCTCTGGGGG
1200
GTTGC CTCGAGTCCCCCCAC CCCT C CCCAC CCAC CACCG CTC CCGCGGCAAGCCAGCCCGTGCAACGGAAGCCAGGCCAACTG C CCCGCGTCTTCAG CTGTTTCG CATC CACCGCCACCC
1320
CACTGAGAGCTGCTCCTTTGGGGGAATGTTTGGCAACCTTTGTGTTACAGATTAAAAATTCAGCAATTCAAAAAAA
1396
FIG. 1. cDNA sequence and deduced amino acid sequence of EC-SOD. Underlined amino acid sequences have been confirmed by amino
acid sequence analysis of intact EC-SOD and of tryptic peptides.
ants were selected on plates containing ampicillin. A recombinant plasmid carrying the cDNA insert was identified and
designated pLS3. Plasmid pLS3 DNA was subjected to
RESULTS
Nucleotide Sequence of Human EC-SOD cDNA. The nucleotide sequence and deduced amino acid sequence of the
cDNA insert of Xsp3 are shown in Fig. 1. The insert was
found to be 1396 base pairs (bp) long and contained an open
reading frame of 240 amino acids surrounded by 69-bp and
607-bp untranslated regions of the 5' and 3' ends, respectively. Underlined amino acid sequences have been confirmed by amino acid sequence analysis of intact EC-SOD
and of tryptic peptides. The start of mature EC-SOD (cf.
Materials and Methods) corresponds to amino acid 19 of the
deduced amino acid sequence. The preceding 18 amino acids
apparently represent a signal peptide. Like known signal
peptides, this sequence of amino acids is rich in hydrophobic
amino acids and the last residue is alanine, which is one of the
amino acids found in this position in known signal peptides
(26). The DNA sequence found at the translation initiating
codon, -CAGCCAUGC-, is homologous to the postulated
consensus sequence for eukaryotic initiation sites, -CCGCCAUG(G)- (27). A possible polyadenylylation signal with the
sequence -ATTAAA- homologous to the postulated consen-
6342
Ser
Arg
Val
Pro
Thr
No. of
Codon
UUA
UUG
CUU
CUC
CUA
CUG
UCU
UCC
UCA
UCG
AGU
AGC
CGU
CGC
CGA
CGG
AGA
AGG
GUU
GUC
GUA
GUG
CCU
CCC
CCA
CCG
ACU
ACC
ACA
ACG
codons
0
Amino
acid
Ala
0
1
7
1
Gly
11
1
4
1
Ile
5
0
His
Gln
Asn
Lys
0
5
Asp
CAU
CAC
CAA
CAG
AAU
AAC
AAA
AAG
GAU
GAC
Glu
GAA
Cys
GAG
UGU
UGC
UUC
UAU
UAC
0
10
Tyr
0
7
5
0
0
1
UUU
10
No. of
codons
1
21
1
9
1
17
Phe
0
8
Codon
GCU
GCC
GCA
GCG
GGU
GGC
GGA
GGG
AUU
AUC
AUA
7
3
0
9
0
11
0
7
0
5
0
14
0
15
1
(1987)
DISCUSSION
The amino acid sequence of human EC-SOD, as deduced
from the cDNA, contains a signal peptide, which indicates
that EC-SOD is a secretory protein as postulated (1, 3-5).
The sequence of the central portion of the mature EC-SOD
is highly homologous with the final two-thirds of the sequences of the CuZn SODs. The carboxyl-terminal end of
EC-SOD is very hydrophilic and contains many positively
charged amino acid residues. It is therefore likely to extend
out into the solvent. We propose that this part of the protein
confers the affinity of EC-SOD for heparin (5) and analogues
such as heparan sulfate leading in vivo to binding to cellular
surfaces (ref. 6; K. Karlsson and S.L.M., unpublished data).
The region of the EC-SOD sequence, residues 96-193,
homologous to the CuZn SOD family corresponds to residues
44-148 in bovine CuZn SOD, the three-dimensional structure
of which is known in detail (40). The first 43 residues of
bovine CuZn SOD constitute B-strands "la, 2b, 3c," and the
first half of "6d." EC-SOD was also deviant from residues
137-147 (92-102 in bovine CuZn SOD), which correspond to
B-strand "4f." These parts of bovine CuZn SOD are positioned in the outer surface of the "B-barrel," which constitutes the main part of the subunit contact area in the enzyme.
Since EC-SOD is a tetramer, its subunit contact area should
be different and larger. It is possible that the deviant
sequences in the homologous region and parts of the first 95
amino acids of EC-SOD are responsible for the tetramer
subunit contact.
The parts of bovine CuZn SOD that are homologous to
EC-SOD form the environment of the Cu and Zn atoms. All
ligands to the Cu and Zn atoms can be identified in EC-SOD,
as can the cysteines forming the intramolecular disulfide
bond. Arg-186 in EC-SOD corresponds to Arg-141 in the
entrance to the active site of bovine CuZn SOD. The
conformation of EC-SOD around the active site is therefore
expected to be very similar to that of the CuZn SODs. This
would explain the similarities of EC-SOD and CuZn SODs
with respect to specific activity (5) and sensitivity to inhibitors like cyanide, azide, diethyldithiocarbamate, and 1202
(8).
Comparison of EC-SOD with sequences of CuZn SODs
from different species allows some speculations on the
evolutionary history of the EC-SODs (Fig. 2). If the 98 amino
acids of EC-SOD in sequence positions 96-193 are compared
with homologous positions in the CuZn SODs, the following
numbers of identical amino acids are found: human, 53; pig,
56; cow, 55; horse, 52; swordfish, 52; fruit fly, 49; spinach,
54; bakers' yeast, 48; P. leiognathi, 30. Except for P.
leiognathi, the identities with the CuZn SODs from various
species cannot be judged to differ significantly and follow any
6343
30
20
10
1
W T E G n S A E P N S D S A E W I R D M Y A K V T E I W Q E V M Q R R D D D G
40
T L H A A C Q V
Human CuZn so0
"
Pig
Cow
"
Horse
"
Sword fish
"
Fruit fly
"
Spinach
"
Bakers yeast
P. leiognathi"
80
F A L
K [W
L VIT
V WT
V L K
G L.G
K IVS
T IV N
TWS
60
50
P S A T
A T - K
A T - K
A T - K
A L - K
V L - K
V V-K
A T K K
V 0 D
D AAA
V C V L
V C V L
V C V L
V C V L
V C V L
A V CV
A V A V L
AVA V L
L T V K M
L
A
A
A
A
A
R V
G D
GO
G D
G D
G A
GD
K G T
K GD
T D L
S S R A I HV H
G LIHG F H V H
G
G DIH G F H V H
G
G DH G F H V H
G
G IDH G F H V H
G
G EIH G F H V H
I
G LIH G F H V H
G
G K H G F HLH
V
N A E ]G F HIIH
Y
PEAIMI HGFHIH
VTFTPELAD-LJT
S
E
E
E
K
P
K
P
L SRK H G GIPI K DJ E R H V G D
E S KK H G GPI K IDL E R H V G D
E R H V G D
L S K K H G GIP K D
H G G P K D E E
G[P K D E D
H G A P V D E N
H G A P E I ElV
H G AIP TID EIV
H G F P w T D D
IHA
N K
110
G C [ENP
E F G D N T A G C T S A G P H F N P
Q
E
G
E
E
E
F G D N T Q G C T S A G P H F N P
G D N TQ G C T S A G P H F N P
G D N T Q G C T T|A G IH F N P
G D N T N G C I S A G P H F N P
F
F
F
F
F
G P H F N P
G D N T N G C M S
Grp H F N P
G D T T N G C M S
FG
DA
NGCVSAGPHFNPHDL
R H V G D
R H V G D
G D
R H
R H A G D
gH V G D
W R Y
R A G L A A S
S Vr S
V Al D V sT E
LGN
K DG V A T V YII EDI S V III A
LGN V TAI
K N G V A I V D LI V D P L I S
L G N V T IA
D V D M K D S V IIS
E N
L G N V T A
TD-K I S
A NIG V AK I
L G N V T[A
G D C P T K V N IT D S K I T
E A
L G N
V
T
D N Q I P
I
E
A
V AIN T D G V A
L G N
G N K T D E NIG V AlK G S F KD S L I K
NT N P V L A P -R
FVSA
L
TAI
LISIGIDPHH C I I G R
ILl S Gl D H IS I I G R
Y S I
LISIGHE
LISIGIK HSI
P IS
G
I
LITI
L|FIGIA DSI
I G R
I G R
V R D
D
G
D
D
D
T
D|G
GE3A
1 70
160
150
L A
G E ID
VV
G R
[GGN
180
VAENGAG
T
G N A G
190
ACCV
A
A
A
A
A
H
EL KHAI M IAGGN
GV
C G V I G I
C G V IG I
C G V I
C G V I
C G V I
GIC G V I
A C G VWS
A C G V I
A C G V I
G
G
G
G
G
G
0
I
I
T
I
L
L
G
G
G
RL
RL
RL
RL
RL
RL
RI
RP
GG G
LT L K
140
130
120
L A VP
L S K K
A SIKK
Y G K E
D K K T
F KKT
70
G V V L F R Q L A P R A K L D A F
G I I N F E Q K E S N G P V
P V
P V Q G T I Y F E L K G E K - T V
P QGTIHFEAK-- G D T V
P
H GV I H F E Q Q Q E G G P V
E T T GJ T V Y LF E Q E G N A N A V
KG TV F F EQ ES S - - G T PUV
N V E G V V T L T Q E D - 0 G P T
G VS GV V K 7 QASESEPT
T G K P V G T I ffjL S Q N K Y GWV
100
90
G F P T E P N
L T
S T K
L A
T II K G
L T
S II T G
L T
F WE G
L T
I L K G
L A
E V C G
L A
R
S G
P
E A GN
T
G
G
G
G
G
A
S
A
0
P
K
K
K
K
R
220
210
200
C G P G L W E R Q A R E H S E R K K R R R E S E C K A A
Human
SOD
A Q
CuZn
Is
Pig
T
Cow
A K
Horse
A P
Sword fish
E
A K
Fruit fly
T P V
Spinach
Bakers yeast
T N
P. leiognathi
FIG. 2. Comparison of the deduced amino acid sequence (designated by the single-letter code) of mature human EC-SOD with those of CuZn
SODs from human (30, 31), pig (32), cow (33), horse (34), swordfish (35), fruit fly (36), spinach (37), bakers' yeast (38), and P. leiognathi (39).
Homologous amino acids are enclosed in boxes in positions where six or more are shared by the nine CuZn SODs. *, Sequence
(SEKDGKVVLGGA) in P. leiognathi not presented.
emerges if the same 98 amino acid positions are compared
among the CuZn SODs with the human enzyme as the basis:
pig, 86; cow, 83; horse, 84; swordfish, 77; fruit fly, 64;
The skillful technical assistance of Ms. Stina Olofsson and assistance with computer analysis of Dr. Robert Harr is gratefully
acknowledged. Dr. Lena Tibell and Dr. Gunnar Skogman are
thanked for valuable discussions. The study was supported by
Swedish Medical Research Council Grants 04761, 07149, and 03556;
6344