Professional Documents
Culture Documents
Research Article
Collagen Research Unit, Biocenter and Department of Medical Biochemistry, PO Box 5000, FIN-90014 University of Oulu, Finland
Institute for Medical Molecular Biology, Medical University of Lubeck, D-23538 Lubeck, Germany
* Correspondence to:
J. Myllyharju, Department of
Medical Biochemistry, University
of Oulu, PO Box 5000,
FIN-90014 University of Oulu,
Finland.
E-mail: johanna.myllyharju@oulu.
Abstract
Four human genes, two of them encoding the proa1 and proa2 chains of type I procollagen
and two of them the two types of subunit of prolyl 4-hydroxylase (4-PH), were integrated
into the genome of Pichia pastoris. The proa1 and proa2 chains expressed formed type I
procollagen molecules with the correct 2 : 1 chain ratio, and the 4-PH subunits formed an
active enzyme tetramer that fully hydroxylated the proa chains. Chains lacking their N but
not C propeptides formed pCcollagen molecules with the 2 : 1 chain ratio and, surprisingly,
the expression levels of pCcollagen were 1.53-fold relative to those of procollagen. Both
types of molecule could be converted by pepsin treatment to collagen molecules that
formed native-type brils in vitro. The expression levels obtained for the pCcollagen using
only single copies of each of the four genes and a 2 l fermenter ranged up to 0.5 g/l,
indicating that it should be possible to optimize this system for high-level production of
recombinant human type I collagen for numerous medical applications. Copyright # 2001
John Wiley & Sons, Ltd.
Keywords:
Introduction
The collagens are a family of extracellular matrix
proteins that play a dominant role in maintaining
the structural integrity of various tissues. The
molecule of type I collagen, the most abundant
collagen, is a long, thin rod. It consists of two
identical a1(I) chains and one a2(I) chain that are
coiled around one another in a triple-helical
conformation. The molecule is rst synthesized in
the form of a larger precursor that needs to be
processed by cleavage of the N and C propeptides
present in its proa1(I) and proa2(I) chains. The
collagen molecules then self-assemble into crossstriated brils in which each molecule is displaced
by about one-quarter of its length relative to its
nearest neighbour along the axis of the bril (for
reviews on collagens, see Kielty et al., 1993;
Prockop and Kivirikko, 1995; Kadler et al., 1996).
In addition to the type I collagen heterotrimer,
many tissues contain small amounts of an [a1(I)]3
homotrimer (Kielty et al., 1993). Early renaturation
Copyright # 2001 John Wiley & Sons, Ltd.
M. Nokelainen et al.
798
799
Expression vectors
Selection
Polypeptides expresseda
Proa1(I)
Proa2(I)
Proa1(I)+Proa2(I)
Proa2(I)/Proa1(I)
PCa1(I)
PCa2(I)
PCa1(I)+PCa2(I)
The PDI polypeptide that acts as the b subunit of 4-PH has the Saccharomyces cerevisiae a mating factor (aMF) prepropeptide in all strains,
whereas all other polypeptides have their own signal sequences.
b
Zeocin resistance.
a
Proa2(I)/Proa1(I) strain
A third strain expressing 4-PH was rst generated.
The coding region of the a subunit cDNA PA59
was ligated as a SmaI fragment (Vuorela et al.,
1997) into the EcoRI site of pBLADEIX and an
internal XbaI site in the cDNA was removed by
site-directed mutagenesis (Stratagene). A PDI
cDNA extending from the codon for the rst
amino acid after the signal peptide (Vuorela et al.,
1997) was ligated into the EcoRI site following the
S. cerevisiae a mating factor prepro sequence
in
pBLARGSX.
The
pBLADEIXa
and
pBLARGSXPDI, linearized with XbaI and BglII,
respectively, were co-transformed into the yJC300
(his4, arg4, ade1) strain. To generate a proa2(I)/
proa1(I) double expression vector, the cDNAs were
rst cloned separately into the EcoRI site of pAO815
(Invitrogen), as described for pPICZBproa1(I) and
pBLADEIXproa2(I). The proa1(I) expression cassette was cut from pAO815proa1(I) in two fragments, BglIIBamHI and BamHIBamHI, and
cloned downstream of the proa2(I) cassette into
pAO815proa2(I) digested with BamHI to generate
pAO815proa2(I)/proa1(I). This construct was linearized with SalI and transformed into the strain
expressing 4-PH described above. The strain was
named Proa2(I)/Proa1(I) (Table 1).
800
M. Nokelainen et al.
Results
Generation of recombinant P. pastoris strains
co-expressing human type I procollagen and
4-PH, and analysis of the recombinant proteins
The strains Proa1(I), Proa2(I), Proa1(I)+Proa2(I)
and Proa2(I)/Proa1(I) that expressed the corresponding polypeptide chains of human type I
procollagen and human 4-PH (Table 1) were generated as described in Materials and methods. The
recombinant strains were cultured in BMGY
medium, and expression was induced in BMMY
medium, methanol being added every 12 h. The
cells were harvested and broken 60 h after induction, and aliquots of the soluble fraction of the cell
extracts were analysed by SDSPAGE followed by
Western blotting. Full-length proa1(I) chains were
seen in immunoblots of the strains Proa1(I),
Proa1(I)+Proa2(I) and Proa2(I)/Proa1(I) (Figure 1A,
lanes 1, 3 and 4), and full-length proa2(I) chains
were seen in those of the Proa2(I), Proa1(I)+
801
Figure 1. Expression of the recombinant human type I procollagen homotrimers and heterotrimers in P. pastoris. (A) 50 mg
samples of the soluble fractions of the cell lysates from the strains Proa1(I) (lanes 1 and 5), Proa2(I) (lanes 2 and 6),
Proa1(I)+Proa2(I) (lanes 3 and 7) and Proa2(I)/Proa1(I) (lanes 4 and 8) were analysed by SDSPAGE under reducing
conditions followed by Western blotting, using antibodies against the N propeptide of the human proa1(I) chain (lanes 14)
and C propeptide of the human proa2(I) chain (lanes 58). The arrows indicate the proa1(I) and proa2(I) chains. The lines on
the left indicate the positions of the molecular weight markers (kDa). (B) 20 ml aliquots of pepsin-treated samples from the
strains Proa1(I) (lane 1), Proa2(I) (lane 2), Proa1(I)+Proa2(I) (lane 3) and Proa2(I)/Proa1(I) (lane 4) were analysed by
SDSPAGE under reducing conditions followed by silver staining. The arrows indicate the a1(I) and a2(I) chains
Copyright # 2001 John Wiley & Sons, Ltd.
802
M. Nokelainen et al.
Figure 2. Expression of the recombinant human type I pCcollagen homotrimers and heterotrimers in P. pastoris. (A) 50 mg
samples of the soluble fractions of the cell lysates from the strain expressing only 4-PH (lane 1) and the strains Proa1(I) (lane
2), PCa1(I) (lane 3) and PCa1(I)+PCa2(I) (lane 4) were analysed by SDSPAGE under reducing conditions followed by
Western blotting, using an antibody against the collagenous region of various collagens. The arrows indicate the proa1(I),
pCa1(I) and pCa2(I) chains. (B) 20 ml aliquots of pepsin-treated samples from the strains PCa1(I) (lane 1) and
PCa1(I)+PCa2(I) (lane 2) were analysed by SDSPAGE under reducing conditions followed by silver staining. The arrows
indicate the a1(I) and a2(I) chains
Copyright # 2001 John Wiley & Sons, Ltd.
803
Discussion
The data presented here demonstrate that it is
possible to integrate four genes into the genome of
P. pastoris and successfully co-express all four
polypeptides. The two types of 4-PH subunit
Table 2. Amino acid analysis of the puried recombinant human type I collagen homotrimer and heterotrimer
Amino acid
Recombinant
human type I
collagen homotrimera
(residues/1000)
Non-recombinant
human type I
collagen homotrimerb
(residues/1000)
Recombinant human
type I collagen heterotrimera
(residues/1000)
Non-recombinant
human type I
collagen heterotrimerb
(residues/1000)
Aspartic acid
Glutamic acid
4-Hydroxyproline
Serine
Glycine
Histidine
Arginine
Threonine
Alanine
Proline
Valine
Methionine
Tyrosine
Isoleusine
Leucine
Hydroxylysine
Phenylalanine
Lysine
46t4.3
77t4.4
110t2.2
35t1.6
314t5.4
4.8t0.5
50t1.3
19t0.8
120t13.3
122t2.9
21t2.2
NDc
ND
9t0.1
20t2.8
0
11t1.0
33t1.6
42
73
108
34
333
3
50
16
115
124
21
7
1
6
19
9
12
26
40t5.9
71t0.3
99t5.7
31t3.6
315t19
6.9t1.5
53t1.7
19t3.2
123t1.5
125t14
25t0.4
ND
ND
9t1.1
23t1.7
0
10t1.2
34t1.9
43
71
103
33
335
6
50
17
111
120
26
6
2
9
23
10
12
23
804
M. Nokelainen et al.
Acknowledgements
We thank Dr James Cregg, Keck Graduate Institute of
Applied Life Sciences, for the gift of the pBLARGIX,
pBLARGSX and pBLADEIX vectors and the P. pastoris
strain yJC300, and Raija Juntunen, Anne Kokko, Eeva
Lehtimaki, Minna Siurua and Tanja Vaisanen for their
expert technical assistance. This work was supported by
grants from the Health Science Council of the Academy of
Finland, the Finnish Centre of Excellence Programme
20002005 (44843), from the European Commission (BIO4CT96-0537), from the National Institutes of Health (R01
AR45879), from FibroGen Inc. (South San Francisco, CA),
from the Research and Science Foundation of Farmos, and
from the Jenny and Antti Wihuri Foundation.
References
Bruckner P, Prockop DJ. 1981. Proteolytic enzymes as probes for
the triple-helical conformation of procollagen. Anal Biochem
110: 360368.
Cereghino JL, Cregg JM. 2000. Heterologous protein expression
in the methylotrophic yeast Pichia pastoris. FEMS Microbiol
Rev 24: 4566.
Cregg JM, Vedvick TS, Raschke WC. 1993. Recent advances in
the expression of foreign genes in Pichia pastoris. Biotechnology 11: 905910.
John DCA, Watson R, Kind AJ, Scott AR, Kadler KE, Bulleid
NJ. 1999. Expression of an engineered form of recombinant
procollagen in mouse milk. Nature Biotechnol 17: 385389.
Kadler KE, Holmes DF, Trotter JA, Chapman JA. 1996.
Collagen bril formation. Biochem J 316: 111.
Keizer-Gunnink I, Vuorela A, Myllyharju J, Pihlajaniemi T,
Kivirikko KI, Veenhuis M. 2000. Accumulation of properly
folded human type III procollagen molecules in specic
intracellular membranous compartments in the yeast Pichia
pastoris. Matrix Biol 19: 2936.
Kielty CM, Hopkinson I, Grant ME. 1993. Collagen: the
collagen family: structure, assembly, and organization in the
extracellular matrix. In Connective Tissue and Its Heritable
Disorders: Molecular, Genetic and Medical Aspects, Royce PM,
Steinmann B (eds). WileyLiss: New York; 103147.
Kivirikko KI, Myllyharju J. 1998. Prolyl 4-hydroxylases and
Copyright # 2001 John Wiley & Sons, Ltd.
805
806
M. Nokelainen et al.