Professional Documents
Culture Documents
1 64
α-S824: MCEQKLISEEDL GMYGKLNDLLEDLQEVLKHVNQHWQGGQKNMNKVDHHLQNVIEDIHDFMQGG
65 GSGGKLQEMMKEFQQVLDEIKQQLQGGDNSLHNVHENIKEIFHHLEELVHR 105
Fluorescence (AU)
Fluorescence (AU)
100
15 8
80
10 6
60
40 4
5
20 2
0
0 0
460 480 500 520 540 500 550 600 650 700 400 450 500 550 600
Wavelength (nm) Wavelength (nm) Wavelength (nm)
α-S824 β4 β17 β23
E
β4 β17 β23
F
β4 β17 β23
b17, or b23 were explored in a third type of experiment. Total cell experiments. Three proteomic experiments were performed,
lysates were prepared essentially without removal of aggregate each consisting of three biological repeats (independent trans-
material and combined 1:1:1 (Figure 3A and Figure S3A). The fections). A protein was identified as b protein interactor
expressed proteins were quantitatively isolated using anti-Myc when its isotope-labeled peptides were either enriched relative
antibody coupled to magnetic beads, followed by SDS-PAGE, to the a-S824 control or relative to one of the other b proteins
in-gel digestion, and LC-MS/MS analysis. with > 95% confidence in at least two of the three repeats of
It seemed plausible that initial coaggregation may be driven by a set (see Extended Experimental Procedures and Figures
relatively weak interactions, which might introduce a stochastic S3B–S3D). A total of 94 interactors of b23, 73, of b17 and 57 of
element in the proteomic analysis. To overcome this problem b4 were identified in experiments of equivalent sampling size,
we based our analysis on extensive biological repetitions of the consistent with the relative toxicity of the proteins (Figure 3B
and Tables S1–S3). Only four proteins were marginally enriched Immunofluorescence analysis demonstrated extensive co-
on a-S824 relative to the vector only control, including two ribo- localization of eIF3D and eIF4GII with the aggregates and
somal proteins (Figure S3B). Approximately 60% of the b4 and western blotting of pulldowns confirmed that at least 10%
b17 interactors were also found to interact with b23, indicating of cellular eIF3D coaggregated with b23 and b17, compared
a high degree of overlap in interaction profiles (Figure 3B). to 6% with b4 (Figures S3E and S3F, and data not shown).
Western blotting of pulldowns and immunofluorescence analysis Indeed, labeling experiments showed that cells expressing
of cells confirmed the results from SILAC/MS for several interac- b23 for 24 hr had a 35% reduced protein synthesis capacity
tors (Figures S3E and S3F). Thus, interactions of the b protein (Figure S3G). Similarly, the altered morphology of b-protein-
aggregates with multiple endogenous proteins precede the expressing cells revealed by actin staining (Figure S3H) may be
strong decrease in cell viability observed at 72 hr after transfec- attributed to the association of filamins A, B, C, (FLNA, FLNB,
tion (Figure 2A). FLNC), and the giant protein plectin-1 (PLEC1) (500 kDa)
As summarized for b23, most of the proteins associated with with the aggregates (Figure 3C), proteins that are critical for
the aggregates have their primary location in the cytoplasm, the formation and maintenance of cytoskeletal architecture.
nucleus and mitochondria (Figure 3C and Table S1). Proteins These results show that many different proteins, involved in
involved in chromatin regulation, RNA processing, transcrip- a range of essential cellular functions, are affected by the
tion, translation, cytoskeletal function, vesicle transport, and b protein aggregates.
protein quality control were highly represented. These proteins
are generally of average cellular abundance (Su et al., 2002) Aberrant Stress Response in b-Protein-Expressing Cells
and for several of them between 10% and 45% of total was The proteomic analysis identified several cytosolic chaperones
associated with the aggregates, based on depletion from and chaperone regulators to be associated with the aggregates,
supernatant fractions after pulldown as measured by SILAC/ including Hsc70 (Hsc71) and its cochaperones Hsp110
MS (Table S1 and Extended Experimental Procedures). Note (Hsp105), Hdj1/2 and Bag2, as well as the nascent chain associ-
that this analysis probably underestimates the extent of ated complex, NAC (Figure 3C). Hsp110 was enriched in the
sequestration, since coaggregates may partially dissociate aggregates in a manner correlating with the relative toxicity of
during isolation. Interestingly, 12 different translation initiation b4, b17 and b23, as confirmed by western blotting and immuno-
factors interacted directly or indirectly with the aggregates, fluorescence (Figures 4A and 4B). Indeed, overexpression of
including 9 of the 13 subunits of the eIF3 complex and 3 Hsp110 (Figure S4A) partially suppressed b4 and b17 aggrega-
subunits of eIF4 (Figure 3C). b17 aggregates contained 10 tion and toxicity but was inefficient in mitigating the toxic effects
and b4 aggregates 9 of these proteins (Tables S2 and S3). of b23 (Figures 4C and Figures S4B and S4C).
Intensity
mix β17 β17
Expt. II: α-S824 β4 β17 lysates IP LC-MS/MS
α-S α-S β4 β4
1:1:1 C 824 824
Expt. III: β4 β17 β23
m/z m/z m/z
B C Miscellaneous
β4 ABCE1 OGT
Molecular chaperones
BAG2 HSPA8 Translation
FAM120A SBDS
(57) Protein degradation
CHORDC1
DNAJA1
HSPH1
NACA
EEF1A1
EIF3A
EIF3I
EIF3L
CACYBP UBC DNAJB1 NACB EIF3C EIF3M
ERLIN2 UBR4
EIF3D EIF4A1
SUMO2 Mitochondria
EIF3E EIF4A3
Vesicle transport EIF3F EIF4G3 SLC25A6 VDAC1
21 AP1B1 CLTC tRNA synthetases EIF3G IGF2BP3 CHCHD3 VDAC2
AP1G1 SEC16A EPRS WARS IMMT VDAC3
AP1M1 VAPA YARS SSBP1
1 4
Metabolism
31 ALDH18A1 Ribosome biogenesis
RNA processing
BAT1 RBM8A
CAD DIMT1L GEMIN4 SMN1
CHERP NVL GEMIN5 SR140
47 23 HDLBP PDCD11 STRAP
15 Nuclear structure WDR3
AHNAK
Cytoskeleton Chromatin structure
LMNB1 Transcription
AKAP12 PLEC1 CHD4 RUVBL2
NUMA1
Nuclear transport H1F0 SMC4 CAND1 PNN
ARHGDIA SEPT2
KPNA2 RBBP4 SMARCA4 CNOT1 PURA
CCDC88A SEPT7
GTF2I SND1
β17
DIAPH1 SEPT9 RANBP1 RUVBL1 SMARCA5
Remarkably, expression of the b proteins did not induce the Structural Features of b Protein Interactors
cytosolic stress response or heat-shock response (HSR), as no A bioinformatic analysis of the physicochemical properties of the
increase in the levels of Hsp110, Hsp70, or Hsp27 was observed b protein interactors was conducted to see whether these
(Figures 4A and data not shown). A possible defect in the HSR proteins share certain structural features. We focused our initial
was further analyzed using a luciferase reporter gene under analysis on the interactors of b23 (Table S1). Compared to a set
control of the HSF1-dependent Hsp70 promoter (Williams of 3055 control proteins identified by LC-MS/MS in a total cell
et al., 1989). While inhibition of proteasome function by MG132 lysate (Table S4), the b23 interactors are shifted to higher molec-
in control cells resulted in a 5-fold induction of the reporter, ular weight, with a significantly greater fraction of proteins above
this induction was completely abolished in cells expressing the 150 kDa (p < 0.005) (Figure 5A). In addition, the interactors have
b proteins for 24 hr (Figure 4D). The phorbol-12-myristate-13- a lower average hydrophobicity and a bimodal hydrophobicity
acetate (PMA)-mediated induction of a luciferase reporter under distribution (Kyte and Doolittle, 1982) (p < 0.005) (Figure 5B).
the NF-kB promoter was also impaired, but to a lesser extent The occurrence of domain folds among the b23 interactors, as
than the inhibition of the stress response (Figure 4E). Thus, classified in SCOP, was generally similar to that of lysate
expression of the model b sheet proteins leads to a deficiency proteins. However, the b23 interactors contained significantly
of the normal cytosolic stress response, thereby limiting the more proteins with all beta domains (SCOP class b) (p < 0.05)
capacity of cells to mount an effective defense. (Figure S5A), including the beta-barrel VDAC proteins of the
Viability (% of control)
C β4 β17 β23 α-S824 100 (A and B) Association of Hsp110 with b protein
aggregates. HEK293T cells were transfected as
Hsp110
shown.
(D and E) Inhibitory effect of b proteins on
10 cellular stress response pathways. Cells were
cotransfected with HSP70-luciferase reporter (D)
β23 or NF-kB-luciferase reporter constructs (E) and
1 the b-protein-expressing plasmids. 6 hr later, 5 mM
MG132 (D) or 16 mM PMA (E) were added to induce
C β4 β17 β23 α-S824 the respective promoter. Luciferase activity was
measured 24 hr after transfection. The promoter
activity in cells transfected with control vector, C,
without inducer was set to 1. Standard deviations
of three independent experiments are shown.
See also Figure S4.
outer mitochondrial membrane and the filamins which have and 10% of 100 to 429 residues. The IURs are enriched in polar
Ig-domain repeats (Table S1 and Figure S3F). amino acids and in amino acids that have a high propensity to
The lower hydrophobicity of many interactors suggested that form coil and turn regions, such as M, K, R, E, S, Q, and P,
these proteins may be rich in intrinsically unstructured regions and are depleted in aromatic and hydrophobic amino acids W,
(IURs). Indeed, compared to lysate proteins, the b23 interactors Y, F, C, I, and V (Dunker et al., 2008). Such sequences are struc-
have a significantly greater fraction of total amino acids in IURs turally flexible and populate a range of conformational states
(p < 0.05), based on the DisoDB database (Pentony and Jones, from extended disordered to collapsed, molten globule-like
2009) and the DisEMBL and IUPred prediction tools for unstruc- structures (Dunker et al., 2008; Pentony and Jones, 2009). Using
tured regions (Dosztanyi et al., 2005; Linding et al., 2003) (Fig- the Zagg algorithm to predict aggregation propensities, the b23
ure 5C, Tables S1 and S4, and data not shown). Pronounced interactors have higher aggregation scores than lysate proteins
differences emerged when considering the fraction of proteins (see Figure S6B below).
with IURs longer than 30 or 50 residues (Figure 5D). For example, A comparison of the proteins that interact preferentially with
60% of the b23 interactors are predicted to contain at least one b4, b17, and b23 (18, 28, and 27 proteins, respectively), as
unstructured segment of 30 amino acids, compared to 45% in defined by the SILAC experiments (Figure S3D and Extended
lysate proteins or the complete proteome (p < 0.005) (Figure 5D). Experimental Procedures), revealed that a gradual increase in
The corresponding numbers for IURs > 50 amino acids are molecular weight and decrease in hydrophobicity of the interac-
40% and 25% (p < 0.005), respectively. Moreover, the b23 tors, along with a slight increase in their disorder, correlated with
interactors contain on average 3 disordered segments of 30 the differential cytotoxicity of the three b proteins (Figure 5E).
amino acids (compared to 1.8 for the lysate proteins, p < This trend was also observed when comparing the complete
0.005). The predicted IURs of the interactors are shifted to interactor sets of the three b proteins (Tables S1–S3).
greater length (p < 0.005) (Figure S5B and Tables S1 and S4), The prominent association of proteins with low hydropho-
with 21% of the proteins containing IURs > 80 amino acids bicity and high intrinsic disorder with the aggregates was
40
80
120
160
200
240
280
320
480
520
560
600
640
based on a chi-square test. Disorder was deter-
mined using DisoDB.
Molecular weight (kDa)
(E) Structural properties of lysate proteins and of
B p<0.005 proteins interacting preferentially with b4, b17, or
14 0.2
b23.
0.0
% of identified proteins
-0.2
10
-0.4
8 -0.6
0.2
0.4
0.6
0.8
1.0
-1.6
-1.4
-1.2
-1.0
-0.8
-0.6
-0.4
-0.2
60
16
** β17 interactors were enriched on the Arctic mutant rela-
t
40 50
Molecular weigh
12 30
40 β23 interactors tive to the less toxic Ab1 42 WT (Table
30 S6). Notably, these proteins resemble
8 20 20
-0.55 the b protein interactors in physico-
y
10 -0.50
cit
4 10
bi
20 -0.40
op
0 0 40 60
Disord -0.35 enriched in IURs (p < 0.05) (Figures
dr
Intensity
30 70 Lysate proteins
t (kDa)
70
60 ** Old β23 interactors
stretches (%)
25 60
translation initiation factors (Table S7),
50 50 New β23 interactors
Molecular weigh
-0.6 10
cit
5 10
-0.5 Interestingly, the old and new interac-
bi
0
ho
20 40 -0.4
0 0
op
11
10 80 actors within the protein interaction network.
9
β4
Disease proteins
Shown in (A) are the average number of functional
/Protein
8 7.5 7.2
7 β17
60 interactions of the b23 interactors in comparison
6 β23
4.9 to proteins in the HPRD database and in the
5
4 40
3 2.8 experimentally determined cell lysate (3000
2 1.6 1.5 proteins). These functional interactions are cate-
0.8 20
1 0.4
gorized into total interactions, interactions with
0 1000
s
β23 850
r
HPRD Lysate 0 essential proteins and interactions with proteins
to
700
ac
200
All interactions Esse 250 involved in neurodegenerative disease (Ray-
r
550
te
ntial 300
In
Essential intera 350 chaudhuri et al., 2009). In (B), the complete sets of
ctors
Disease b4, b17, and b23 interactors are compared in
terms of these functional properties.
C (C) Model for the interaction of b aggregates with
pre-existent and newly synthesized proteins. Pre-
existent proteins are structurally flexible in their
functional state and are involved in multiple
Proteins or Chaperones Chaperones Newly-
protein complexes synthesized protein-protein interactions, which may be dis-
containing rupted by their association with the b aggregates.
disordered regions
The newly synthesized proteins are structurally
co-aggregation co-aggregation Amyloidogenic vulnerable to coaggregation during folding and
regions (red)
assembly. Interaction of both the preexistent
Toxic and newly synthesized proteins with the b aggre-
β-aggregates/
Structured oligomers gates is facilitated by the limiting capacity of
proteins Proteins not yet folded
and/or assembled chaperones to shield aggregate surfaces and by
the failure of the cells to mount an efficient stress
response.
conditions of limited chaperone capacity, they would be prone to neurodegenerative disease proteins (Raychaudhuri et al., 2009)
aggregate during and shortly after synthesis. This group is (Figure 7A and Table S1).
enriched among the newly synthesized proteins. Finally, some Assuming that a disturbance of functional protein interactions
proteins occupy a transition zone, combining physicochemical contributes critically to b aggregation toxicity, b23 would be
features of both groups. expected to differ in this regard from the less cytotoxic proteins
b4 and b17. We found that the b4, b17 and b23 interactors are
Aggregate Interactors Have Critical Network Functions physically linked to a total of 600, 643, and 912 different proteins,
The structural flexibility and relatively large size of the aggregate including 216, 213, and 340 essential proteins and 53, 56, and 84
interactors suggests that these proteins may normally be proteins associated with neurodegenerative disease networks,
involved in numerous functional protein interactions. To address respectively (Figure 7B). Thus, the capacity of the b protein
this possibility, we analyzed how the b23 interactors are linked aggregates to interact with and sequester highly connected
with the cellular protein network. A query of the Human cellular proteins correlates well with their relative cytotoxicity.
Proteome Reference Data Base (HPRD) (Keshava Prasad
et al., 2009) revealed that each of these proteins functionally DISCUSSION
interacts with 12 different proteins on average, compared to
7 per lysate protein and 7.5 per protein in HPRD (19,651 Widespread Coaggregation of Metastable Proteins
entries) (Figure 7A). Notably, most of the b23 interactors have A key finding of this study is that amyloidogenic aggregation can
no or only few interactions with any of the other b23 interactors, result in the sequestration of numerous proteins that share
suggesting that coaggregation may disrupt their functional distinct physicochemical properties: They are relatively large in
complexes. For example, the microfilament protein vimentin size and exhibit high structural flexibility, with a significant
interacts with more than 100 different proteins according to enrichment in disordered regions, features that are strongly
HPRD, but only three of those are among the identified b23 inter- linked with multifunctionality (Figure 7C).
actors, although 49 potential vimentin interactors were detected The artificial b sheet proteins used as a model were designed
in the lysate or background of the pulldowns (data not shown). to assemble into fibrils (West et al., 1999). Like natural amyloido-
Essential proteins often occupy critical ‘‘hub’’ positions in the genic proteins, they populate a range of prefibrillar aggregation
network (Haynes et al., 2006; Jeong et al., 2001). Each b23 target intermediates, which are likely to represent the primary toxic
protein interacts on average with 5 different essential proteins, agents in aggregation diseases (Chiti and Dobson, 2006; Jahn
compared to only 3 per lysate protein and 1.5 per entry in and Radford, 2008). Based on recent findings, the proteotoxicity
HPRD (Figure 7A). Moreover, the b23 interactors are more of such species correlates with the exposure of ANS-binding
frequently linked than lysate proteins, through direct interac- hydrophobic surfaces (Bolognesi et al., 2010; Campioni
tions, with proteins that have been found in association with et al., 2010) and reactivity with the A11 anti-oligomer antibody
Plasmids
For bacterial expression, the N-terminally Myc-tagged b proteins were cloned into the vector pTrcHis (Invitrogen), excluding the His-
tag of the vector. a-S824 protein was cloned into the pProEx HT vector. For mammalian expression, N-terminally Myc-tagged
proteins were cloned into pcDNA3.1/Myc-His (Invitrogen), excluding the Myc- and His-tags of the vector. Human HSP110 was in-
serted into the vector pcDNA3.1 (Invitrogen). HSP70-luciferase (Williams et al., 1989) and NF-kB-luciferase (Heil et al., 2004) reporter
constructs were kindly provided by Dr. H. Wagner (Technical University Munich). GFP fusion constructs of wild-type (WT) Ab1-42 and
its Arctic mutant (E22G) (Wurth et al., 2002) were cloned into pcDNA3.1 using the restriction sites HindIII/NotI.
Protein Purification
Protein expression in E. coli BL21 cells (DE3) was induced with 0.5 mM IPTG at an OD600 of 1.0 for 4 hr at 37 C. The cells were
collected by centrifugation at 5000 x g for 10 min and resuspended in lysis buffer (25% sucrose, 50 mM Tris pH 8.0, 1 mM
EDTA), supplemented with complete protease inhibitor (Roche) and 1 mg/ml lysozyme (Sigma). The cells were lysed by sonication
or repeated freeze-thaw cycles and the DNA was digested by Benzonase (Novagen). Inclusion bodies were isolated by repeated
washing in 0.5% Triton X-100, 1 mM EDTA and centrifugation (20.000 x g for 15 min). The pellet was dissolved in 8 M urea and purified
on a MonoQ 10/100 HR16/10 column in 8 M urea, 25 mM Tris pH 7.5 using a gradient from 0 to 1 M NaCl. The b-protein-containing
fractions were further purified by size exclusion chromatography on Sephacryl S-300 HiPreP 26/60 in 8 M urea, 0.1 M NaCl, 25 mM
Tris pH 7.5. After dialysis overnight into 0.1 M NaCl, 25 mM sodium phosphate pH 7.5, the proteins formed soluble oligomers/aggre-
gates, which eluted in the excluded volume of Sephacryl S-300 size-exclusion chromatography. The protein concentration of the
b sheet proteins was determined by UV absorbance at 210 nm (Abs210 0.1% (= g/l) = 20). All proteins were stored at 80 C in
25 mM sodium-phosphate pH 7.5, 100 mM NaCl.
To purify a-S824 protein, cell lysate was centrifuged at 20.000 x g for 15 min, and the supernatant was applied to Ni-NTA agarose
chromatography following standard procedures. The protein concentration of the purified a-S824 protein was determined by UV
absorbance at 280 nm (3280 = 12950 M-1 cm-1). The protein was stored at 80 C in 25 mM sodium-phosphate pH 7.5, 100 mM NaCl.
Antibodies
A11 anti-oligomer antibody was kindly provided by Dr. C. Glabe (University of California, Davis). Anti-Myc (Cy3-conjugated), anti-
rabbit IgG (peroxidase-conjugated), and anti-mouse IgG (peroxidase-conjugated) were purchased from Sigma, anti-Hsp110, anti-
Myc, anti-VDAC1, anti-RanBP1, anti-eIF3D, anti-eIF4GII and anti-Vigilin from SantaCruz, anti-GAPDH from Chemicon, anti-GFP
from Roche, anti-Myc (FITC-conjugated) from Zymed, and anti-mouse IgG (FITC-conjugated) from Biosource and anti-mouse IgG
(Cy3-conjugated) anti-rabbit (Cy3-conjugated) from Dianova.
Fluorescence Spectroscopy
Fluorescence measurements with purified proteins were performed on a FluoroLog-3 spectrofluorometer. The fluorescent dyes ThT
and ANS were adjusted to a final concentration of 20 mM, NIAD-4 to 1 mM. ThT fluorescence was excited at 440 nm, NIAD-4 at 475 nm
and ANS at 375 nm .
Analysis of MS Data
Generation of Ratios
MS data were analyzed with MaxQuant version 1.0.12.31 (Cox and Mann, 2008) using the following parameters: Quant; SILAC Trip-
lets, Medium Labels: Arg6, Lys4, Heavy Labels: Arg10, Lys8. Maximum labeled amino acids: 3. Variable modifications: Oxidation (M),
Acetyl (Protein N-terminus). Fixed Modifications: Carbamidomethyl (C). Database: Human IPI, released March 3 2009 with contam-
inants and a decoy database added by the SequenceReverser.exe program released with MaxQuant v. 1.0.12.4. Enzyme: Trypsin/P.
MS/MS tolerance: 0.5 Da. Maximum missed cleavages: 2. Top MS/MS peaks/100 Da: 6. Mascot version 2.2 (Matrix Sciences, www.
matrixsciences.com) was used to generate search results for MaxQuant. Identify; Peptide FDR: 0.01. Protein FDR: 0.01. Maximum
PEP: 1. Minimum unique peptides: 1. Minimum peptide length: 6. Minimum peptides: 1. Protein Quantitation based on Razor and
Unique peptides. Minimum Ratio count: 2. ‘Requantify’ and ‘Keep low-scoring versions of identified peptides’ were both enabled.
Additionally, for those proteins for which no quantitation was available, but for which at least two unique peptides were found
only in one isotope but not another isotope, arbitrary ratios of 10 or 0.1 were assigned, and Significance B was assigned the value
of ‘0’. Unless noted otherwise, normalized ratios were used. Supplemental Tables show the leading protein for each protein group;
that is, the protein which best matched all of the peptides identified. The raw MS data along with a full list of identified proteins and
quantitations is available at https://proteomecommons.org/tranche, entering the following hash: +Ff0/p8lSBrrzCKZfzAwYS3
+Bqw5fonokB679f136te2iklhHtFMUpeT5SM/I3XuufTyrXj0ycVVC6G4Li/L02 dA4jcAAAAAAABVfg = =.
Depletion of Interactors
The depletion of interactors from the total cell lysate by binding to b23 aggregates was estimated by analyzing the supernatants
(technically flow-through fractions) from the IPs in experiment I. H/M ratios were used to determine the amount of each of the inter-
actors remaining in the cell lysate after IP. The depletion was then estimated by dividing by the approximate transfection efficiency
(50%). A depletion value is reported when an H/M ratio was measured in the supernatant in at least two out of the three replicates and
when the H/M ratio is 0.95 or lower, corresponding to a mean depletion of greater than 10% (Table S1). Relative error in the SILAC
ratios prevents the calculation of reliable estimates for depletion when the depletion is very low. Therefore, for proteins where the
mean H/M ratio of at least two replicates is 0.95 or higher, a depletion value of % 10% has been entered in Table S1. Values of deple-
tion around 10% and higher were confirmed by quantitative western blotting in which protein amounts in IPs were compared with
amounts in total lysate (Figure S3F and Figure 4A, and data not shown).
Pulse-SILAC
The design of this experiment (also see experiment IV above) is shown in Figure 6B. The M-labeled cells were transfected with b23.
After the transfection, half of the cells were shifted to H medium, while the other half continued in M medium. The L-labeled cells were
transfected with a-S824, and half of them were left on L medium, as a control. Lysates were prepared, mixed and analyzed as above.
The measured H/Mlysate24h ratio of individual proteins in the lysate was used to calculate the expected H/Mlysate12h, the ratio at the
midpoint of the experiment, according to the equation:
H=Mlysate24h
H=Mlysate12h = :
2 + H=Mlysate24h
Bioinformatic Analysis
The SUPERFAMILY predictions (http://supfam.cs.bris.ac.uk) according to SCOP 1.73 classification (http://scop.mrc-lmb.cam.ac.
uk/) where used to assign protein folds. Information on protein interactions is based on HPRD Release 9 (Keshava Prasad et al.,
2009) (www.hprd.org). This data is combined with the information about gene essentiality in mouse from MGD (Bult et al., 2008)
(http://www.informatics.jax.org/) and the compilation of proteins associated with neurodegenerative disease proteins (Raychaudhuri
et al., 2009). The cellular abundance of the b protein interactors was estimated according to HEK293T mRNA levels measured by Su
et al. (2002) (http://biogps.gnf.org).
pagg
i = ah phi + as psi + ahyd phyd
i + ac pci
1X 3
pagg
i = pagg + apat Ipat + agk Igk
7 j = 3 i + j i i
where we considered the aggregation rate of a seven-residue segment of the protein centered at position i. Ipat
i and Igk
i are included,
pat
respectively, to account for the presence of hydrophobic patterns and of gatekeeper residues. The term Ii is 1 if residue i is included
in a hydrophobic pattern over five residues and 0 otherwise, while the term Igk
i is defined as
X
10
Igk
i = ci + j
j = 10
where the sum over the charges ci of individual amino acids is made over a sliding window of 21 residues; shorter windows are
considered at the N- and C-termini. The term Igk
i is introduced to take into account the fact that, when a hydrophobic pattern is flanked
by charged residues, its contribution to the aggregation propensity is much reduced by electrostatic repulsions.
By normalizing pagg
i , the intrinsic aggregation propensity score, Ziagg , is obtained, which is used to compare the aggregation
propensity of different sequences
pagg magg
Ziagg = i
sagg
where magg is the average value of Ziagg over a set of random polypeptides having the same length as the sequence of interest, and
agg
s is the corresponding standard deviation from the average (Tartaglia et al., 2008). The corresponding intrinsic propensity for
aggregation of the protein without structural correction, Z agg , is calculated as
1X N
Z agg = Z agg
N i=1 i
The weights a’s are determined by fitting the Z agg scores against a database of aggregation rates measured experimentally (Tar-
taglia et al., 2008).
P
N agg agg agg
Z~i w Z~i Z~0
agg
Z~ = i = 1 N
P ~agg ~agg
w Zi Z0
i=1
SUPPLEMENTAL REFERENCES
Bult, C.J., Eppig, J.T., Kadin, J.A., Richardson, J.E., and Blake, J.A. (2008). The Mouse Genome Database (MGD): mouse biology and model systems. Nucleic
Acids Res. 36, D724–D728.
Heil, F., Hemmi, H., Hochrein, H., Ampenberger, F., Kirschning, C., Akira, S., Lipford, G., Wagner, H., and Bauer, S. (2004). Species-specific recognition of single-
stranded RNA via toll-like receptor 7 and 8. Science 303, 1526–1529.
Heinz, C., Engelhardt, H., Niederweis, M., Heinz, C., Engelhardt, H., and Niederweis, M. (2003). The core of the tetrameric mycobacterial porin MspA is an
extremely stable beta-sheet domain. J. Biol. Chem. 278, 8678–8685.
Tartaglia, G.G., Cavalli, A., and Vendruscolo, M. (2007). Prediction of local structural stabilities of proteins from their amino acid sequences. Structure 15, 139–
143.
Wurth, C., Guimard, N.K., and Hecht, M.H. (2002). Mutations that reduce aggregation of the Alzheimer’s A beta 42 peptide: an unbiased search for the sequence
determinants of A beta amyloidogenesis. J. Mol. Biol. 319, 1279–1290.
Zandomeneghi, G., Krebs, M.R., McCammon, M.G., Fandrich, M. (2004). FTIR reveals structural differences between native beta-sheet proteins and amyloid
fibrils. Protein Sci. 13, 3314–3321.