Professional Documents
Culture Documents
Proteomics Tools
Molecular Biology Tools
Separation & Display Tools
Protein Identification Tools
Protein Structure Tools
Protein Identification
2D-GE + MALDI-MS
2D-GE + MS-MS
Multidimensional LC + MS-MS
1D-GE + LC + MS-MS
De Novo Peptide Sequencing
Create ions
Ionization
method
MALDI
Electrospray
(Proteins must be
charged and dry)
Separate ions
Mass analyzer
MALDI-TOF
MW
Triple Quadrapole
AA seq
MALDI-QqTOF
AA seq and MW
QqTOF
Detect ions
Mass
spectrum
Database
analysis
Library
Spot removed
from gel
Artificial
spectra built
Fragmented
using trypsin
Spectrum of
fragments
generated
MATCH
Artificially
trypsinated
Database of
sequences
(i.e. SwissProt)
Methods for
protein
identification
MS Principles
Different elements can be uniquely
identified by their mass
MS Principles
Different compounds can be uniquely
identified by their mass
Butorphanol
N -CH2OH
L-dopa
Ethanol
COOH
HO
-CH2CH-NH2
CH3CH2OH
HO
HO
MW = 327.1
MW = 197.2
MW = 46.1
Mass Spectrometry
Analytical method to measure the
molecular or atomic weight of samples
Weighing proteins
A mass spectrometer creates charged particles (ions) from molecules.
Common way is to add or take away an ions:
NaCl + e NaCl
-
NaCl NaCl+ + eIt then analyzes those ions to provide information about the molecular
weight of the compound and its chemical structure.
Mass Spectrometry
For small organic molecules the MW can be
determined to within 5 ppm or 0.0005% which
is sufficiently accurate to confirm the
molecular formula from mass alone
For large biomolecules the MW can be
determined within an accuracy of 0.01% (i.e.
within 5 Da for a 50 kD protein)
Recall 1 dalton = 1 atomic mass unit (1 amu)
MS History
JJ Thomson built MS prototype to measure
m/z of electron, awarded Nobel Prize in 1906
MS concept first put into practice by Francis
Aston, a physicist working in Cambridge
England in 1919
Designed to measure mass of elements
Aston Awarded Nobel Prize in 1922
MS History
1948-52 - Time of Flight (TOF) mass
analyzers introduced
1955 - Quadrupole ion filters introduced by
W. Paul, also invents the ion trap in 1983
(wins 1989 Nobel Prize)
1968 - Tandem mass spectrometer appears
Mass spectrometers are now one of the
MOST POWERFUL ANALYTIC TOOLS IN
CHEMISTRY
MS Principles
Find a way to charge an atom or
molecule (ionization)
Place charged atom or molecule in a
magnetic field or subject it to an electric
field and measure its speed or radius of
curvature relative to its mass-to-charge
ratio (mass analyzer)
Detect ions using microchannel plate or
photomultiplier tube
Ionizer
Mass Analyzer
Detector
Create ions
Separate ions
Ionization
method
Mass analyzer
MALDI
Electrospray
(Proteins must be
charged and dry)
Detect ions
Mass
MALDI-TOF
spectrum
MW
Database
Triple Quadrapole
AA seq
analysis
MALDI-QqTOF
AA seq and MW
QqTOF
AA seq and
protein modif.
Mass spectrometers
L in e a r T im e O f F lig h t t u b e
io n s o u r c e
d e te c to r
t im e o f f l ig h t
io n s o u r c e
r e f le c t o r
tim e o f flig h t
m/z ratio:
Molecular weight divided by the
charge on this protein
aspirin
M
M
Resolution in MS
Resolution in MS
783.455
QTOF
784.465
785.475
783.6
Inlet
Sample Plate
Target
HPLC
GC
Solids probe
Ion
Source
Mass
Filter
MALDI
ESI
IonSpray
FAB
LSIMS
EI/CI
TOF
Quadrupole
Ion Trap
Mag. Sector
FTMS
Detector
Microch plate
Electron Mult.
Hybrid Detec.
Data
System
PCs
UNIX
Mac
EI Fragmentation of CH3OH
CH3OH
CH3OH+
CH3OH
CH2O=H+
CH3OH
CH2O=H+
+ H
CH3 + OH
CHO=H+ + H
MALDI
ESI
Soft Ionization
Soft ionization techniques keep the
molecule of interest fully intact
Electro-spray ionization first conceived in
1960s by Malcolm Dole but put into
practice in 1980s by John Fenn (Yale)
MALDI first introduced in 1985 by Franz
Hillenkamp and Michael Karas (Frankfurt)
Made it possible to analyze large
molecules via inexpensive mass analyzers
such as quadrupole, ion trap and TOF
Ionization methods
Electrospray Ionization
Sample dissolved in polar, volatile buffer
(no salts) and pumped through a stainless
steel capillary (70 - 150 m) at a rate of 10100 L/min
Strong voltage (3-4 kV) applied at tip along
with flow of nebulizing gas causes the
sample to nebulize or aerosolize
Aerosol is directed through regions of
higher vacuum until droplets evaporate to
near atomic size (still carrying charges)
Electrospray (Detail)
Electrospray Ionization
Can be modified to nanospray system
with flow < 1 L/min
Very sensitive technique, requires less
than a picomole of material
Strongly affected by salts & detergents
Positive ion mode measures (M + H)+ (add
formic acid to solvent)
Negative ion mode measures (M - H)- (add
ammonia to solvent)
Matrix-Assisted Laser
Desorption Ionization
337 nm UV laser
cyano-hydroxy
cinnamic acid
MALDI
MALDI
Sample is ionized by bombarding sample
with laser light
Sample is mixed with a UV absorbant
matrix (sinapinic acid for proteins, 4hydroxycinnaminic acid for peptides)
Light wavelength matches that of
absorbance maximum of matrix so that
the matrix transfers some of its energy to
the analyte (leads to ion sputtering)
MALDI Ionization
Matrix
+
+ +-+
Laser
Analyte
+
+ ++ + --+
-+
+
+
+
Absorption of UV radiation
by chromophoric matrix and
ionization of matrix
Dissociation of matrix,
phase change to supercompressed gas, charge
transfer to analyte molecule
Expansion of matrix at
supersonic velocity, analyte
trapped in expanding matrix
plume (explosion/popping)
MALDI
Unlike ESI, MALDI generates spectra that
have just a singly charged ion
Positive mode generates ions of M + H
Negative mode generates ions of M - H
Generally more robust that ESI (tolerates
salts and nonvolatile components)
Easier to use and maintain, capable of
higher throughput
+
+ +
p u ls e d
U V o r I R la s e r
(3 -4 n s )
vacuum
+
s tro n g
e le c tr ic
fie ld
Vacc
c lo u d o f
p ro to n a te d
p e p tid e m o le c u le s
d e te c to r
T im e O f F lig h t tu b e
d e te c to r
t im e o f flig h t
R e fle c to r T im e O f F lig h t tu b e
io n s o u r c e
d e te c to r
r e f le c t o r
t im e o f flig h t
MALDI = SELDI
337 nm UV laser
cyano-hydroxy
cinnaminic acid
MALDI
MALDI/SELDI Spectra
Normal
Tumor
Inlet
Sample Plate
Target
HPLC
GC
Solids probe
Ion
Source
Mass
Filter
MALDI
ESI
IonSpray
FAB
LSIMS
EI/CI
TOF
Quadrupole
Ion Trap
Mag. Sector
FTMS
Detector
Microch plate
Electron Mult.
Hybrid Detec.
Data
System
PCs
UNIX
Mac
Different Types of MS
ESI-QTOF
MALDI-QTOF
Different Types of MS
GC-MS - Gas Chromatography MS
MCP
DETECTOR
PUSHER
HEXAPOLE
QUADRUPOLE
ION
SOURCE
SKIMMER
HEXAPOLE
COLLISION
CELL
TOF
REFLECTRON
HEXAPOLE
FT-ICR
2D - LC/LC
Study protein
complexes
without gel
electrophoresis
Complex mixture is
simplified prior to
MS/MS by 2D LC
(trypsin)
2D LC/MS
Principles of Fingerprinting
Sequence
>Protein 1
acedfhsakdfqea
sdfpkivtmeeewe
ndadnfekqwfe
>Protein 2
acekdfhsadfqea
sdfpkivtmeeewe
nkdadnfeqwfe
>Protein 3
acedfhsadfqeka
sdfpkivtmeeewe
ndakdnfeqwfe
Mass (M+H)
Tryptic Fragments
4842.05
acedfhsak
dfgeasdfpk
ivtmeeewendadnfek
gwfe
4842.05
acek
dfhsadfgeasdfpk
ivtmeeewenk
dadnfeqwfe
4842.05
acedfhsadfgek
asdfpk
ivtmeeewendak
dnfegwfe
Principles of Fingerprinting
Sequence
>Protein 1
acedfhsakdfqea
sdfpkivtmeeewe
ndadnfekqwfe
Mass (M+H)
4842.05
>Protein 2
acekdfhsadfqea
sdfpkivtmeeewe
nkdadnfeqwfe
4842.05
>Protein 3
acedfhsadfqeka
sdfpkivtmeeewe
ndakdnfeqwfe
4842.05
Mass Spectrum
http://ca.expasy.org/tools/peptidecutter/
http://ca.expasy.org/tools/peptidecutter/peptidecutter_enzymes.html#Tryps
Trypsin
XXX[KR]--[!P]XXX
Chymotrypsin
Lys C
Asp N endo
CNBr
XX[FYW]--[!P]XXX
XXXXXK-- XXXXX
XXXXXD-- XXXXX
XXXXXM--XXXXX
Why Trypsin?
60 kDa; 57 461 Da
pI = 4.75
533.3
675.4
879.4
1196.6
1249.6
1583.9
1853.9
2526.4
544.3
701.4
921.5
1217.6
1344.7
1616.8
1869.9
2542.4
545.3
726.4
953.4
1228.5
1455.8
1726.7
2286.2
3329.6
614.4
822.4
974.5
1232.6
1484.6
1759.9
2302.2
4211.4
634.3
855.5
988.5
1233.7
1514.8
1775.9
2317.2
http://us.expasy.org/tools/peptide-mass.html
>Protein 1
acedfhsakdfqea
sdfpkivtmeeewe
ndadnfekqwfe
Monoisotopic Mass: the sum of the exact or accurate masses of the lightest stable isotope of the atoms
in a molecule
H-1.007828503 amu
2
H-2.014017780 amu
1
C-12
13
C-13.00335, 14C-14.00324
12
Masses in MS
Monoisotopic
mass is the mass
determined using
the masses of the
most abundant
isotopes
Average mass is
the abundance
weighted mass of
all isotopic
components
Amino acid
R1NHCH2COR3
Residue
Monoisotopic Mass
1
H = 1.007825
12
C = 12.00000
14
N = 14.00307
16
O = 15.99491
Glycine 57.02147
Alanine 71.03712
Serine 87.03203
Proline 97.05277
Valine
99.06842
Threonine 101.04768
Cysteine 103.00919
Isoleucine 113.08407
Leucine 113.08407
Asparagine 114.04293
Glycine 57.0520
Alanine 71.0788
Serine 87.0782
Proline 97.1167
Valine
99.1326
Threonine 101.1051
Cysteine 103.1448
Isoleucine 113.1595
Leucine 113.1595
Asparagine 114.1039
>P12345
acedfhsakdfqea
sdfpkivtmeeewe
ndadnfekqwfe
>P21234
acekdfhsadfqea
sdfpkivtmeeewe
nkdadnfeqwfe
acek
dfhsadfgeasdfpk
ivtmeeewenk
dadnfeqwfe
>P89212
acedfhsadfqeka
sdfpkivtmeeewe
ndakdnfeqwfe
acedfhsadfgek
asdfpk
ivtmeeewendak
dnfegwfe
Mass List
450.2017 (P21234)
609.2667 (P12345)
664.3300 (P89212)
1007.4251 (P12345)
1114.4416 (P89212)
1183.5266 (P12345)
1300.5116 (P21234)
1407.6462 (P21234)
1526.6211 (P89212)
1593.7101 (P89212)
1740.7501 (P21234)
2098.8909 (P12345)
1199
2211 (trp)
609
2098
450
1940 (trp)
698
500
1000
1500
2000
2500
450.2201
609.3667
698.3100
1007.5391
1199.4916
2098.9909
450.2017 (P21234)
609.2667 (P12345)
664.3300 (P89212)
1007.4251 (P12345)
1114.4416 (P89212)
1183.5266 (P12345)
1300.5116 (P21234)
1407.6462 (P21234)
1526.6211 (P89212)
1593.7101 (P89212)
1740.7501 (P21234)
2098.8909 (P12345)
Results
2 Unknown masses
1 hit on P21234
3 hits on P12345
Conclude the query
protein is P12345
Database search
PeptIdent (ExPasy)
Mascot (Matrix Science)
MS-Fit (Prospector; UCSF)
ProFound (Proteometrics)
MOWSE (HGMP)
Human Genome Mapping Project
Mascot
800
1200
1600
2000
2400
800
1200
1600
2000
m/z
m/z
theoretical
experimental
Protein ID
2400
http://129.85.19.192/profound_bin/WebProFound.exe
MOWSE
http://srs.hgmp.mrc.ac.uk/cgi-bin/mowse
PeptideSearch
http://www.narrador.emblheidelberg.de/GroupPages/Homepage.html
Mascot
www.matrixscience.com
PeptIdent
http://us.expasy.org/tools/peptident.html
ProFound
ProFound Results
MOWSE
PeptIdent
MASCOT
Mascot Scoring
The statistics of peptide fragment
matching in MS (or PMF) is very similar to
the statistics used in BLAST
The scoring probability follows an extreme
value distribution
High scoring segment pairs (in BLAST)
are analogous to high scoring mass
matches in Mascot
Mascot scoring is much more robust than
arbitrary match cutoffs (like % ID)
8000
7000
P(x) = 1 - e
6000
-x
-e
5000
4000
3000
2000
1000
0
<20
30
40
50
60
70
80
90
100
110
>120
MASCOT
Mascot/Mowse Scoring
The Mascot Score is given as S = -10*Log(P),
where P is the probability that the observed
match is a random event
Try to aim for probabilities where P<0.05 (less
than a 5% chance the peptide mass match is
random)
Mascot scores greater than 72 are significant
(p<0.05).
Advantages of PMF
How Tandem MS
sequencing works
Use Tandem MS: two mass analyzers
in series with a collision cell in
between
Collision cell: a region where the
ions collide with a gas (He, Ne, Ar)
resulting in fragmentation of the ion
Fragmentation of the peptides occur
in a predictable fashion, mainly at
the peptide bonds
The resulting daughter ions have
masses that are consistent with
known molecular weights of
dipeptides, tripeptides,
tetrapeptides
Ser-Glu-Leu-Ile-Arg-Trp
Collision Cell
Ser-Glu-Leu-Ile-Arg
Ser-Glu-Leu-Ile
Ser-Glu-Leu
Etc
Disadvantages
Requires more handling,
refinement and sample
manipulation
Requires more expensive
and complicated
equipment
Requires high level
expertise
NH
NH
H
N
S
*
*
*
*
H
N
I
O
Lyse &
Label
Quantification
MS
Light
100
100
MIX
Proteolysis
(ie trypsin)
Identification
MS/MS
NH2-EACDPLRCOOH
Heavy
550
570
m/z
590
200
400
m/z
600
ICAT Quantitation
ICAT
Advantages vs. Disadvantages
Estimates relative protein
levels between samples
with a reasonable level of
accuracy (within 10%)
Expensive
Slight chromatography
differences
Tag fragmentation
Meaning of relative
quantification information
No presence of cysteine
residues or not accessible by
ICAT reagent
Inlet
Sample Plate
Target
HPLC
GC
Solids probe
Ion
Source
Mass
Filter
MALDI
ESI
IonSpray
FAB
LSIMS
EI/CI
TOF
Quadrupole
Ion Trap
Mag. Sector
FTMS
Detector
Microch plate
Electron Mult.
Hybrid Detec.
Data
System
PCs
UNIX
Mac
MS Detectors
Mass Detectors
Limitations of Proteomics
-solubility of indiv. protein differs
-2D gels unable to resolve all proteins at a given time
-most proteins are not abundant (ie kinases)
-proteins not in the database cannot be identified
-multiple runs can be expensive
-proteins are fragile and can be degraded easily
-proteins exist in multiple isoforms
-no protein equivalent of PCR exists for amplification
of small samples
Shotgun Proteomics:
Multidimensional Protein
Identification Technology
(MudPIT)
Liquid
Chromatography
Peptides
Characterization
Mass Spectrometry
Identification
Post Translational modifications
Quantification
Database Search
MALDI-TOF MS
-(LC)-ESI-MS/MS
Tandem Mass
Spectrometer
2D Chromatography
RP
MS/MS Spectrum
PySpzS5609 #2438 RT: 66.03 AV: 1 NL: 8.37E6
T: + c d Full ms2 729.75@35.00 [ 190.00-1470.00]
545.31
100
95
90
85
80
75
658.36
70
65
900.36
Relative Abundance
60
55
1031.40
50
45
913.42
40
1240.53
782.23
896.29
35
546.19
771.24
25
1028.41
721.31
20
431.15
15
801.38
559.13
651.14
408.74
399.24
217.91
1241.39
914.34
427.27
317.17
10
5
1032.43
895.33
30
432.40
669.39
1027.22
882.07
600.24
481.13
869.23
915.53
986.50
1258.56
1033.60
1142.43
1123.49
1312.35
1356.10
1195.44
0
200
300
400
500
600
700
800
900
m/z
1000
1100
1200
1300
1400
SEQUEST
DTASelect &
Contrast
SCX
Peptide
Mixture
MudPIT
IEX-HPLC
Trypsin
+ proteins
p53
RP-HPLC
2D Chromatography
SCX
MudPIT Cycle
load sample
wash
salt step
wash
RP gradient
re-equilibration
RP
Tandem MS Spectrum
Peptide Sequence is Inferred from Fragment ions
x 3~18
MS
(MW Profile)
MS/MS
(AA Identity)
Experimental MS/MS
Spectrum
Theoretical MS/MS
Spectra
100
#1
CALCULATE #2
#3
#4
#5
95
90
85
80
75
658.36
70
65
900.36
Relative Abundance
60
55
1031.40
50
45
913.42
40
1240.53
782.23
896.29
35
546.19
771.24
25
1028.41
721.31
20
431.15
15
801.38
217.91
559.13
651.14
408.74
399.24
1241.39
914.34
427.27
317.17
10
5
1032.43
895.33
30
432.40
669.39
882.07
600.24
481.13
869.23
K.TVLIMELINNVAK.K
L.NAKMELLIDLVKA.Q
E.ELAILMQNNIIGE.N
A.CGPSRQNLLNAMP.S
L.FAPLQEIINGILE.G
1027.22
915.53
986.50
1258.56
1033.60
1142.43
1123.49
1312.35
1356.10
1195.44
0
200
300
400
500
600
700
800
900
1000
1100
1200
1300
1400
m /z
COMPARE
SCORE
SEQUEST-PVM
Beowolf computing cluster
55 mixed CPU: Alpha chips and AMD
Athlon PC CPU
Protein
List
ASSEMBLE
DTASelect
FILTER
Criteria Sets
Contrast
COMPARE
Summary Table
Control
Purification
Cells/Tissues
Multiprotein Complex/
Organelle
Total Protein
Characterization
Database
ORF
Unknown,
uncoding,
Known,
biochem.
MIPS
6368
hypothetical
1568
or genetics
4344
YPD
6145
1833
4270
SGD
~6000
NA
NA
Ionic Homeostasis
Cellular Organization
Protein Destination
Transcription
Transport
Protein Synthesis
Metabolism
Unclassified
Summary of MudPIT
2-DE vs MudPIT
Widely used, highly
commercialized
High resolving power
Visual presentation
14,305.14
ESI Transformation
Software can be used to convert these
multiplet spectra into single (zero charge)
profiles which gives MW directly
This makes MS interpretation much easier
and it greatly increases signal to noise
Two methods are available
Maximum Entropy
Denatured Azurin
THE END