This week did not have a homework, saved here just for myself as some of the details on the web were informative.
Week 12: Imaging and Measurement

<aside> <img src="/icons/slide_green.svg" alt="/icons/slide_green.svg" width="40px" /> Lecture Slides
</aside>
<aside> <img src="/icons/video-camera_yellow.svg" alt="/icons/video-camera_yellow.svg" width="40px" /> Class Recording
</aside>
<aside> <img src="/icons/slide_green.svg" alt="/icons/slide_green.svg" width="40px" /> Recitation Slides
</aside>
<aside> <img src="/icons/video-camera_yellow.svg" alt="/icons/video-camera_yellow.svg" width="40px" /> Recitation Recording
</aside>
<aside> 🧬 Homework is based on data that will be generated in the Waters Immerse Lab in Cambridge, MA. Students will be characterizing green fluorescent protein (eGFP, a recombinant protein standard) structure (primary, secondary/tertiary) in the lab using liquid chromatography and mass spectrometry. Data generated in the lab will be available on-line for students working remotely.
</aside>
<aside> <img src="/icons/exclamation-mark_orange.svg" alt="/icons/exclamation-mark_orange.svg" width="40px" /> Mandatory to MIT/Harvard Students, optional for Committed Listeners. Edited April 23 for clarity.
</aside>
For your final project:
<aside> <img src="/icons/exclamation-mark_orange.svg" alt="/icons/exclamation-mark_orange.svg" width="40px" /> Part 1 and 2 are mandatory for Committed Listeners and MIT/Harvard Students
</aside>
Experimental Investigations
Optional Section
We will be analyzing an eGFP standard onto a BioAccord LC-MS system to determine the molecular weight of intact eGFP and observe its charge state distribution in the denatured (unfolded) state. The conditions for LC-MS analysis of intact protein cause it to unfold and be detected in its denatured form (due to the solvents and pH used for analysis).
Questions
<aside> <img src="/icons/attachment_green.svg" alt="/icons/attachment_green.svg" width="40px" /> eGFP Sequence:
VSKGEELFTG VVPILVELDG DVNGHKFSVS GEGEGDATYG KLTLKFICTT GKLPVPWPTL VTTLTYGVQC FSRYPDHMKQ HDFFKSAMPE GYVQERTIFF KDDGNYKTRA EVKFEGDTLV NRIELKGIDF KEDGNILGHK LEYNYNSHNV YIMADKQKNG IKVNFKIRHN IEDGSVQLAD HYQQNTPIGD GPVLLPDNHY LSTQSALSKD PNEKRDHMVL LEFVTAAGIT LGMDELYKLE HHHHHH
Note: This contains a His-purification tag and a linker.
</aside>
<aside> <img src="/icons/mathematics_gray.svg" alt="/icons/mathematics_gray.svg" width="40px" /> Key Equations:
$n=(\frac{m}{z_{n+1}}) / (\frac{m}{z_n} - \frac{m}{z_{n+1}})$
$Accuracy = \frac{|MW_{experiment} - MW_{theo}|}{MW_{theo}}$
</aside>

Figure 1. Mass Spectrum of intact eGFP protein from the Waters BioAccord LC-MS (a mass spectrometer with 10,000 resolution) with individual charge state peaks labeled with m/z values.
We will be digesting eGFP protein standard into peptides using Trypsin (an enzyme that selectively cleaves the peptide bond after Lysine (K) and Arginine (R) residues. These peptides, resulting from the digested eGFP will be analyzed by LC-MS to measure their molecular weight and to fragment them to confirm the amino acid sequence within each peptide – generating a Peptide Map. This process is used to confirm the primary structure of the protein.
Questions
There are a variety of tools available online to calculate protein molecular weight and predict a list of peptides generated from a tryptic digest. We will be using tools within the online resource Expasy (bioinformatics resource portal of the SIB Swiss Institute of Bioinformatics) to predict a list of tryptic peptides from eGFP.

Figure 2. Example conditions for predicting the number of tryptic peptides from eGFP standard. Please replicate all parameters shown above.
Identify the mass-to-charge (m/z) of the peptide shown in Figure 3b. What is the charge (z) of the most abundant charge state of the peptide (use the separation of the isotopes to determine the charge state). Calculate the mass of the singly charged form of the peptide based on its m/z and z ([M+H]+).
Identify the peptide based on comparison to expected masses in the PeptideMass tool. What is mass accuracy of measurement?

What is the percentage of the sequence that is confirmed by peptide mapping (Figure 5).

Figure 3a. Example LC-MS Chromatogram for eGFP Peptide Map. The peak at 2.78 minutes is circled, and its MS data is shown in the mass spectrum in Figure 3b, below.

Figure 3b. Mass spectrum figure to show m/z for a peak at 2.78 min from Figure 3a above. The inset is a zoom-in of the peak at m/z 525.76, to discern the isotope peaks.

Figure 4. MS/MS Fragmentation Spectrum of the peptide at retention time 2.26 minutes.

Figure 5. Amino Acid Coverage Map of eGFP based on BioAccord LC-MS peptide identification data.
Bonus Question
<aside> ❗
</aside>
We will be analyzing eGFP in its native, folded state and comparing it to its denatured, unfolded state on a quadrupole time-of-flight MS. We will be doing MS only analysis (no liquid chromatography).
Bonus Lab Learnings:

Figure 6. Comparison of the mass spectra between native (top) and denatured (bottom) EGFP standard on the Waters Xevo G3 Q-Tof MS.