Protocols for Granny Weatherwax

Granny Weatherwax didn’t like maps. She felt instinctively that they sold the landscape short.

Unlike this beloved character from the Disc-world, I love maps. I love their cleanness and clarity. I love exploring my environment from my armchair (or from a place on the living room floor, when the armchair is covered in coats). But, there have been occassions when the quote from “Witches Abroad” has popped into my mind. For example: We once decided to go for a walk in Jerusalem. On the map it looked like a gentle meander – a pleasant way to while away an afternoon. Admittedly there were some clues in the map (tight circular patterns in the layout of the streets; the small space at the climax of the walk was divided into three religious sectors), but even so this was certainly an example where the map did not prepare us for how tiring and stimulating the afternoon would prove to be.

On looking at one of the protocols on my desk at the moment, the quote came to mind again. In many cases, scientific procedures lend themselves very well to the protocol format, and I love clean, clear steps invigorated by the use of the active tense. But occassionally we happen on a manuscript where there is a risk that too much information will be lost in the process of co-ercing the method into this format. In these cases, there is some value in allowing more discursive information in the steps as long as there is at least one sentence in each numbered step that is in the active tense. And hopefully in this way we somehow manage to not sell the landscape too short.

An example of how our Procedures normally read:

Steps 1 – 6: Synthesis of peracetylated mannose

Timing: ~12 h

1|  Add D-mannose 1 (1.0 g, 5.5 mmol) in pyridine (3.5 ml, 44.9 mmol) and cool the mixture to 0 °C in an ice bath. Add acetic anhydride (3.9 ml, 44.1 mmol).

2 | Add acetic anhydride to the reaction mixture through the addition funnel (addition takes ~15 min).

3 | Stir the reaction mixture for 12 h (or overnight) under N2.

4 | Evaporate the solvent and dissolve the residue in 15 ml of dichloromethane (DCM).

5 | Wash the DCM layer with 10 ml dH2O 2–3 times and separate the layers.

6 | Dry the organic layer over Na2SO4 (2 g) and evaporate the solvent, filter through filter paper and evaporate the solvent using a rotary evaporator; a white solid (2) is obtained. Throughout this protocol, MgSO4 can be used instead of Na2SO4.

Pause point: The intermediate can be stored for several months in the refrigerator if necessary.

 

Taken from “Continuous-flow reactor–based synthesis of carbohydrate and dihydrolipoic acid–capped quantum dots” (Paola Laurino, Raghavendra Kikkeri & Peter H Seeberger)

 

Some examples of more discursive Procedures:

1 | Select the fluorescent protein fragments to be used. Several combinations of fluorescent protein fragments support bimolecular fluorescence complementation11; those recommended for BiFC analysis are listed in Table 2. For most purposes, fragments of YFP truncated at residue 155 (designated YN155 and YC155) are recommended, because they exhibit a relatively high complementation efficiency when fused to many interaction partners, yet produce low fluorescence when fused to proteins that do not interact with each other2. Fragments of YFP truncated at residue 173 (designated YN173 and YC173) can also be used11, and may exhibit a different efficiency of complementation owing to differences in the steric constraints imposed by tethering of the fragments to the protein complex. Fragments of Venus (a mutated GFP with high fluorescence intensity)20 truncated at either residue 155 or 173 (designated VN155 and VC155, or VN173 and VC173, respectively) produce a significantly brighter fluorescent signal when fused to specific interaction partners21. However, these fragments also produce a brighter signal when fused to proteins that do not selectively interact with each other21. These fragments have the great advantage that the bimolecular fluorescent complex is readily detectable at 37 °C, which avoids the incubation at 30 °C that is generally necessary to detect complementation using YFP fragments. Other combinations of fluorescent protein fragments can also be used, especially when using BiFC analysis for the visualization of multiple protein complexes in the same cell11.

2 | Determine the sites where the fluorescent protein fragments can be fused to the putative interaction partners. Determine the positions of the fusions empirically to fulfill the three criteria described below.
First, ensure that the fusions allow the fragments of the fluorescent proteins to associate with each other if the putative partners interact. Information about the structure and location of the interaction interface may be useful to determine optimal positions for the fusions. However, this information is not essential because fusions that can be used for BiFC analysis can be identified by screening multiple combinations of fusion proteins for fluorescence complementation. One strategy for the identification of fusion proteins that allow bimolecular fluorescence complementation is to fuse each of the fluorescent protein fragments to the N- and C-terminal end of each interaction partner, and to test for complementation in all eight combinations that contain both fragments of the fluorescent protein (Fig. 2).

Second, confirm that fusions do not affect the localization or the stabilities of the proteins by comparing the localization and expression levels of the fusion proteins with those of wild-type proteins lacking the fusions; indirect immunofluorescence and immunoblot analyses can be used.

Third, test the fusion proteins for all known functions of the endogenous proteins to ensure that the fusions do not affect the functions of the proteins under investigation.

Troubleshooting

3 | Select linkers to connect the fragments to the proteins of interest. The linkers must provide flexibility for independent motion of the fluorescent protein fragments and the interaction partners, allowing the fragments to associate when the proteins interact. We have used the RSIAT and RPACKIPNDLKQKVMNH linker sequences in many fusion constructs used for BiFC analysis2, 11. These linkers have been used for the visualization of interactions between many structurally unrelated proteins. The sequence AAANSSIDLISVPVDSR encoded by the multiple cloning sites of the pCMV-FLAG vector (Sigma) has also been successfully used as a linker in many BiFC experiments. A peptide sequence designed to be flexible, such as (GGGS)n, can also be used, although it can potentially affect the degradation of the fusion protein. Although these linker sequences have worked well for the proteins examined previously, it is possible that linkers of a different length or sequence are optimal for BiFC analysis of interactions between other proteins.

Troubleshooting

4 | Select a cell culture system. Choose a cell culture system that represents the biological context to be investigated, and allows efficient introduction of DNA into a large fraction of the cells. Cells that grow as an adherent monolayer are generally easier to image. The BiFC assay has been used for the analysis of protein interactions in many mammalian cell lines including COS-1, HEK293, HeLa, Hep3B, TN4, and NIH3T3 cells as well as in intact organisms2, 10, 12, 18, 19, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66.

5 | Select a strategy for expression of the fusion proteins. Choose either transient expression (A) or stable expression (B) strategies, based on the purpose of the experiment.

 

Taken from “Design and implementation of bimolecular fluorescence complementation (BiFC) assays for the visualization of protein interactions in living cells” (Tom K Kerppola)

 

Step 32: Data preprocessing

A GC-TOF-MS data analysis

Timing: 4–6 h for target list generation, 2–3 h for raw data processing of data acquired over 5 d

(i) Using LECO’s terminology, perform a ‘peak find’ data processing method with a single QC sample injected in the middle of the block experiment. The data processing method should have ‘Baseline’, ‘Peak Find’, ‘Calculate Area/Height’ and ‘Retention Index’ functions activated. Key parameters in this method are the baseline offset, data points to be averaged for smoothing, expected chromatographic peak width, maximum number of unknown peaks to find and the minimum signal-to-noise ratio for the (automatically selected) quantitation mass. All parameters are sensitive to the chromatographic performance obtained and must be selected to reflect this. From representative chromatograms acquired in the HUSERMET project, in which we analyzed thousands of human serum samples with GC-MS, baseline offset was set at 0.5, data points to be averaged for smoothing was set at automatic, peak width was set at 1.8 s and the maximum number of unknown peaks to find was set to 400. A signal/noise (S/N) threshold of 100:1 was used; this was an informed compromise between comprehensive reporting and the collation of spectra of sufficient quality to be reliably found subsequently. A retention index method is prepared in the software by compiling a method table containing the retention indices (1,000, 1,200, 1,500, 1,900 and 2,200), the observed retention time and the quantitation ions used to confirm the detection of each retention index compound.

(ii) Step 32A(i) produces a table of potential candidates for inclusion in a reference table and annotated with a retention index, mass spectrum and single quantitation ion. From this table, delete candidates whose mass spectrum does not contain fragment ions expected for TMS derivatives at m/z 73 and 147, and whose quantitation ion chromatogram indicates that a single mass spectral feature has been reported as multiple features (‘peak splitting’). In these cases, delete the features with lowest S/N while retaining the feature with the highest S/N. Manually edit the mass spectrum for the isotopically labeled internal standards to remove ions present in the unlabeled endogeneous metabolite. Assess the automatically chosen quantitation masses for accuracy, a high S/N ratio and no interference to peak shape from co-eluting derivatized metabolite peaks. Amend the quantitation mass if necessary. The metabolite peaks are then exported to a reference file created before Step 32A(i). Parameters in the reference table are set at 100,000,000 for tolerance (to ensure all peaks are matched and reported independent of peak area), 20 for RI deviation, 700 for match threshold, 2,500 for minimum area and 5.0 for S/N threshold.

(iii) A separate study sample can then be processed through the deconvolution software, as described in 32A(i), with the ‘Compare’ function also enabled. To do this, set the mass threshold setting at 50. Derivatized metabolic features uniquely detected in this sample are marked, the mass spectrum and quantitation masses are assessed as described above in Step 32A(ii) and then exported to the reference file. This process is performed for a range of samples from the study.

Critical step: In large-scale studies, we recommend performing Step 32A(iii) on samples from different experimental blocks to ensure that all derivatized metabolite peaks are present in the reference file.

(iv) Each peak in the reference file is named with a unique label (e.g., internal standard succinic d4 acid, sample peak X). At this stage, definitive identification of each peak can be performed. To do this, compare the retention index and mass spectrum of each metabolite with those recorded for authentic chemical standards and present in in-house libraries (e.g., Golm metabolome database or MMD in-house library) or in commercially available mass spectral libraries (e.g., NIST, EPA or NIST05 libraries) (see Experimental design). If a match to a retention time/index (± 10) and mass spectrum (match >70%) is observed, the identification can be described as definitive and the peak can be labeled metabolite name_definitive. If a match to only a mass spectrum is observed, the identification can be described as putative and the peak can be labeled ‘metabolite name_putative’.

(v) The final stage is used to define the most appropriate internal standard for each peak. This can be performed by analyzing 60 QC injections in a single block. Calculate the peak area ratio (peak area metabolite/peak area internal standard) for each metabolite peak associated with each internal standard and calculate the relative standard deviation (RSD) for each of these peaks for injections 6–60. The internal standard providing the lowest RSD is chosen as the internal standard for that metabolite.

(vi) Perform raw data processing using the reference table described above (Step 32A(i–v)) for all samples to reliably find and report the selected metabolic features in all samples. Process all the blocks using the appropriate set of parameters and internal standard selections. As noted, automatic feature detection and measurement achieves a high success rate (estimated to be in excess of 98%), which was further improved by manually inspecting the peak area measurements for each internal standard in each sample, and manually correcting where required. Further outlier rejection tests can be performed on a block basis before accepting data. This has led to the rejection of <1% of the injections performed.

Pause point: Archive processed data for future use.

 

Taken from “Procedures for large-scale metabolic profiling of serum and plasma using gas chromatography and liquid chromatography coupled to mass spectrometry” (Dunn et al.)

Membrane protein protocols

While browsing recently published papers in other journals, I came across Supramolecular fishing for plasma membrane proteins using an ultrastable synthetic host-guest binding pair (published in Nature Chemistry). The idea here is that proteins on the surface of the cell are chemically modified by reaction with 1-trimethylammoniomethylferrocene (AFc). After cell lysis, the modified proteins bind to beads coated with cucurbit-7-uril. Cucurbit-7-uril and AFc interact to form an ultrastable host-guest binding pair. The captured proteins are then recovered either by treatment with a strong competitor or by heating to 95 C in buffer conditions.

I thought that this was rather pleasing! It reminded me of a protocol we had published on the metabolic labeling of glycans with azido sugars, and got me to thinking: “What sorts of protocols do we have for looking at membrane proteins?”.

Using our Browse functionality plus a PubMed search I came up with 17 that I thought would be relevant (please forgive me (and let me know!!) if I have missed something vitally important). These protocols can be roughly divided into those working with a specific protein that you already know (the majority), and those where you are working with membranes from whole cells. When looking at the structure and function of purified membrane proteins, one of the main challenges seems to be the design of a matrix such that you can get meaningful data from proteins that are in conditions as similar as possible to the cell membrane itself.

These are some notes that I made while going through the protocols:

Expression and purification

Preparative scale expression of membrane proteins in Escherichia coli-based continuous exchange cell-free systems

– This protocol includes methods for preparing E.coli s30 extracts and T7RNAP.

GFP-based optimization scheme for the overexpression and purification of eukaryotic membrane proteins in Saccharomyces cerevisiae

– This protocol includes a method for transforming a gene-vector construct into S. cerevisiae.

– Purification is via a His-tag

Purification of recombinant membrane proteins tagged with calmodulin-binding domains by affinity chromatography on calmodulin-agarose: example of nicotinamide nucleotide transhydrogenase

Looking mostly at structure

A general protocol for the crystallization of membrane proteins for X-ray structural investigation

– His-tagged proteins are expressed in E.coli. The membranes are isolated and solubilised. Purification on a nickel column is followed by removal of the his-tag and size exclusion chromatography. Crystallisation is by hanging-drop vapour diffusion.

Crystallizing membrane proteins using lipidic mesophases

Bicelle samples for solid-state NMR of membrane proteins

– Isotope labelled proteins are analysed in bicelles. The lipid bilayers of these are aligned perpendicular to the magnetic field forming a liquid crystalline phase.

Synthesis of TOAC spin-labeled proteins and reconstitution in lipid membranes

– The protein is synthesised via FMOC chemistry. When analysed by electron paramagnetic resonance, the TOAC residue accurately reports the position, orientation and dynamics of the peptide backbone near the labelled site.

Atomic force microscopy and spectroscopy of native membrane proteins

– The authors use purple membrane from Halobacterium salinarum as an example. The stylus of the AFM is used to obtain the topography of the membrane as well as to mechanically manipulate the protein. Measurements taken in the process of unfolding the protein provide information regarding the molecular interactions within it.

The fluorescence protease protection (FPP) assay to determine protein localization and membrane topology

– This protocol uses GFP-fusion proteins. The link will take you to a cartoon showing a single cell before and after exposure to digitonin and trypsin.

Site-directed alkylation of cysteine to test solvent accessibility of membrane proteins

– Each residue in the protein is systematically mutated to cysteine. The proteins are reacted with [N-ethyl-1-^14^C]ethyl-maleimide, separated by SDSPAGE and analysed using a PhosphoImager.

Looking mostly at function using specific proteins

Gel chromatography and analytical ultracentrifugation to determine the extent of detergent binding and aggregation, and Stokes radius of membrane proteins using sarcoplasmic reticulum Ca2+-ATPase as an example

– This method uses the radioactive detergent ^14^C-n-dodecyl-β-D-maltoside. The Stokes radius is determined either by size-exclusion chromatography or by ultracentrifugation sedimentation velocity analysis.

Using patterned supported lipid membranes to investigate the role of receptor organization in intercellular signaling

Expression cloning and radiotracer uptakes in Xenopus laevis oocytes

– This protocol is at the border of a few sections as it uses a functional assay (uptake of a radiotracer appropriate to the type of membrane protein that you are interested in) to either find new proteins or work what mutations will result in changes in transport activity.

– cRNA is injected into the oocytes.

Two protocols that I wasn’t certain how to classify

The preparation and development of cellular membrane affinity chromatography columns

– The membrane proteins are isolated by centrifugation (usually from a transfected cell line containing a target protein) and attached to a stationary phase.

– Frontal affinity chromatography using labelled ligands is used to measure molecular interactions.

Detecting interactions with membrane proteins using a membrane two-hybrid assay in yeast

– This protocol is for the MYTH system shown in the figure below.

Proteomic-type protocols

Quantitative proteomic approach to study subcellular localization of membrane proteins

– The method used is called LOPIT (localisation of organelle proteins by isotope tagging).

– Cell membranes are separated into fractions by equilibrium density gradient centrifugation. Each fraction is labelled using a different iTRAQ reagent, and the samples are pooled.

Identification of membrane proteins from mammalian cell/tissue using methanol-facilitated solubilization and tryptic digestion coupled with 2D-LC-MS/MS

– The key features of this protocol are (1) the use of 60 % methanol to dissolve the membrane-protein sample, and (2) that the strong cation exchange chromatography is done off-line to allow for optimisation of the fractionation to improve the capacity of the LC-MS/MS analysis. This is significant, because it allows the identification of 1500-2500 unique proteins.

Focusing on stem cells

Since Nature Protocols launched we’ve been having a shall-we-shan’t-we discussion about the relative merits of article series. On one hand they seem such a great idea – bringing together lots of protocols in a specific area to make life easier for researchers in that area. On the other hand we worry they’ll alienate researchers working on other things – why do the mass spectrometrists get special treatment and not us? And how do you decide what to include on such a page – anything vaguely connected or just the hard core of a particular topic? Then finally, as an Editor I just find myself wondering why I am making more work for myself…

But back in 2009 we took the plunge and launched our Stem Cell Series page. At the time stem cells were all the rage – if you watched the TV news at night you’d think all diseases were on the verge of being cured by these fantastic new cells. Unless you lived in the US, where they were not viewed so kindly by some.

Recently the page has slipped a bit in priorities – launching the Protocol Exchange was quite a job. But we feel it’s now time to revitalise it. Stem cells remain a hot research topic, and the optimism about their potential remains. Each week more and more possible applications are being devised. I’m pleased to see that our recently published stem cell protocols are reflecting this increased variety. Currently featured, in addition to our typical protocols showing how to get your stem cells initially, or how to go from stem cells to your cells of choice, there are some intriguing ways to look at your stem cells. Some of these methods can be applied to cells other than stem cells so do check it out even if your favourite cell is not a stem cell.

The juries still out on whether series pages are a useful addition to our site. So, if you’ve not checked for a while, or if you’ve not looked at all, we do suggest you take a peek. And do let us know what you think – should we run series or not? And if so, what topics should we cover?

Old and new protocols on stable isotope labelling of proteins for mass spec

On Thursday last week, we published a protocol for spike-in SILAC, a method to analyse the proteome of tissues and organisms developed by Yasushi Ishihama & Matthias Mann and co-workers. This is one of many protocols that we have published that uses stable isotope labelling of proteins or peptides prior to separation and analysis by mass spectrometry. A list of these can be found here.

In fact, the very first protocol that I edited was written by John Asara and was on in-gel stable isotope labeling for relative quantification using mass spectrometry (click here for an overview). This is an adaptation of standard in-gel digestion protocols where gel regions of interest are labelled at lysine residues with either light or heavy isotope-labeled reagents. The parallel slices from the samples-to-be-compared are combined before protease digestion and the resulting peptides are analysed by LC/MS to determine relative abundance of light- and heavy-isotope lysine-containing peptide pairs and analyzed by LC/MS/MS for identification of sequence and modifications.

The chances of success are greater in samples where the complexity has been reduced by e.g. immunoprecipitation or chromatographic separation, and while it has the advantage of simplicity, it has been found to have limited utility for looking at post-translational modifications.

In other approaches, stable-isotope labelling is performed after digestion of the sample. Two that look at specific post-translational medications are: Mass spectrometric identification of N-linked glycopeptides using lectin-mediated affinity capture and glycosylation site-specific stable isotope tagging and Chemical derivatization of histones for facilitated analysis by mass spectrometry

Two that are more generally applicable are: Simultaneous analysis of relative protein expression levels across multiple samples using iTRAQ isobaric tags with 2D nano LC-MS/MS and Multiplex peptide stable isotope dimethyl labeling for quantitative proteomics.

IGOT (isotope-coded glycosylation site-specific tagging) is applicable to the identification/analysis of N-linked glycoproteins and the steps are: (1) digestion of protein mixtures, (2) enrichment of glycopeptides by lectin column-mediated affinity capture, (2) purification of the glycopeptides by hydrophilic interaction chromatography (HIC); (3) reaction of the glycopeptides with [^18^O]-H~2~O in the presence of peptide-N-glycanase resulting in an ^18^O tag specifically at the N-glycosylation site (the Asn residue carrying the sugar chain is converted to Asp with concomitant incorporation of the ^18^O from the solvent); and (4) identification of ^18^O-tagged peptides by LC/MS. The protocol typically results in a list of hundreds of glycoproteins and their sites of glycosylation.

Looking at the post-translational modifications of histone proteins can be rather tricky; they contain a disproportionally large number of arginine and lysine residues especially on the N-termini where most of the PTMs occur making it difficult to obtain peptide sequences that can be easily or meaningfully analysed.

Garcia and co-workers’ method involves blocking the lysine residues of extracted histones with propionic anhydride and trypsin digestion (which will now only cleave after arginine). At this stage, you can do stable isotope labelling step where carboxylic acid groups are reacted with heavy or light forms of a freshly prepared “methyl ester reagent” (methyl acetate) made from methanol (CH3OH or CD3OD) and acetyl chloride. This would enable you to measure the relative abundance of a post-translational modification between two treatment types.

In iTRAQ, multiple samples are compared by labelling them individually tags that all have the same molecular mass (isobaric). The reactive group on the tag is N-hydroxysuccinimide, and the tags react with all available amine groups. The samples are pooled, fractionated and analysed by MS. The advantage of this isobaricness is that the initial mass spec peak of the peptide will have an intensity that is the sum of the intensities from all the samples. The amino-acid sequence ions also show this summed intensity, but on further fragmentation a tag-specific reporter ion is released. The relative intensities of these ions represent the relative amount of peptide in the analytes. There is a good article addressing accuracy and precision issues in iTRAQ quantitation.

In triplex stable isotope dimethyl labelling, these free amines are reacted with a mixture of formaldehyde (H~2~CO) and NaBH~3~CN; OR D~2~CO and NaBH~3~CN; OR D~2~^13^CO and NaBD3CN to form light, intermediate and heavy derivatives. This reaction can be performed in-solution, online with LC-MS or on-column using SepPak columns.

This article also includes a table comparing the relative advantages of dimethyl labelling, iTRAQ and SILAC.

This table is interesting in itself, but also because it illustrates that there are two overall approaches to labelling peptides with stable isotopes: non-metabolic and metabolic. And really, when we talk about metabolic labelling we mean stable isotope labelling by amino acids in cell culture. Even spike-in SILAC, the new protocol that we have published, seems to be a method that you would use when “regular SILAC” would not be possible or practical.

Here is a figure showing the differences between SILAC by the normal method and by the “spike-in” method, both methods developed by Matthias Mann and co-workers.

In SILAC, each sample of cells in culture are fed a diet that contain light, medium or heavy forms of arginine and lysine. The samples are then combined before the steps to lyse the cells and digest the proteins are performed. The main advantage of this approach is that multiple cellular states are combined and then processed in a single workflow enabling very accurate measurement of the differences in protein concentrations between the samples. Even elaborate organelle purifications, like nucleolar preparations, can be performed without any adverse effects on quantitative accuracy. Complex experimental designs involving multiplexing several three-state experiments by linking them with a common experimental state can also be performed.

Another protocol from Akhilesh Pandey’s lab demonstrates the use of SILAC for: studying inducible protein complexes, identifying tyrosine kinase substrates, differential membrane proteomics and studying temporal dynamics.

While SILAC can be performed on the organism level, it can take a long time to achieve complete labelling of an organism and it is restricted to systems where the complete proteome can be labelled by proteome turnover (this precludes human tisse samples, for example). The diet restrictions required may also interfere with the ability to answer the specific scientific question of interest, so for the most part SILAC is limited to experiments using cultured cells. These disadvantages can at least be partially overcome by “spike-in SILAC”. In this case, only one cell state (or experimental state of animal) is subjected to SILAC. Processing of this sample results in a reference protein or proteome that can be added to the other experimental states as an internal standard. This is typically done after cell lysis and before protein digestion. By choosing an appropriate mixture of different cell lines, it is also possible to create a reference proteome that could be used for the analysis of, for example, cancer tissue samples.

While these are the protocols we have in our content at the moment, I am sure that there are many that we have not covered. Whole other subject areas (perhaps for future blog posts) are absolute quantitation of proteins and stable isotope labelling for NMR. It is always amazing to me that each field of study is one of many, and yet behind every detail there is probably a whole thesis of complexity.