[Research highlight] Life re-coded

In an article recently published in Science, Isaacs et al describe the replacement of all 314 TAG stop codons in the Escherichia coli genome with synonymous TAA codons, representing an unprecedented effort in large-scale genome editing.

The scientists first replaced all TAG codons in batches of ten codons across 32 separate strains using their previously-published MAGE method (Wang et al, 2009). These edited genome segments were then progressively combined using a new conjugation-based genome assembly method (CAGE). They have currently produced four strains that each have a quarter of their TAG stop codons replaced, and they hope to produce the complete TAG replacement strain in the near future. Somewhat surprisingly, no severe phenotypic consequences were observed in these replacement strains, indicating that the TAG codon is not essential, despite its near-universal presence in the genetic code of all organisms.

Indeed, the only exception to the universality of the TAG stop codon is a small selection of methanogenic archaea, and one bacterium, in which TAG encodes for the non-canonical amino acid, pyrrolysine (reviewed in Krzykci et al, 2005). Following nature’s lead, the authors hope that once they have produced the complete TAG replacement strains, they will then be able to use this free codon as a “plug-and-play” system for incorporating unnatural amino acids into proteins.

More broadly, this technology will provide an attractive alternative to wholesale chemical genome synthesis when researchers need to systematically introduce multiple genetic alterations into a genome, especially since current synthetic organism designs hew closely to natural organisms. This work may also be a first step towards creating organisms with completely rewritten genetic codes. Such fully “re-coded” organisms would have an inherent genetic “fire-wall” since they would not be able to share their genetic material via horizontal transfer or be infected by naturally occurring viruses.


Isaacs FJ, Carr PA, Wang HH, Lajoie MJ, Sterling B, Kraal L, Tolonen AC, Gianoulis TA, Goodman DB, Reppas NB, Emig CJ, Bang D, Hwang SJ, Jewett MC, Jacobson JM, Church GM (2011) Precise manipulation of chromosomes in vivo enables genome-wide codon replacement. Science 333: 348-53

Krzycki JA (2005) The direct genetic encoding of pyrrolysine. Curr Opin Microbiol 8: 706-12

Wang HH, Isaacs FJ, Carr PA, Sun ZZ, Xu G, Forest CR, Church GM (2009) Programming cells by multiplex genome engineering and accelerated evolution. Nature 460: 894-8

[Research highlight] Transcription in action

In a work just published at Nature, Churchman and Weissman (2011) describe a new method for directly capturing and sequencing elongating, or nascent, RNA transcripts. The authors then use this method to provide a detailed look at the transcriptional process in action, revealing a histone modification-dependent mechanism that constrains genome-wide antisense transcription, and pervasive transcriptional pausing and backtracking throughout genes.

The work adds to a rapidly expanding functional genomics toolkit that allows researchers to dissect evermore precise steps in the Central Dogma — the DNA to RNA to protein cascade that transforms genomic information into cellular function. See also the recent work by Cramer and colleagues that describes a method for quantifying genome-wide mRNA synthesis and decay rates (Miller et al, 2011), and the ribosome profiling technique, also developed in the Weissman lab, which can provide genome-wide views of protein translation (Ingolia et al, 2009).


Churchman LS & Weissman JS (2011) Nascent transcript sequencing visualizes transcription at nucleotide resolution. Nature 469: 368–373

Ingolia NT, Ghaemmaghami S, Newman JR, Weissman JS (2009) Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science 324:218-23

Miller C, Schwalb B, Maier K, Schulz D, Dümcke S, Zacher B, Mayer A, Sydow J, Marcinowski L, Dölken L, Martin DE, Tresch A, Cramer P (2011) Dynamic transcriptome analysis measures rates of mRNA synthesis and decay in yeast. Mol Syst Biol 7:458

[Research highlight] Laws of microbial growth

In a work recently published in Science, Scott et al reveal a series of microbial “growth laws” that describe simple relationships between translation, nutrition, and cellular growth. They show that these laws hold across different experimental perturbations and E. coli strains, and, ultimately, provide a phenomenological model describing the delicate balancing act cells maintain when deciding how much of their proteome to allocate to ribosome-related processes.


Scott M, Gunderson CW, Mateescu EM, Zhang Z, Hwa T (2010) Interdependence of cell growth and gene expression: origins and consequences. Science 330:1099-102

→ also see the related Perspective

Lerman J, Palsson BO (2010) Topping off a multiscale balancing act. Science 330:1058-9

q-bio 2010 Conference on Cellular Information Processing

This last August 11-14, systems biologists convened in beautiful Santa Fe, New Mexico, for the Fourth Annual q-bio Conference on Cellular Information Processing. The conference brought together a potent mix of theoretical and quantitative experimental biology across a wide range of topics. The full program and abstracts for each talk can be browsed on the conference’s Wiki page.

St. John’s College, the q-bio venue

Highlighting the value of systems-level analysis, many of the talks revealed the functional importance of features of biological systems that may often be tempting to disregard:

  • Thierry Emonet showed that noise in the chemotactic signaling pathways actually acts to help coordinate the bacteria’s multiple flagella.  (In fact, chemotaxis and bacterial swarming were popular topics. See also the talks by Jan Liphardt, Ned Wingreen, Victor Sourjik, Bonnie Bassler, Christopher Rao, and Yi Jiang).
  • Talks by Anat Burger and Narendra Maheshri explored the ways that non-functional transcription factor binding sites (sites that do not directly affect gene regulation) can nonetheless have dramatic effects on the dynamics of gene regulatory circuits.
  • Debora Marks discussed her work showing that saturation and competition play a potentially important role in determining the efficiency of siRNA and microRNA target gene repression. (See also her recent work in Molecular Systems Biology, Arvey et al. 2010).

The conference also hosted several excellent talks on cell cycle regulation — a classical model in systems biology research — including a closing lecture by James Ferrell and a talk by John Tyson describing his detailed stochastic model of the eukaryotic cell cycle (recently published in Molecular Systems Biology, Barik et al. 2010). See also talks by Jan Skotheim, Silvia Santos, and Xiaojing Yang. Galit Lahav also provided some exciting insights into another extremely well-studied system — p53 signaling (see Loewer et al. 2010).

In addition, two researchers studying HIV1 provided some of the most thought-provoking presentations:

    • Leor Weinberger proposed a way to treat HIV1 with a transmissible therapeutic agent, and described both cell culture experiments demonstrating the ability of their agent to slow HIV1 propagation, and computational modeling showing how this agent could spread through the human population.
  • Alex Sigal used a combination of modeling and cell culture experiments to make a compelling case that direct cell-to-cell transmission of HIV1 may help maintain a low-level “smoldering infection” during anti-retroviral drug treatment.

Naturally, these are just a few highlights from the conference, which hosted many other excellent talks. Once again, we encourage you to browse the full program and abstracts on the conference’s Wiki page.


Barik D, Baumann WT, Paul MR, Novak B, Tyson JJ (2010) A model of yeast cell-cycle regulation based on multisite phosphorylation. Mol Syst Biol 6:405

Arvey A, Larsson E, Sander C, Leslie CS, Marks DS (2010) Target mRNA abundance dilutes microRNA and siRNA activity. Mol Syst Biol 6:363

Loewer A, Batchelor E, Gaglia G, Lahav G (2010) Basal Dynamics of p53 Reveal Transcriptionally Attenuated Pulses in Cycling Cells. Cell 142:89-100

[Research highlight] NF-kappaB signaling goes digital

In a report published this week at Nature, Tay et al. reveal that populations of mouse 3T3 cells exposed to TNF-α show a digital NF-κB response, where increasing TNF-α concentrations lead to a higher proportion of cells with nuclear localized NF-κB — an effect that depends, in part, on pre-existing heterogeneity within the cell population. These results provide another compelling example of the way that studies using single cell measurements are transforming our understanding of cellular signaling mechanisms. Interestingly, these results seem to contrast with another recent single-cell-based study of NF-κB dynamics (Giorgetti et al. 2010), which observed a relatively uniform population-level NF-κB response to TNF-α in human HCT116 cells, indicating that there is still much to learn about the dynamics of NF-κB signaling.


Giorgetti L, Siggers T, Tiana G, Caprara G, Notarbartolo S, Corona T, Pasparakis M, Milani P, Bulyk ML, Natoli G (2010) Noncooperative interactions between transcription factors and clustered DNA binding sites enable graded transcriptional responses to environmental inputs. Mol Cell 37:418-28

Tay S, Hughey JJ, Lee TK, Lipniacki T, Quake SR, Covert MW (2010) Single-cell NF-kappaB dynamics reveal digital activation and analogue information processing. Nature 466:267-71

→ Also, see Cheong et al. for a history of systems biology modeling of NF-κB signaling:

Cheong R, Hoffmann A, Levchenko A (2008) Understanding NF-kappaB signaling via mathematical modeling. Mol Syst Biol 4:192.

[Research highlight] Cis-regulatory evolution, not so mysterious after all?

Animal genomes are littered with conserved non-coding elements (CNEs)—most of which represent evolutionarily constrained cis-regulatory sequences—however, it is often not clear why these sequences are so exceptionally conserved, since anecdotal examples have shown that orthologous CNEs can have divergent functions in vivo (Strähle and Rastegar 2008; Elgar and Vavouri 2008). In an article recently published in Molecular Biology & Evolution, Ritter et al. compare the functional activities of 41 pairs of orthologous conserved non-coding elements (CNEs) from humans and zebrafish (2010). Interestingly, sequence similarity was found to be a poor predictor of which CNEs had conserved function. In contrast, the authors found that measuring transcription factor binding site change, instead of simple sequence divergence, improves their ability to predict functional conservation. While this set of tested CNEs remains relatively small, these results are encouraging because they suggest that as scientists move from phenomenological measures of CNE evolution to models based explicitly on binding site evolution, the patterns of cis-regulatory evolution observed within animal genomes should become far less mysterious.


Elgar G, Vavouri T (2008) Tuning in to the signals: noncoding sequence conservation in vertebrate genomes. Trends Genet 24: 344–352

Ritter DI, Li Q, Kostka D, Pollard KS, Guo S, Chuang JH (2010) The Importance of Being Cis: Evolution of Orthologous Fish and Mammalian Enhancer Activity. Mol Biol Evol advance online publication May 21

Strähle U, Rastegar S (2008) Conserved non-coding sequences and transcriptional regulation. Brain Res Bull 75: 225–230

Keystone Symposium – Omics Meets Cell Biology (II)

thumb090202a.jpg

Before I carry on with a summary of the second part of the Keystone Symposium ‘Omics Meets Cell Biology’, I should clarify that this post and the previous one dedicated to this conference are not intended to provide an comprehensive account of all the talks but rather to communicate some general (and subjective) impressions of the meeting. To keep these posts reasonably short (and sometimes due to a lack of memory…), I had to omit several of the excellent presentations given at this meeting. The full program and complete list of speakers is available at the Keystone Symposium website.

Many of the presentations given during the second part of the meeting reported findings derived from cell-based high- or medium-throughput functional screens, most of them relying on RNAi-mediated knock-down. Here is an overview of the screens presented during this meeting, illustrating by their diversity in scope and scale the versatility of this method:

Focus # genes tested Type Speaker
autophagy 21’000? RNAi M Lipinski
sensory organ dev. 20’000 RNAi J Mummery-Widmer
cell polarity 16’000 RNAi J Ahringer
imatinib modifiers 9500 (pooled) RNAi D Sabatini
viral entry 4000 RNAi L Pelkmans
cell-cell contacts 2000 RNAi T Pawson
cell migration 1000 RNAi J Brugge
centrosome 113 RNAi L Pelletier
bipolar spindle 45 RNAi R Medema
DNA repair RNAi D Durocher
neuronal differentiation 700 TF overexpression M Snyder
gene-centered TF location yeast 1-hybrid library M Walhout
protein degradation reporter library S Elledge

Perhaps not surprisingly, many speakers emphasized that RNAi screens invariably need to be followed up by time-consuming and tedious validations. The off-target problem in mammalian cell-based RNAi screens appears also to be taken very seriously and it was reported that from 4-7 siRNA directed against the same gene were necessary to reach a good level of confidence. In view of the increasing number of RNAi-based functional screens, standards for the description of such experiments (eg. MIARE, MIACA) are likely to become increasingly useful.

In systems biology, network models are often central for the interpretations of omics data related to molecular interactions and they allow to generate biological insights which are different from those derived from the more classical screening-mechanistic-dissection paradigm. In this regard, Uwe Sauer presented exciting work on the relationship between transcriptional regulatory networks, protein expression and the state of the yeast metabolic network. Using a combination of genetic approach and drug perturbations, a series of parallel ‘fluxomic’ and metabolomic measurements revealed that metabolic fluxes, in contrast to metabolite concentrations, remain robust to perturbations and are apparently affected only by a handful of transcription factors in a given condition at steady state. At the computational level, integration of different types of data represents significant challenges. For example, it is far from trivial to find ways to exploit the information contained in interaction networks and integrate it with other types of large-scale molecular measurements. Trey Ideker exposed an efficient solution to this problem within the context of microarray profiling of breast cancers and showed that expression data can be combined with information on protein physical interactions to define improved and biologically meaningful pathway-based biomarkers for the classification of metastatic vs non-metastatic tumors.

While superposing parallel datasets leads to a ‘vertical’ integration of networks, Marian Walhout presented an approach to integrate ‘horizontally’ transcriptional and miRNA-dependent regulatory links and map a composite transcription factor/miRNA regulatory network in Caenorhabditis elegans. In this elegant work, the yeast one-hybrid assay was used as a gene-centric screening method to identify regulatory links between hundreds of transcription factors and promoters of both miRNA genes and genes encoding transcription factors. Closing the loop, the network was completed by computationally predicting the transcription factors potentially targeted by miRNAs. Interestingly, the resulting network showed numerous composite motifs including negative feedback loops (TF → miR –| TF), which are otherwise under-represented in pure transcriptional regulatory neworks.

Completion of network models may require tedious and repetitive work. To the question “who will fill the gaps?”, Steve Oliver replied: “a Robot Scientist”. He showed that an actual implementation of such a robot is able to iteratively use a computational model of the yeast metabolic network to automatically design informative experiments, perform them and use the results to extend the model. In an effort to provide a genome-scale overview of the molecular interactions that underly regulation of gene expression, Tim Hughes presented a variety of microarray-based technologies to systematically map transcription factor-DNA, nucleosome-DNA and protein-RNA interactions. The latter results were particularly intriguing given that the high-throughput identification of targets of RNA-binding proteins remains a relatively unexplored route and may reveal novel insights into the complexity of post-transcriptional regulation.

To conclude on a somewhat different note, it was also interesting to observe that an increasing number of studies were accompanied by extensive web resources providing access to the respective datasets:

Resource Lab
PhophoPep R Aebersold
Human Protein Atlas M Uhlen
3Dcomplexes.org S Teichmann
Nature Cell Migration Gateway J Brugge
EDGEdb.org M Walhout
CellCircuits T Ideker
STRING C von Mering

This situation underscores the need of a proper infrastructure to host and share (or publish?) large datasets in biology and the central role of web technologies in this regard. In view of the proliferation of biological databases, I wonder whether it might be helpful to have general recommendations on some minimal requirements for this type of databases—eg. type of searching, visualization, data integration functionalities, existence of a (web) APIs, download of datasets, possibility to integrate external datasets, etc…? Or would perhaps something like a ‘Minimum Information About a Biological Database’ be useful to specify the capabilities of databases? One may also dream that these databases will become progressively interoperable and eventually include web-based APIs facilitating programmatic access to the information stored, ultimately sending Omics in the Cloud

thumb090202b.jpg

And, oh yes, the slopes were very nice, even though, I have to admit the air was thin and a little fresh…