Nature Methods announces online methods

Nature Methods follows in the footsteps of Nature by ushering in an online methods section, fully integrated with the paper, for all original research articles. Details of the service described in the journal’s current (May) Editorial (Nature Methods 6, 313; 2009), and the editors welcome comments on the service at Methagora, the Nature Methods blog.

Daniel Evanko, Chief Editor of Nature Methods, writes: “We are relieved that we will no longer have to relegate important methodological details to Supplementary Information and we expect our authors will appreciate being able to include more citations in their papers. A potential downside of this change is that the print and online versions of papers have quite different levels of methodological detail. What do you think? Those of you who are online readers may not have very strong opinions on this, but what about our print readers? If anyone who regularly receives a print copy of the journal is reading this, we would ”https://blogs.nature.com/nmeth/methagora/2009/04/methods_section_remake.html">like your feedback as well."

From the Editorial: “We expect that our readers and authors will appreciate the advantages that Online Methods bring to Nature Methods. With this change effectively increasing the length of Nature Methods papers—and more than doubling the length of Brief Communications—our authors will have far more space to communicate their new methodologies and cite previous work. But by limiting the increase in length to the methods section we continue to emphasize the value of succinct scientific reports. The body of the paper will remain short enough that casual readers can easily obtain the important information. The details required for more in-depth understanding or reproduction of the work will be easily accessible if needed. We hope our authors and readers are as excited by this change as we are.”

Nature Methods journal website.

Nature Methods guide to authors.

Nature‘s formats for methods.

Methods in full, the Editorial announcing Nature’s introduction of this service (Nature 445, 684; 2007).

Nature Methods on “big data” and the scientific method

The rise of ‘omics’ methods and data-driven research presents new possibilities for discovery but also stimulates disagreement over how science should be conducted and even how it should be defined. Is the ability of these methods to amass extraordinary amounts of data altering the nature of scientific inquiry? These are the issues dicussed in the April Editorial of Nature Methods (6, 237; 2009).

“Methodological developments are now making it possible to obtain massive amounts of ‘omics’ data on a variety of biological constituents. These immense datasets allow biologists to generate useful predictions (for example, gene-finding and function or protein structure and function) using machine learning and statistics that do not take into account the underlying mechanisms that dictate design and function—considerations that would form the basis of a traditional hypothesis.

Now that the bias against data-driven investigation has weakened, the desire to simplify ‘omics’ data reuse has led to the establishment of minimal information requirements for different types of primary data. The hope is that this will allow new analyses and predictions using aggregated data from disparate experiments.”

The Editorial goes on to ask whether the generation of parts lists and correlations in the absence of functional models is, in fact, science? “Based on the often accepted definition of the scientific method, the answer would be a qualified no. But the rise of methodologies that generate massive amounts of data does not dictate that biology should be data-driven. In a return to hypothesis-driven research, systems biologists are attempting to use the same ‘omics’ methods to generate data for use in quantitative biological models. Hypotheses are needed before data collection because model-driven quantitative analyses require rich dynamic data collected under defined conditions and stimuli.

Correlations in large datasets may be able to provide some useful answers, but not all of them: ‘omics’ data can provide information on the size and composition of biological entities and thus determine the boundaries of the problem at hand. Biologists can then proceed to investigate function using classical hypothesis-driven experiments. It is still unclear whether even this marriage of the two methods will deliver a complete understanding of biology, but it arguably has a better chance than either method on its own.”

Comment on this Editorial at Nature Methods’ Methagora blog.

Raising the bar for micorarray standards

“Investigating the compliance of our publications with MIAME standards (minimum information about a microarray experiment; Editorial, ”https://www.nature.com/ng/journal/v38/n10/full/ng1006-1089.html">Nat. Genet. 38, 1089; 2006), we found that even when authors and referees are aware of community standards and even with editors mandating both data deposition and accession linking as a condition of publication, a proportion of microarray datasets were at that time unavailable or incomplete." So starts the Editorial in this month’s (February issue) Nature Genetics (41, 135; 2009).

In an Analysis article in the same issue of the journal (Nat. Genet. 41, 149-155; 2009), John P. A. Ioannidis and collaborators (four teams) treated the findings of a number of microarray papers published in Nature Genetics between 2005 and 2006 as their gold standard, and attempted to replicate a sample of the analyses conducted on each of them, with frankly dismal results.

According to the Editorial, “the findings of this Analysis should be used to improve practice rather than to critize the authors and referees of these publications. A certain amount of both skepticism and initiative must of course be assumed on behalf of all readers and users of research publications. Equally, there must be enough goodwill and professionalism in the research community to permit critical reanalysis of research findings at any and every moment without this core scientific practice implying any personal criticism. Any scientist should be prepared to reexamine published work, one’s own and one’s colleagues’ alike. In doing so it always helps to make clear one’s needs and assumptions, and the Analysis in this issue does indeed explain the limits of the analysts’ requirements and critical aims.”

The journal, and other Nature journals that publish papers describing microarrays, now insist that authors deposit their data to GEO or ArrayExpress before the submitted paper is sent for peer-review.

Nature journals’ policies on data and materials availability, including microarray deposition.

MGED website, specifying MIAME standards necessary to interpret and reproduce microarray data.

Incentives needed for genome annotation

Roy Welch and Laura Welch of Syracuse University, New York, examine why researchers seem reluctant to be more directly involved in the annotation of microbial genomes in the February issue of Nature Reviews Microbiology (7, 90; 2009). They write:

“To annotate an organism’s genome, biological information about the organism must be matched to the genes and genetic elements in the sequenced genome. The process is iterative and open-ended: new information is constantly incorporated into the annotation. It can also be recursive: analysis of the annotation may provide insight about the organism that in turn leads to changes to the annotation. Unfortunately, the generation of new information and annotation of the genome are at present completely separate processes. Often new information does not become incorporated into the annotation in a timely manner, a costly loss for those who rely on it to advance their research.

The community of expert researchers who study an organism produce most of the information that becomes part of the annotation and are also the primary group of end-users. It is therefore curious that the annotation process is circuitous and inefficient: researchers communicate new information not as direct updates to the annotation, but as research papers that must later be interpreted and incorporated into the annotation separately — most often by a third party! Indeed, some information never finds its way into the annotation. It would be far more efficient for the research community to contribute directly to genome annotation. Yet the life science community as a whole remains stuck in the old, inefficient paradigm.”

The authors go on to argue that technology is not the impediment, given the wide availability of wikis (collaborative editing websites) and the databases that have been created using these technologies, including EcoliWiki, GONUTS, Myxopedia and Wikipathways. Rather, state the authors, the impediment seems to be sociological: until contributions to a genome-annotation collaborative information repository can be credited by inclusion in a PhD thesis, curriculum vitae, tenure application or grant proposal, direct collaborative annotations are unlikely to fulfil their promise and potential to accelerate scientific achievement.

Call for authors to deposit microarrays in public databases

In a Correspondence to Nature Methods (5, 991; December 2008) responding to an Editorial in the March 2008 issue of the journal (Nat. Meth. 5, 209; 2008) , Scott A Ochsner, David L Steffen, Christian J Stoeckert, Jr and Neil J McKenna report a study showing that researchers are not routinely depositing supporting raw microarray datasets into a public database.

The Correspondence authors surveyed papers from the 2007 issues of 20 journals, searching the text for reference to deposition of a microarray dataset. They find that the rate of deposition of datasets was less than 50 per cent. The authors note the effort required by authors to deposit these complex data in public microarray repositories, even though repositories are simplifying submissions while encouraging compliance with MIAME (minimum information about a microarray experiment) standards. They write: “Although microarray datasets are most useful to bioinformaticians in their raw, unnormalized forms, which facilitate cross-comparison with other datasets, processed datasets are more useful to the bench scientist. Moreover, unless a description of the experimental details is available, neither form of the data are biologically interpretable.” They urge repositories to require deposition by authors and propose journals require a statement in the manuscript identifying a repository and accession number at the time of submission, with the record embargoed until acceptance of the paper. (Of the 16 Nature journal papers that were part of the survey, such accession numbers were provided in 15 cases.) They conclude: “Seven years after the elaboration of the MIAME principles, the emerging discipline of microarray meta-analysis, exemplified by the cancer gene expression resource ”https://www.oncomine.org/“>Oncomine, continues to be hobbled by the mundane, time-consuming and often fruitless exercise of tracking down annotated full datasets. We call for a renewed collective effort from researchers, publishers and funding organizations to redress this situation and secure these data-rich research resources for posterity.”

The full text of the Nature Methods Correspondence, with supporting data, is here.

Policy note: the Nature journals have for some years required authors to submit MIAME-compliant microarray data to the GEO or Arrayexpress public repository. Details of the journals’ polices can be found here.

Structural genomics – December update

The Structural Genomics Update for December reports a centralized system created by the Protein Structure Initiative (PSI) that allows investigators an easy way to submit protein target suggestions to the ten PSI structural genomics centres. These proposals are evaluated for feasibility and consistency with the overall goal of the programme. The four large-scale production centres are pursuing structural studies of more than 1,400 community-nominated targets. The six PSI specialized centres, which focus on various structure-determination bottlenecks, also consider target nominations. For further information, guidelines and submission service, see the news article.

In the rest of the December update, see the featured molecule (scavenger decapping enzyme DcpS), selected free-to-access research articles from across Nature Publishing Group journals, as well as other articles and news, including an events calendar.

The PSI-Nature structural genomics knowledgebase is a free service, designed to turn the products of the Protein Structure Initiative into knowledge that is important for understanding living systems and disease. Use the site to explore the PSI’s work, and stay informed about advances in structural biology and structural genomics by signing up to the monthly e-newsletter.

Nautilus post announcing launch of the structural genomics knowledgebase.

Previous Nautilus posts about structural genomics.

Nature Methods, looking back and moving forward

The fourth anniversary of Nature Methods’ arrival on the publishing scene and a change in leadership offer an opportunity for reflection and editorial fine-tuning, as described in the journal’s November Editorial (5, 911; 2008).

From the Editorial: “When Nature Methods made its debut in October 2004, just over 4 years ago, it was an anxious but exciting time for our founding chief editor Veronique Kiermer and manuscript editors Nicole Rusk and Daniel Evanko. We were all novices at scientific publishing and more comfortable calibrating a pipette than editing a fledgling journal.” The Editorial goes on to outline developments and other changes at the journal since then. Veronique is taking on the role of publisher for Nature Methods and Nature Protocols, and Daniel is taking over as Chief Editor of Nature Methods. Reviews, Perspectives and Research Highlights are to be expanded, while the Protocols section is closing. (Authors are encouraged to submit their protocols to the online publication Nature Protocols.) The Editorial concludes: “We hope that our journal has helped dispel the notion that methods are less important than results and deserving only of small print at the end of a paper. Debunking this myth has been and will continue to be our main mission. We will persist in our efforts to bring you, every month, a journal that allows methods to be featured prominently in their own right—as the cornerstones upon which results are based.”

Nature Methods guide to authors.

How to submit to Nature Methods.

Aims and scope of the journal.

Methagora, the Nature Methods blog.

Historical microbiology archive made free to all

In its November Editorial, Nature Reviews Microbiology (6, 794; 2008) reports that the archive of the International Journal of Systematic and Evolutionary Microbiology (IJSEM) has been made available free online: a boon for scientists, historians and the public. The Society for General Microbiology publishes IJSEM on behalf of the International Committee on Systematics of Prokaryotes of the International Union of Microbiological Societies. The society has now provided funding for the entire back archive of the journal to be made freely available worldwide without a journal subscription. (The current content, or past two years, remains subject to access controls.)

From the Nature Reviews Microbiology Editorial: Systematics is the foundation for studies of all types of organisms, because it helps us to understand how one organism relates to another. The value of systematics is often underappreciated, however, for bacteria and viruses. For example, there is a huge imbalance between the 7,000 named bacterial species and the 1,000,000 named insect species. This is particularly important given that it is now well-known that bacteria and viruses are the most populous organisms on Earth, and furthermore, that more than 99% of bacteria have yet to be cultivated. Why should we be interested in naming and characterizing different species of bacteria? The advent of metagenomics has swelled the literature with ever-increasing estimates of numbers and types of bacteria and viruses in the biosphere. An important adjunct to genomics-based approaches is the detailed characterization of these myriad species and investigation of the relationships between them. The availability of the IJSEM archive will hopefully spur renewed interest in this area.

Jean Euzeby, the IJSEM list editor, maintains an incredibly useful web resource that details all those species that have been ratified — the List of Prokaryotic names with Standing in Nomenclature. Another useful site named Bacterial Nomenclature Up-to-Date has an up-to-date list of bacteria and is based on the work of Norbert Weiss, who maintained the database until his retirement in February 2003. The current database is maintained under the supervision of Manfred Kracht. Finally, a comprehensive taxonomy of the Bacteria and Archaea can be found in the Taxonomic Outline of Bacteria and Archaea (TOBA) Release 7.7, which was last updated in 2007.

Other useful resources are described in the Editorial.

To show or not to show data

‘Data not shown’ is an outdated caveat that obscures the transparency of a scientific report and weakens the peer review process, according to Nature Chemical Biology (4, 575; 2008).

“Technology and competition perpetually raise the bar for the quality and quantity of experimental data that authors must include to publish a high-impact manuscript. Almost uniformly, journals have amended their formats to accommodate the increased volume of data while maintaining page restrictions by providing the supplementary information option online for authors and readers. Despite these changes, many authors still rely on the caveat of ‘data not shown’. At Nature Chemical Biology, we discourage the use of this phrase and the omission of important data for two major reasons. First, the exclusion of essential data undermines the peer review process, and second, readers need access to data to form independent opinions about and to replicate the results of published papers. Thus, we suggest that the time for ‘data not shown’ has passed.”

Read the rest of this Editorial here.

For additional details on presenting and consolidating methods, see the journal’s Guide to Authors.

See the NPG authors’ and referees’ website for more details on our data sharing and database deposition policies.

Launch of Protein Structure Initiative-Nature Structural Genomics Knowledgebase

Last week, Matt Day announced Nature Publishing Group (NPG)‘s latest website: the Protein Structure Initiative (PSI)-Nature Structural Genomics Knowledgebase. Matt writes that the new addition to NPG’s existing collection of gateways and databases is “a collaboration with the Protein Structure Initiative, a large scale NIH-funded consortium to develop and apply high-throughput techniques for protein structure determination. They’ve been highly successful in generating new technologies that are available for others to use, and they’ve shown that structure determination work can be scaled up significantly.

Now that the site is launched, we’ll be providing monthly editorial updates that put developments in structural work into context for a wide range of biomedical researchers….The website is hosted at Rutgers University by the same team that hosts one of most significant and long-established databases, the Protein Data Bank”.

The Structural Genomics Knowledgebase (SGKB) offers researchers and others an easy way of keeping abreast of developments both by the PSI and more generally in the fields of structural genomics and structural biology. It is a regularly updated portal to research data and other resources from the PSI, with NPG providing a monthly update with synopses of important research advances, recent additions to a categorized library of research articles, as well as news and events in structural biology. You can register to receive a monthly email newsletter and subscribe to RSS feeds. NPG resources and publications relevant to the Protein Structure Initiative can be accessed here.