« May 2007 | Main | July 2007 »

June 29, 2007

Foo and beyond

Exactly a week ago I was on my way to O'Reilly's Foo Camp in Sebastopol, so it's about time I posted some notes about a few of the amazing people I met during and since that event:

  • Beth Noveck told me that Peer to Patent, a wonderful project that she and others have been working on for many months, is going really well in it early days post launch. The aim is to open up patent applications to public scrutiny, which sounds like an excellent idea to me. Brady has a write-up about it on Radar.
  • Manolis Kelaidis, fresh from his triumph at the TOC conference demoed bLink. I first met Manolis last September, when he came to present his invention at Macmillan (Nature's parent company). He's a very creative and thoughtful (but self-effacing) guy, and I'm delighted that bLink is getting the attention it deserves.
  • Chris DiBona, who among many other things runs the team at Google that puts together SciFoo, was proudly showing off his iLiad, especially the Linux shell function. (Hey, this was Foo Camp.)
  • Some of the science-oriented attendees — Drew Endy and Saul Griffith spring to mind — were kind enough to rave about Nature Precedings. Even at Camp, Drew was busy working away on his latest amazing project, the BioBricks Foundation. He also had a test tube of E. coli that smelled of bananas. (Hey, he is a synthetic biologist.)
  • Speaking of science, I also met Marti Hearst from UC Berkeley, who's working on BioText, figure-based searching for the scientific literature — great!
  • There were plenty of other amazing people, of course, not least the O'Reilly crowd themselves, especially Tim O'R and Sara Winge, the people who make Foo Camp happen. (And SciFoo too.)

I also bumped into Ray Ozzie and told him about the great discussions we've been having with Tony Hey's group at Microsoft (known as 'Technical Computing', they run MS's collaborations and interactions with scientists and engineers). It was an extra delight, then, to meet Savas Parastatidis on London on my return. I can't remember the last time I came across someone outside Nature with whom I shared so many interests and opinions. We'll definitely be doing some cool stuff together, and we'll keep you posted.

Last and least, I have an article in this month's STM News (a periodical for science, technology and medical publishers). The full publication is members only, so my draft is reproduced below for anyone who's interested. Appropriately enough, it's basically a summary of the ways in which the O'Reilly alpha-geek crowd has influenced our activities at Nature.

The Web Opportunity

The web is the most disruptive influence on publishing since the invention of moveable type in 1450. It is often viewed by publishers as a profound threat (think of Amazon, Google, Wikipedia and craigslist). But as a techno-optimist, I see threats merely as poorly understood or unexploited opportunities. Indeed, I couldn't imagine working in publishing if it weren't undergoing a revolution. These are not the worst of times but the best of times: how dull the last 550 years have been by comparison.

Publishers exist to enable the flow of information between people. For those of us in STM, this communication expands the frontiers of human knowledge and gives rise to new technologies like the web itself. It therefore enriches all our lives and is of fundamental importance. And now we have at our disposal the most powerful information dissemination tool in human history. If that doesn't make you feel excited and empowered then nothing will. This article will focus on three areas that I believe hold great promise for online scientific communication: audio-video content, databases, and social software.

Audio-video content

The iPod has become the technological icon of the first decade of the 21st Century. This fact has certainly helped to propel podcasting into the cultural mainstream with unusual speed. The term "podcast" wasn't even coined until 2004, but within a couple of years it had become part of everyday life for millions of people. Not only did it provide the convenience of time-shifted audio delivered direct to a mobile player, and an almost infinite variety of content to choose from, but with those white cords dangling from their ears even people like me can kid themselves that they look cool.

Yet that's only part of the story. The real reason for the huge expansion in web-based audio is the dramatically falling cost of the hardware and software required to create professional-sounding audio. Only a decade ago this required investments on the order of tens of thousands of dollars. Now, with a low-end desktop computer, a twenty-dollar microphone and a piece of software costing anywhere between nothing and $350, anyone with a bit of skill and patience can compete with the best in the world. That means dorm-room students and basement bloggers, but it also means established publishing companies with long histories of putting out nothing but written content. Welcome to the era in which you too can be a broadcaster.

Why would you want to? Because audio complements the written word in many compelling ways. Take the Nature Podcast as an example. We began this as a three-month trial in October 2005. By the end of that year it was receiving around 30,000 downloads a week and had secured a major sponsor that enabled us to cover the production costs. It also proved valuable to our listeners and to us. Feedback indicated that researchers liked hearing the author interviews because it gives them an insight into reports from outside their fields that they would never normally read in the journal. It also allows them to connect with these scientists as people, unfiltered by the formal, passive style of research papers. (Needless to say, authors also love being given a platform to talk about their work in front of tens of thousands of fellow researchers.) More prosaically, the show enables researchers them to make more productive use of their time. "Please make your shows longer," pleaded one listener, "I have a lot of microscope time." And this hints at one of the benefits to us as the producers: it allows us to connect with scientists and clinicians during times when they could not possibly read one of our journals or browse our websites – for example, when conducting experiments, commuting or exercising. Of course, as well as serving our main constituency of established professional scientists, podcasts also enable us to reach a broader, younger audience than we traditionally encounter. This, in turn, helps to strengthen and rejuvenate our brand.

Video shares similarities with audio, but also has some important differences. There, too, falling hardware and software costs have put the tools of professional-quality production into the hands of almost anyone who cares to use them. But video is not just audio with pictures – it is consumed in very different ways (the main reason that television did not kill radio). In particular, video is not easily consumed while conducting an experiment or driving.

Most obviously, it is useful in conveying certain types of scientific concepts – from displays of animal behaviour to animations of cellular processes (see this amazing example). But perhaps its biggest practical effect in science will be the sharing of experimental protocols not only by detailed step-by-step instructions but also by watching someone else carry them out. Just as cookery benefits from having recipe books and TV chefs, so scientific methods will be more accurately replicated and more quickly refined when written details are supplemented by video. The Journal of Visualized Experiments (JoVE) is an interesting early example of this approach.

Databases

The world of online scientific information can be conveniently divided into two largely separate but complementary realms, and only one of them has much to do with publishers. On one hand we have the journals — traditional repositories of received scientific wisdom and now almost all available online at the click of a mouse. On the other we have databases, which in some disciplines have become more important in the everyday information needs of scientists than any traditional publication ever was.
Particularly in biology, 'cottage industry' science is giving way to an 'industrial' model with greater specialization and global inter-lab collaboration. Whereas before everything — from data collection through analysis to manuscript writing — was done by a small group of people in one lab, now we see different research groups specializing in each stages of the process. Witness, for example, the many genome sequencing initiatives, which are really enormous data acquisition operations, with most of the analysis left to others.

Databases are the conduits that enable such collaborations, and though they often seem not to realise it, publishers have important roles to play. That's why we at Nature have teamed up with a variety of groups — from NIH-funded initiatives like the Cell Migration Consortium and the Consortium for Functional Glycomics, to major publicly or privately funded organisations like the National Cancer Institute and the Allen Institute for Brain Science — to create a series of joint database-oriented community resources. None of these could have been created by any one organisation on its own. Nature's contribution varies by project, but typically involves some combination of editorial content (to provide primers and updates on the field), curation and peer-review of the database contents, and promotion of the service to those who might find it useful in their research. These are actually rather traditional publishing roles, albeit applied in a new context.

There is another more subtle reason for journal publishers to be interested in databases: the dividing line between the two realms is getting ever fuzzier, and may eventually disappear altogether. As journals have moved online, they have taken on some of the characteristics of databases (searchable, structured, constantly updated). Meanwhile, some databases are starting to mimic certain aspects of journals (peer-reviewed, archival, citable). This has led to the appearance of 'hybrid' publication that are both databases and journals depending on how you look at them. For example, the Molecule Pages, a collaboration between Nature and the University of California at San Diego, is a review journal covering several thousand proteins involved in intracellular signalling. But the information is held in a relational database, making it easy to query the data and represent it in numerous different ways; while being archival and citable, it is also continually update. I predict that we will see more and more such examples in future, because on the web we really can have the best of both worlds.

Social software

The web – and the Internet that underlies it – is not a traditional distribution channel, it's a many-to-many network. Most of the biggest online success stories — eBay, Wikipedia, craigslist — have harnessed this fact to create substantial communities and apparently unstoppable momentum. In short, they have used the network to create network effects of their own.

In some cases, even companies that on the face of it are ill-positioned to exploit this approach have seen it pay dividends. For example, Amazon started off as nothing more than a book catalogue and a piece of (very good) e-commerce software. But by adding user reviews, wishlists and other collaborative features — and using this information to serve their users better — they created what every provider of a website ought to be striving for: a service that gets better for everyone the more people use it. This concept is often considered to be at the core of the "Web 2.0" concept articulated by Tim O'Reilly. Similarly, Google overtook and dominated their rivals in search by turning the process of ranking into a massively collaborative one: suddenly anyone creating a link on the open web became a contributor.

The scientific web is replete with such opportunities (see figure below). These including blogs (e.g., ScienceBlogs), wikis (e.g., OpenWetWare), voting systems (e.g., DissectMedicine), file sharing (e.g., arXiv and Nature Precedings), social bookmarking (e.g., Connotea), social networks (e.g., Nature Network) and markets (e.g., InnoCentive), among others. It also includes virtual worlds, most notably Second Life, which, although at an early stage in their evolution, hold the same disruptive potential that the web had in the mid-1990s.

1.png

The idea that everyone can now do their own publishing, making publishers superfluous, is misguided. But publishers do need to adapt. Online communities don't just happen, they require initiators, motivators, organisers, moderators, summarisers and guides. They also need trust systems based on user identification and reputation. In many ways, these, too, are traditional publishing roles, but they require new skills. Writers and editors now need to double as moderators and hosts. Publishers need to become adept at mitigating gaming and spamming of their systems, and at monetizing web traffic rather than selling subscriptions. On top of that, they need to become better at cooperating — with each other and with other organisations outside the industry. This particularly applies to online interoperability (even horror of horrors, with competitors), which is a positive-sum game that can benefit all participants. CrossRef has blazed a trail in this area, and we should build in its success.

Above all, publishers need to be leading the online charge, not following the scientists we serve. We are the information dissemination experts, so if we aren't pushing the boundaries and testing what's possible in this new world then we're not merely missing out, we're also not doing our jobs. Cynics will point out that most apparent 'opportunities' are a long way from turning a profit, and many probably never will. They're right. Do any of the STM projects I've mentioned above make a lot of money? No. But are they representative of the future of scientific communication, and do they provide a platform on which to build information businesses of the future? You'd better believe it.

June 25, 2007

Allen Brain Atlas: New features for search, browsing and data mining

News from the Neuroscience Gateway: The Allen Institute for Brain Science has released an upgrade to the Allen Brain Atlas offering improved browsing and navigation and enhanced data mining. Updates to the Atlas application include:

* NeuroBlast, a blast search tool that allows users to easily retrieve a list of genes with expression patterns similar to a gene of interest.
* Easy Browsing and Quick View options that allow users to quickly access and browse raw data and data summaries by gene or brain structure.
* Improved navigation allows users to synchronize raw data images with corresponding anatomic reference plates from the Allen Reference Atlas.

The Neuroscience Gateway, a comprehensive source for the latest research, news and events in neuroscience and genomics research, is a collaboration between the Allen Institute for Brain Science and Nature Publishing Group. The Allen Brain Atlas is a freely available scientific resource developed by the Allen Institute, which provides maps of the expression of approximately 20,000 genes in the mouse brain. Together, the Neuroscience Gateway and the Allen Brain Atlas are new tools to help researchers navigate neuroscience and genomics research. See more details, please visit the Neuroscience Gateway.

June 21, 2007

Science in Virtual Worlds

On Tuesday evening, I was invited to speak on the panel at the Science in Virtual Worlds event at the Apple Store in Regent Street, the first Royal Institution event with Nature Network London.

It was an excellent turn out, with standing room only by the time it kicked off, and Dave Taylor from the National Physical Laboratory started the evening off with a survey on who in the audience had been into Second Life. Probably about a fifth of the audience had, which is far higher than any event I have ever attended before, but I think still small considering the amount of recent publicity. Dave introduced Second Life with a very interesting whistle-stop tour of the state of science within Second Life, covering all the highlights including the International Spaceflight Museum, the NASA CoLab HQ and the NOAA island with interactive tsunami, glacier and hurricane exhibits. Dave also looked to the future, introducing the virtual continent of SciLands and Imperial College's work on a virtual hospital for their medical school.

The second panelist was Aleks Krotoski, Guardian columnist, digital strategy consultant and PhD student amongst other things. Her PhD concerns social networks in cyberspace, and her subject for the evening was entirely new to me: the ethics of conducting social science research in virtual worlds and online environments. She raised all sorts of interesting questions about privacy, online identities, the association between real identities and avatars and asked to what degree is it OK to pretend to be something you're not in the interests of research? In the real world, people-observation is not very difficult: just sit in a cafe and keep your ears open. In virtual worlds, some of the barriers are removed and curious residents will soon begin to ask you about yourself. When is it OK to lie, to what degree will revealing the truth influence the behaviour you are trying to observe, and what can you do with the information you receive? Unsurprisingly there are no easy answers, but it was a very interesting discussion of an issue of which I think many of us remain blissful unaware.

I concluded the talks by talking a little about Second Nature and attempting to see the future of science in virtual worlds by looking at some of the biggest current success stories and where they might lead. The Virtual Hallucinations house drew much interest as always and the Ecosystem Working Group's attempts to create a stable, "living" ecosystem in Second Life sparked an interesting discussion on diagnostics in virtual worlds.

There was a lot of good discussion in the Q&A sessions which continued after the end of the event, highlighting the collaboration potential of virtual worlds as a crucial aspect for scientific research, but I'm sure this is a subject we've barely scratched the surface of. Were you at the Apple Store? I'd love to hear your views on the event itself, or any thoughts on the use of virtual worlds in science: are we wasting our time? Are we going to change the world? Is Second Life going to turn the whole world into a giant lab with secrecy a thing of the past? Answers on a postcard, in the comments or to the Nature Network Second Life group

Overall a very enjoyable and thought-provoking evening: thanks very much to Jonathan from the RI for organising it and Paul Carr for chairing. Further coverage of the event in The Telegraph, The Times and The Guardian podcast

June 20, 2007

A new Nature website on Stem Cells

Nature has just launched two new websites, Nature Reports Stem Cells and Nature Reports Climate Change, which both aim to be lively places for experts and non-experts alike, providing the news behind the science and science behind the news of each subject.

Since I'm the publisher for the stem cell site only, I'll concentrate on that. Right now it mostly contains editorial material -- including stimulating commentaries about the ethics of stem cell research from notable experts such as Ian Wilmut, who lead the team that cloned the first mammal from an adult cell. There's also the first tranche of introductory material for those who need a primer on what stem cells are and why studying them can be so difficult and contentious.

There are already a few ways in which the community can interact with the site, via posting/commenting in the site's blog (the Niche) or by recommending, commenting and voting on research articles in the Journal Club. But we are working on introducing more interactive community resources to help the site to become a key community hub, giving information and voice to scientists and anyone else who has something constructive to add or contribute.

Please do take a look, tell us what you think, and tell your friends.

June 18, 2007

Nature Precedings is live

Our new preprint server and document-sharing service, Nature Precedings is now live, so please go and give it a try.

For background info, have a look at the O'Reilly Radar, where Tim has been kind enough to post an overview that I sent out earlier today.

One thing that we've already added since then (i.e., this afternoon) is a 'bridge' from our journal manuscript submission system to Nature Precedings. This allows NPG authors to submit their manuscripts for immediate pre-publication in Nature Precedings while they are being considered by the relevant journal. It's heartening to see people already beginning to use this (though as I write the system is misbehaving — please hang on in there while we get it fixed).

I'm gathering coverage of Nature Precedings in Connotea. There have been some unfounded initial concerns that Nature will have some special rights to the content, or that we'll be charging for some aspect of the service. On the contrary, all the content is released under the Creative Commons Attribution License and the service is free to authors and readers. In fact we're working with some of our partners to mirror the content to ensure it's long-term free availability (whatever might happen to Nature Publishing Group).

And what a great list of partners they are: the British Library, the European Bioinformatics Institute, Science Commons, and the Wellcome Trust. See our press release for their statements about the project.

We expect to add one or two more partners in the not-too-distant future, and convene a group of forward-thinking senior scientists to advise us on future development of the service. Right from the beginning, Precedings was conceived not as an NPG-only project but as a collaborative endeavour to open up scientific communication. To that end, we'll also be reaching out to other publishers in the weeks to come to ensure that this initiative works effectively alongside the existing journal publishing channel, which Precedings seeks to complement.

If you've got comment, please post it below, go to the Nature Precedings Group on Nature Network, or write to precedings-at-nature-dot.com.

June 14, 2007

Word 2007 and the STM Publisher Ecosystem

As the CTO of Nature Publishing Group, I have become involved in a very lively conversation with Microsoft staff about why Word 2007 is not being actively endorsed by STM publishers. It has recently come to Microsoft's attention (see blogs Murray Sargent and Brian Jones) that Nature ( http://www.nature.com/nature/authors/submissions/template/index.html ), Science ( http://www.sciencemag.org/about/authors/prep/docx.dtl), and many other scholarly publishers do not accept files authored in Word 2007. Both Science and NPG have been in correspondence with Microsoft staff on this important issue. The staff there have been very willing to engage in this conversation. As Inera is one of NPG’s main suppliers of Word macros (eXtyles) and a general expert on Word, I asked Bruce Rosenblum of Inera to enter the discussion. The following was sent to Microsoft on 12 June 2007 by Bruce Rosenblum to explain why this situation exists.

“Over the past 10 years, Microsoft Word has become the standard for almost all content authoring. As a result of Microsoft's success with Office, and the relative stability of the Office environment and DOC format over that time, third parties have built sophisticated applications to address specific vertical market requirements for integration of Word into highly efficient workflows.

eXtyles is one such application; eXtyles is a suite of editorial and XML tools for Word in wide use by scholarly publishers. But eXtyles is only one organism in the larger ecosystem of domain-specific applications dedicated to scholarly publishing. Other tools include online submission and peer review applications, and other applications used in the post-editorial production workflow.

Like eXtyles, most of the applications in this workflow ecology are not yet compatible with DOCX format. For example, I surveyed the four largest vendors of online submission and peer review systems this week, and none support DOCX files. Nor could any of the four provide me with a date when they expect to have native DOCX compatibility.

If you detect no sense of urgency to upgrade systems in this vertical market, you are not mistaken. For most scholarly publishers, the challenge is to publish high quality and accurate information on a regular schedule. Software upgrades to critical publishing systems, unless they are seamless or provide a significant immediate benefit, are often not a priority.

In the case of Word 2007, upgrading is not seamless. Because files incorporating OMML equations are not semantically backwards compatible with older versions of Word, publishers must update an entire ecology of systems before they can accept DOCX files. Completing such updates requires work with third parties, careful testing, training, and finally deployment -- often one system at a time -- of updated applications. All of this takes time.

In the mean time, because a DOCX file with OMML equations renders the equations as graphics when used with today's systems, it's easier for publishers to ask authors to refrain from submitting DOCX files until every part of the workflow ecology is DOCX-compatible. And not just updated to accept DOCX, but also updated so that OMML can seamlessly be integrated into systems today that provide publishers with full text XML and tagged math according to the NLM DTD or other 12083-derived DTDs.

Had the conversion from DOCX to DOC provided a conversion from OMML to Equation Editor format, it would have provided the necessary backwards compatibility for publishers to upgrade one system at a time. But because this compatibility is not available, it's created the need for a "big bang" upgrade, or a delay until the ecosystem of inter-dependent systems is deliberately updated over time. In the environment of scholarly publishing, such substantive upgrades often take years, not months.

I hope this post clarifies some of the core issues DOCX format presents scholarly publishers and explains Word 2007 issues that are cause for publisher upgrade reticence. Those of us in the scientific community look forward to a dialog to articulate scholarly publishing requirements to Microsoft so that Microsoft can provide products that serve the needs of the entire scholarly community.”

June 12, 2007

Scintilla

Yesterday we opened up a new site, Scintilla, to the public.

Scintilla is an aggregator—of science weblogs, news stories and publication databases—but it works in a slightly different way from the existing online RSS readers that cover the whole internet. For a start, the sources are manually selected, and only related to science, so there shouldn't be any trouble with spam when searching for stories. Also there's no 'unread items' count, so you don't have to feel like you have lots of reading to catch up on. Browse the site, add sources to your collection, and visit your 'Read' page on Scintilla whenever you're looking for some juicy science stories to read.

The other important feature of Scintilla is ratings and recommendations. As you're browsing through stories or papers, give them a quick rating: high if you think it's interesting and you'd like to read more like this, low if it's not your cup of tea. These ratings will be analysed alongside everyone else's and used to recommend stories that you might like (if other people liked the same stories as you, you'll hopefully like the other things they found interesting as well).

You can also manually recommend stories to other people: either to individual members of your social network, or to groups that you've joined. There are already groups for bioinformatics, pop science, images, visualisation and open science, but if your speciality isn't covered then feel free to start up a group and invite your colleagues to join. The more interesting people in your network, the better your recommendations will be :-)

A quick note: we sent emails about Scintilla to a selection of bloggers, but it's hard to find contact details for 600 or so weblogs. For everyone we didn't manage to contact directly, please join the mailing list/discussion group and/or send feedback to scintilla@nature.com

June 08, 2007

Coming Soon: Nature Precedings

As described in an editorial in this week's issue of Nature, Nature Precedings is a new service that:

will enable researchers to share, discuss and cite their early findings. It provides a lightly moderated and relatively informal channel for scientists to disseminate information, especially recent experimental results and emerging conclusions. In this sense, it is designed to complement traditional peer-reviewed journals, allowing researchers to make informal communications such as conference papers or presentations more widely available and enabling them to be formally cited. This, in turn, allows them to solicit community feedback and establish priority over their results or ideas.

Even better, it's completely free to readers and authors alike.

We'll be launching in the next few days. There will be a lot more to write about after that. In the meantime, Pedro was quick off the mark. I'm collecting this and other coverage of Nature Precedings in Connotea.

I must run now to catch my flight to London after a very busy week in Tokyo — sayonara.

Update: If you're interested in a sneak preview and have some suitable documents to share then drop a note to precedings-at-nature-dot-com, giving details of your name and affiliation.

"Nascent Web publishing efforts have their genesis in a burning need to say something, but their ultimate success comes from people wanting to listen, needing to hear each other’s voices, and answering in kind."
Rick Levine
The Cluetrain Manifesto

Subscribe

Subscribe to this blog's feeds:

[What is this?]

Recent Comments

Powered by
Movable Type 3.2