« Nascent Will Be Down At The Weekend | Main | Richard Charkin's Blog »

Comparing Wikipedia and Britannica

This week's issue of Nature has a news feature that reports on an experiment our journalists conducted to discover how reliable the scientific content of Wikipedia is compared to Britannica's.

They sent 50 pairs of Wikipedia and Britannica articles on scientific topics to recognised experts and, without telling them which article came from which source, asked them to count the numbers of errors (mistakes, misleading statements or omissions). Among the 42 replies, Britannica content had an average of just under 3 errors per article whilst Wikipedia had an average of just under 4 errors — not as much difference, perhaps, as most people would expect. There is also an accompanying Nature editorial.

We also took the opportunity to interview Jimmy Wales about this when he came to visit the other week. You can hear the results in the 15 December 2005 Nature Podcast.

Going through the subject-by-subject results, I make the final score 22-10 in favour or Britannica, with 10 draws. The total numbers of errors are 123 in Britannica and 162 in Wikipedia (which therefore has just over 30% more errors than it's traditional rival).

Of course this only covers scientific content, and it doesn't look at every aspect of an encyclopedia's value (e.g., timeliness, depth, the quality of its references), but I think it's a useful addition to the less rigorous and anecdotal information that seems to have fuelled the debates about Wikipedia so far.

If you believe that an encyclopedia should be judged by it's weakest entries (in general I don't), or if you're the subject of an error or slur (thankfully I'm nowhere near famous enough), then the anecdotal outliers might be more important to you than averaged results. But most readers simply want to know whether a source can generally be relied upon. What these results say to me is that Wikipedia isn't bad in this regard — and that if it's really important to get your facts right then even Britannica isn't completely dependable. (In fact, I'm not sure that anything is.) Put this together with the fact that Wikipedia is generating content at many orders of magnitude lower cost than traditional encyclopedias and you've got yourself a classic Christensenesque disruption in reference publishing.

A key outstanding question is whether or not Wikipedia can ever surpass Britannica in quality. Since it evidently already does in some subjects, I think the answer is yes, but we will have to wait and see. Frankly, I still can't get over the fact that it works at all.

Postgenomic TrackBack

Similar items from Scintilla

TrackBack

Listed below are links to weblogs that reference Comparing Wikipedia and Britannica:

» Comparing Wikipedia and Britannica from Wolverton Lab
Two fascinating things about this: 1. Nature (the journal) has a blog? Wow. I’m shocked/impressed/intrigued. 2. It looks like Wikipedia is in the ballgame with respect to accuracy, at least for “scientific” articles. Nascent: Comparin... [Read More]

» links for 2005-12-16 from take one onion
Internet encyclopaedias go head to head : Nature A comparison between Wikipedia and Encyclopedia Britannica showing that they have pretty similar error rates (tags: wiki social science research comparison nature npg wikipedia encyclopedia_britannica) ... [Read More]

» 维基百科 VS 大英百科 from 半亩塘闲话

就在维基百科刚刚受到质疑其准确性问题时,《自然》杂志对40多篇维基百科条目与大英百科全书的相应条目进行了时,《自然》杂志对40多篇维基百科条目与大英百科全书的相应条目进行了 [Read More]

Comments

Science articles? Is that really a fair test? 50 biographies, or geography/history articles, might have shown a bigger difference.

I can understand why science articles were chosen (no subjectivity about whether something is accurate or not), but given the nature of Wikipedia, and its highly technical audience, strength in science is to be expected.

The link for "accompanying Nature editorial" appears to be a subscriber only link.

You're right, Christopher. Thanks for pointing that out. I'm trying to see if I can do something about it.

It's true that the outcome would probably have been different had they sampled history articles. But note that even this sample of science articles contained a good number of biographies, and Wikipedia performed on par with EB on those.

Also, had they sampled articles about information technology, current events, political controversies or popular culture, the outcome might have been more in Wikipedia's favor.

Wikipedia would very much like to correct any errors that Nature's referees found. All of the listed articles (except those with 0 errors) have been labelled as problematic. Is there any way that Nature's editors can publish or summarize the specific errors found?

See also Wikipedia's page on this Nature review.

One disappointing aspect of this comparison, for me, was that it was not controlled for article length. Errors per word might have been a more meaningful statistic.

Wikipedians have been very interested in this comparison, and are working hard to find the errors and improve the articles in question (along with all of the other 865,000 articles).

Some have put together a "quick table comparing the EB and Wikipedia file sizes for the articles. In Wikipedia's case, we selected the displayed text [...], removed all tables/external links/see also/references/etc, and we used the versions prior to the Nature article's publication. One result of interest: our 2nd largest article on the list, Robert Burns Woodward, had zero errors. That same article is EB's smallest article, and has 3 errors.

"Result: WP averages about 1 error every 2 kilobytes, and the average article size is 6.80 KB. Britannica averages about 6 errors every 2 KB, and the average article size is 2.60 KB."

You may find it interesting to read about Wikipedia's responce to your article here.

Steven, an excellent question. We're trying to see if we can publish the full list of errors found by our reviewers, or least send them to you (and to Britannica if they want). We'll post an update here as soon as we have a firm answer.

As for the Wikipedia page on the Nature study, wow! :)

Christopher, the Nature editorial is now open to non-subscribers too.

I'd like to thank all those involved with this comparison. It's very interesting to see such an excellent critique of our articles, especially when viewed alongside those of a professional Encyclopedia organisation.

I hope that most of the errors are simple, anonymously-introduced ones that we can quickly correct. If you can make the reviews available to us then we would be more than grateful, but even without those details we appreciate your investigations.

Regarding Maeve's comments about article size, unfortunately the comparison he/she refers to is erroneous. The "Robert Burns Woodward" article is not Britannica's shortest article — the person doing that comparison apparently didn't notice that Britannica breaks up long biographies into sub-articles, and hence only counted the introductory paragraph.

On the other hand, the counts here (errors/word) do count the "entire" article, and the Wikipedia articles currently have about twice as many words, on average, as Britannica.

One problem, of course, is that we don't know which versions of the articles were compared by Nature, and furthermore some topics have entries in multiple places. However, there are still some things that mystify me — Nature claimed that it compared articles of "approximately the same length". However, for example, the Wikipedia article on the West Nile virus has been over 1000 words for a year now, while the Britannica article of the same title is only 250 words, almost 5 times shorter. How did Nature compare these two?

More detailed information about the review process used, and a full list of the errors identified, is now available from Nature.com here (Word file).

For those who prefer a PDF version, I've put one here.

If you can't "trust" wikipedia, you can still use it. The references are usually elaborate and often peer reviewed. So, if you're worried that you can't cite wikipedia as a reference in your work, then you can sure site one of its references!

In regards to the comparative accuracy of wikipedia to encyclopedia Britannica the cause of misinformation in either source will mainly be at the result of human error or personal inference on any given topic. The Achilles heal of the wiki is the publics belief that there is a malevolent force out there who’s aim is to provide others with misinformation. Likewise those who are in support of Britannica use the armament that a sentence printed on paper and published in a book is infallible. Time and again throughout history cartographers have falsified maps going to print replacing the names of some remote region with their own name or that of a loved one. Further more it needs to be recognized that sources of information and the setting in which they are distributed ultimately have only marginal impact on the accuracy of well intended information. For example a recent anecdote has come to my attention where a public school teacher and a student got into a heated debate in the classroom with the instructor insisting that a kilometer was of greater length than a mile. In a letter to the students parents the teachers closing paragraph read…
“Although he was correct, Alex’s actions show a blatant disregard for authority, and a complete lack of respect for his school. In the future Alex would be better off simply accepting my teachings without resistance”

While this comment is staggering in of its self it makes it easy to loose focus on the real issue which is of all the facts that any person knows a certain percentage of them are in reality incorrect despite societies best effort to educate.

Post a comment

Comments will be reviewed by the editors before being published. You can be as critical or controversial as you like, but please don't get personal or offensive. We strongly encourage you to use your real, full name. Email addresses are useful in case we need to discuss your comment with you privately, or notify you in case we decide not to publish your comment. Email addresses will not be made public on the blog.

We have designed this blog to be as accessible to as many people as possible. If you are having difficulty leaving a comment because of the graphical security code below, please send your comment to 'nascent at nature.com'



"Nascent Web publishing efforts have their genesis in a burning need to say something, but their ultimate success comes from people wanting to listen, needing to hear each other’s voices, and answering in kind."
Rick Levine
The Cluetrain Manifesto

Subscribe

Subscribe to this blog's feeds:

[What is this?]

The Life Scientists on FriendFeed

Recent Comments

Out of 293 total comments.
The most recent three were on:
Powered by
Movable Type 3.2