Do longer papers gather more citations?

A longer paper gathers more citations : Nature News

From Nature 455, 274-275 (2008):

Researchers could garner more citations simply by making their papers longer, a study seems to imply (K. Z. Stanek, Preprint at https://arxiv.org/abs/0809.0692; 2008).

In an analysis of 30,027 peer-reviewed papers published between 2000 and 2004 in top astronomy journals, astronomer Krzysztof Stanek of Ohio State University in Columbus found that the median number of citations increases with the length of the paper — from just 6 for papers of 2–3 pages to about 50 for 50-page papers.

There is, however, a limit to the benefits of size: citations start to tail off when papers reach lengths of 80 pages or so, perhaps because fewer people have the stamina to read them.

It is unexpected, says astronomer Jörg Dietrich of the European Southern Observatory headquarters in Germany, who recently conducted a similar analysis and found the same results but didn’t publish them. “I expected that shorter papers would be cited more than longer ones,” he says. “I assumed that people don’t have the time to read long papers.”

Papers of about 4 pages — the length of Letters in Astrophysical Journal and Astronomy and Astrophysics, which report brief summaries of work that is usually published in more detail later — fare better than papers 5–10 pages long. But brevity offers no such benefit for papers in the other two journals considered, Astronomical Journal and Monthly Notices of the Royal Astronomical Society, which do not have Letters.

The zeroth theorem of the history of science

Andreas Trabesinger, a senior editor at Nature Physics, writes in News and Views this month (Nature Physics 4, 677; 2008):

“The past is by no means definite. It is rather open”, wrote the German historian of science Ernst Peter Fischer in Die Welt on 24 July 2006. In his column, Fischer introduced the “zeroth theorem of the history of science”; a discovery named after a person, the theorem says, did not originate from that person.

Take, for example, Avogadro’s number, named after Amedeo Avogadro, who asserted that there is the same number of molecules contained in a given volume of any gas at the same temperature and pressure. However, it was not the Italian savant who first estimated that number, but the Austrian scientist Johann Josef Loschmidt. Indeed, German-language texts sometimes refer to the number 6.022 1023 as ’Loschmidt’s number’. Much depends on who tells a story, and where and when. Fischer sees his zeroth theorem as an invitation to look with fresh eyes at the history of science, and in particular at how discoveries got their names.

That thought has now been picked up by J. David Jackson (Am. J. Phys. 76, 704–719; 2008). He has explored five examples from physics that illustrate Fischer’s zeroth theorem, and discusses the broader issue of credit-giving, and where it gives rise to inappropriate attributions. Jackson’s five examples take in various areas of physics, from the Dirac delta function to the Weizsäcker–Williams method of virtual quanta, to the Bargmann–Michel–Telegdi equation of spin dynamics. The journey includes encounters with big names such as Enrico Fermi or Nikola Tesla, but also with physicists whose biographies are far less commonly known, such as Oliver Heaviside, Llewellyn Thomas or Emil Wiechert. Their names are famous in some specific contexts, but little is known about their complete works."

Read more of the article at this link.

Citation patterns in geoscience

Nature Geoscience’s September editorial (1, 563; 2008) broaches the subject of impact measures. From the editorial:

The ripples of the revolution in science evaluation have long reached the relatively uncompetitive backwaters of the geosciences. Indeed, Nature Geoscience received questions regarding its likely future impact factor before it was even accepted into Thomson Scientific’s Web of Science in April this year. So here are a few thoughts on the topic from us, long before our own impact factor (due in 2010) may skew our perspective.

Citation patterns vary hugely between disciplines. The impact factors of Nature and Science have ranged between 26 and 32 in the past few years. But a quick estimate, based on a sample of papers, suggests that geoscience papers in these journals score an impact factor of around 15 when evaluated on their own. This is high considering that the impact factors of journals publishing exclusively geoscience research have not exceeded 5 in the past several years. But far higher citation counts in the biological sciences drive up the statistics of journals that publish across disciplines.

The timescales of the publication cycle in a field determine a journal’s impact factor. These are defined as all citations in one year to citable content published in the two preceding years, thereby excluding all references more than two years from publication. This can be problematic for the slower-moving sciences. For example, the ten most cited papers in Geology in 2004 were collectively referred to about 1.5 times more often in 2007 than in 2006 — citations that have never entered the index.

For geoscientists, taking guidance from impact factors alone would mean favouring interdisciplinary journals (whereas many biologists would, for the same reasons, favour their own disciplines). It would also lead to reading preferentially short-lived, quickly cited papers over those that develop more slowly — not necessarily a good idea. Other more time-consuming ways of assessing quality are therefore needed to supplement the quick and easy number check.

Mathematicians report on use and misuse of citation statistics

The International Mathematical Union has released a report on the use of citations in assessing research quality. The report, Citation Statistics, is written from a mathematical perspective and strongly cautions against the over-reliance on citation statistics such as impact factor and h-index. The belief that these parameters are accurate, objective and simple, is unfounded.

It states that the objectivity of citations is illusory because the meaning of citations is not well-understood. Its meaning can be very far from ‘impact’. Although having a single number to judge quality is indeed simple, it can lead to a shallow understanding of something as complicated as research. Numbers are not inherently superior to sound judgments.

The report, written by mathematicians, promotes the sensible use of citation statistics in evaluating research and points out several common misuses of this widespread application of mathematics. The authors of the report recognize that assessment must be practical and that easily-derived citation statistics will be part of the process, but caution that citations provide only a limited and incomplete view of research quality. Research is too important, they say, for its value to be measured with only a single coarse tool.

(This is a precis of the press release accompanying publication of the report, see links above.)

Further discussion of the report, together with other matters related to citation and quality metrics, is taking place online at the Nature Network Citation in Science group, which all are welcome to join.

2007 Journal Impact Factors are announced

The 2007 Impact Factors are now out (published on 17 June 2008). The ten Nature Publishing Group journals with the highest Impact Factors are as follows:

1 NAT REV MOL CELL BIO 31.921

2 NAT REV CANCER 29.190

3 NATURE 28.751

4 NAT REV IMMUNOL 28.300

5 NAT MEDICINE 26.382

6 NAT IMMUNOLOGY 26.218

7 NAT GENETICS 25.556

8 NAT REV NEUROSCI 24.520

9 NAT REV DRUG DISCOV 23.308

10 NAT BIOTECHNOLOGY 22.848

The Impact Factors of the Nature journals that publish original research are:

1 NATURE 28.751

2 NAT MEDICINE 26.382

3 NAT IMMUNOLOGY 26.218

4 NAT GENETICS 25.556

5 NAT BIOTECHNOLOGY 22.848

6 NAT MATERIALS 19.782

7 NAT CELL BIOLOGY 17.623

8 NAT NEUROSCIENCE 15.664

9 NAT METHODS 15.478

10 NAT NANOTECHNOLOGY 14.917

11 NAT PHYSICS 14.677

12 NAT CHEM BIOLOGY 13.683

13 NAT STRUCT MOL BIOLOGY 11.085

(Nature Photonics and Nature Geoscience are not old enough to have been awarded an Impact Factor this year.)

Readers can create their own lists of journals by subject area, title, Impact Factor or publisher, at ISI Web of Knowledge.

There is a free-access account at the ThomsonISI website which explains how the Impact Factor for journals is calculated.

Discussion of the 2007 Impact Factors, and of citation in science in general, is taking place at the Nature Network Citation in Science group, which you are warmly invited to join.

Navigating the geography of citation indices

Debbie Chaves of Wilfrid Laurier University, Waterloo, Ontario writes in Correspondence in the current issue of Nature (Nature 453, 719; 2008):

In his Correspondence ‘Hall and Keynes join Arbor in the citation indexes’ (Nature 452, 282; 2008 – see also Nautilus), Daniel Postellon describes the distinguished careers of Milton Keynes, Walton Hall and Ann Arbor. In the last case, I note that Professor Arbor has an h-index of 1 from the Web of Science database provided by Thomson Scientific’s ISI Web of Knowledge. This is based on her five citations for the year 2007: two articles, two letters and one abstract.

An author search in the Web of Science reveals that Chevy Chase MD (not to be confused with Chevy Chase, Maryland) has co-authored a letter with Howard Kaplan (H. Kaplan Am. Sci. 96, 3; 2008). My own institution, Wilfrid Laurier University, is also an author (S. Cadell et al. J. Palliat. Care 23, 273–279; 2007).

Irrespective of how these errors are created, the rising use of systems in which citation information moves directly from the search of a database or citation index to a bibliographic management system, and then into a reference list, means that inexperienced students and researchers who are not savvy enough to detect these errors will propagate them further.

Vigilance is required by all users of citation indexes and databases.

Nature Neuroscience on web traffic and citations

The June editorial in Nature Neuroscience (11, 619; 2008) discusses the relationship between web traffic and citations. The journal’s preliminary analysis indicates that the number of downloads a paper receives immediately following its appearance online correlates very well with its citation frequency years after publication. Noah Gray, one of the Nature Neuroscience editors, has written a post at Action Potential, the journal’s blog, to provide more of the details behind the data and analysis, and to initiate discussion. He writes (edited for length):

Everyone has their own pet problem with impact factors, but despite these concerns, these numbers are typically used to rate the importance or prominence of a particular journal, and thus by proxy, the importance of the individual papers published within. This is a seriously flawed use of association, leading scientists to often equate the total number of citations with scientific impact, which can be fraught with problems. Searching for an alternative measure of impact that is perhaps free of the “bias of authority” (citing a paper because it is from a famous lab) or the “lemming bias” (citing a paper just because everyone else seems to do so whenever broaching a particular subject) led us to explore readership….

The “number of downloads” measure potentially provides a piece of an alternative solution for deciphering the impact of an individual paper. In this current scientific climate where tenure and grant funding decisions are influenced by flawed metrics like impact factor, it is important to make good use of all available technology in an attempt to realize a better system of measuring the scientific impact of any particular paper. This analysis is obviously preliminary and flawed in its own ways, but perhaps metrics such as paper downloads can find a place in a compilation of aggregated stats, painting a more accurate and informative picture of manuscript influence.

The Nature Neuroscience editorial.

The Action Potential post and discussion.

Nature Network Citation in science forum discussion.

Nature Network Citation in science group homepage.

Futher reading: Connotea bookmarks “citation”

Further reading: Connotea bookmarks “impact factor” : thanks to Bob O’Hara for this library.

Surnames and citation indexes

Biji T. Kurien of the Oklahoma Medical Research Foundation writes (Nature 453; 450; 2008):

The Correspondence ‘Give south Indian authors their true names’ (Nature 452, 530; 2008) and earlier News Feature ‘Identity crisis’ (Nature 451, 766–767; 2008) are highly relevant to calculations of PubMed citations and h-index (the number n of a researcher’s papers that have received at least n citations).

For example, I used to use the south Indian form of my name: T. Biji Kurien, with Biji being my personal name. I have seven publications cited incorrectly in PubMed as being by ‘Kurien, T. B.’, ‘Bijikurien, T.’ or ‘Kurien, B.’. Four of these entries were cited often enough to be counted towards my h-index computation. As I had by then changed my name to conform with Western style, these publications unfortunately do not appear in the Web of Science or PubMed under my current name format. Consequently, my h-index ranking has fallen by 25% .

It is of paramount importance to adhere to a consistent name pattern right from the start, in order to maintain a correct list of publications in the public databases as well as the right h-index rankings.

Prabhu B. Patil, of the Centre for Cellular and Molecular Biology, Hyderabad, writes in the same issue of Nature (453, 450; 2008):

The Correspondence ‘Give south Indian authors their true names’ (Nature 452, 530; 2008), incorrectly states that people from the south do not traditionally have surnames.

I am from southern India and have a proper surname — as do all the families in my region. Besides Patil, surnames such as Naidu, Reddy, Rao and Gouda are common in the different states of southern India. One of the authors of the Correspondence has the surname Kutty.

Surnames have widely fallen into disuse because our fathers and forefathers avoided using them to prevent discrimination on grounds of caste.

It doesn’t make sense in this case to use only an author’s first name in scientific publications and to devise a special system to accommodate a different naming format. Instead, editors should encourage these authors to revive the use of their surnames.

Comments are welcome here and at Indigenus, the blog of Nature India.

Hall and Keynes join Arbor in the citation indexes

Daniel C. Postellon of the Helen DeVos Children’s Hospital, Michigan, writes in Nature’s Correspondence pages (Nature 452, 282; 2008):

The career of the non-existent author Ann Arbor is well-known to connoisseurs of computerized databases and citation indexes. Usually listed as the last author, she is sometimes credited with the academic degree “MI”. Ann is not actually a person, but the city of Ann Arbor, Michigan, home of the University of Michigan. Her ‘degree’ is a misinterpretation of the abbreviation for Michigan: MI. She pre-dates online computerized databases, and was often listed in the paper edition of Index Medicus.

Ms Arbor now has a UK rival in the team of Walton Hall and Milton Keynes. Like her, they are usually listed as last authors. The online database Google Scholar lists them as co-authors of 46 publications, in addition to their solo work. Walton Hall is actually a building on the campus of the Open University in Milton Keynes. These ‘authors’ have a useful role to play: they can be used to check the accuracy of the databases and indexes.

Language and languages of science

Martin Fenner, on his Nature Network blog Gobbledygook, notes that The Deutsche Ärzteblatt , the official journal of the German Medical Association, will from this month be publishing an English version. The reason? So that the journal is more clearly indexed in databases such as PubMed, hence available to more readers, leading to more citations of journal articles, a better Impact Factor, and enhanced reputation of the journal. Martin’s opinion is that although German was once an important scientific language, today only 2 per cent of articles indexed in Medline are in the language. “In the end”, he writes, " it makes the exchange of ideas between scientists much easier if we can all use the same language. And Nature Network is a good example for this."

In the stimulating discussion arising from the post, Nicolau Werneck comments that “to this day there are a bunch of interesting words and expressions from German that came into the international scientific jargon in the last 2 centuries, such as gedankenexperiment, eigenvector and gestalt…We must fight. But not to forbid people from talking in english, or other imperialistic arrogant language, and certainly not to make them speak only in English. We must fight for the plurality of languages.”

Nicholas Wigginton’s view is that of someone considering a postdoc in a country where English is not an official language. “Although the science that the groups I am looking into publish everything in English, some operate their labs in the national language whereas others prefer their science to be done exclusively in science. I find this very interesting.”