Applied Mechanics News

Tuesday, July 25, 2006

The long tail of papers

In an entry on pay per paper, I alluded to Chris Anderson's new book, The Long Tail. It should be straightforward to collect page views or down loads or citations of individual papers in a journal. You can plot the numbers of hits of individual papers against the rankings of the papers. Here is the curve for articles in Slate. (Not sure why data stopped at top 500 hits. Why not go further to see a really long tail?) Hope someone in Applied Mechanics will show the same data for JMPS, IJSS, MOM, etc. It will be fun.

Here is the gist of Anderson's observation: If you care about the total sale, as a publisher might, then what matters is the area under the curve; the contribution of the tail may rival that of the head. This much is objective, and should not be controversial.

Now allow me to play a variation of the theme, which is admittedly subjective and possibly controversial. Let's say the net contribution of a journal to new knowledge is proportional to the area under the curve (the subjective part). Then numerous less cited papers may make a significant contribution comparable to the contribution made by the best cited papers.

If you are interested in this argument, you might as well generalize the analysis from a single journal to all journals in a field, or to all journals in science, engineering and medicine. I'm not sure if such a curve has ever been plotted, but the job should not be too hard.

Now, if you are an individual author, surely you'd like to have a lot of hits for your own papers, just as Anderson is celebrating his book becoming a best seller. However, if your job is to increase the total knowledge, as the NSF is set up to do, then you might as well pay as much attention to the long tail as to the tall head.


  • For a short version of "The Long Tail", see here.

    By Blogger Teng Li, at 7/25/2006 9:27 AM  

  • Also see the anatomy of the long tail here.

    By Blogger Teng Li, at 7/25/2006 9:29 AM  

  • Here is an update by Chris Anderson published in the July 2006 Issue of Wired. In fact, the whole July Issue of Wired is interesting.

    By Blogger Zhigang Suo, at 7/25/2006 10:28 AM  

  • This is a realm of usage statistics that is just starting to be realized.

    Although libraries have COUNTER and SUSHI, COUNTER stats aren't terribly granular. The statistics provided generally are at the journal level.

    The bX project from LANL is a much more promising development, as it mines the OpenURL link resolver for specific citations and their usage.

    Georgia Tech's soon-to-be-launched (frantic debugging before the start of the semester) Umlaut resolver keeps similar statistics (or, will before the end the week) and when personalization is turned on, will keep such atomic usage statistics about specific communities. We should be able to tell what articles Mechanical Engineering Faculty are looking at and make collection development decisions accordingly.

    By Anonymous Ross, at 7/25/2006 11:11 AM  

  • Dear Ross:

    Many thanks for the input. I've just learned that some companies collect data on both issue and article views (abstracts & full text) through metapress. This information is usually presented to the editor and editorial board members for their assessment and analysis.

    By Blogger Zhigang Suo, at 7/25/2006 11:51 AM  

Post a Comment

<< Home