PubMedication: do you get your best ideas in the Pub? CC-BY-ND image via trombone65 on Flickr.

Many people claim they get all their best ideas in the pub, but for lots of scientists their best ideas probably come from PubMed.gov – the NCBI’s monster database of biomedical literature. Consequently, the database has spawned a whole slew of tools that riff off the PubMed name, with many puns and portmanteaus (aka “PubManteaus”), and the pub-based wordplays are very common. [1,2]

All of this might make you wonder, are there any decent PubMed puns left? Here’s an incomplete collection:

PubCrawlerpubcrawler.ie “goes to the library while you go to the pub…” [3,4]

PubChasepubchase.com is a “life sciences and medical literature recommendations engine. Search smarter, organize, and discover the articles most important to you.” [5]

PubFig nothing to do with PubMed, but research done on face and image recognition that happens to be indexed by PubMed. [7]

PubGetpubget.com is a “comprehensive source for science PDFs, including everything you’d find in Medline.” [8]

PubLons publons.comOK, not much to do with PubMed directly but PubLons helps you “you record, showcase, and verify all your peer review activity.”

PubMine “supports intelligent knowledge discovery” [9]

PubNetpubnet.gersteinlab.org is a “web-based tool that extracts several types of relationships returned by PubMed queries and maps them into networks” aka a publication network graph utility. [10]

GastroPub repackages and re-sells ordinary PubMed content disguised as high-end luxury data at a higher premium, similar to a Gastropub.

PubQuiz is either the new name for NCBI database search www.ncbi.nlm.nih.gov/gquery or a quiz where you’re only allowed to use PubMed to answer questions.

PubSearch & PubFetch allows users to “store literature, keyword, and gene information in a relational database, index the literature with keywords and gene names, and provide a Web user interface for annotating the genes from experimental data found in the associated literature” [11]

PubScience is either “peer-reviewed drinking” courtesy of pubsci.co.uk or an ambitious publishing project tragically axed by the U.S. Department of Energy (DoE). [12,13]

PubLick as far as I can see, hasn’t been used yet, unless you count this @publick on twitter. If anyone was launching a startup, working in the area of “licking” the tastiest data out of PubMed, that could be a great name for their data-mining business. Alternatively, it could be a catchy new nickname for PubMedCentral (PMC) or Europe PubMedCentral (EuropePMC) [15] – names which don’t exactly trip off the tongue. Since PMC is a free digital archive of publiclyaccessible full-text scholarly articles, PubLick seems like a appropriate moniker.

There’s probably lots more PubMed puns and portmanteaus out there just waiting to be used. Pubby, Pubsy, PubLican, Pubble, Pubbit, Publy, PubSoft, PubSort, PubBrawl, PubMatch, PubGames, PubGuide, PubWisdom, PubTalk, PubChat, PubShare, PubGrub, PubSnacks and PubLunch could all work. If you’ve know of any other decent (or dodgy) PubMed puns, leave them in the comments below and go and build a scientific twitterbot or cool tool using the same name — if you haven’t already.

In the last ten years, the Open Access movement has made huge progress in allowing:

“any users to read, download, copy, distribute, print, search, or link to the full texts of these articles, crawl them for indexing, pass them as data to software, or use them for any other lawful purpose, without financial, legal, or technical barriers.”

But there is still a long way to go yet, as much of the world’s scientific knowledge remains locked up behind publisher’s paywalls, unavailable for re-use by text-mining software and inaccessible to the public, who often funded the research through taxation.

Openly ironic?

Ironically, some of the papers that are inaccessible discuss or even champion the very Open Access movement itself. Sometimes the lack of access is deliberate, other times accidental – but the consequences are serious. Whether deliberate or accidental, restricted access to public scientific knowledge is slowing scientific progress [1]. Sometimes the best way to make a serious point is to have a laugh and joke about it. This is what the Open Access Irony Awards do, by gathering all the offenders in one place, we can laugh and make a serious point at the same time by naming and shaming the papers in question.

Serial offender and über-journal Science has an article by Elias Zerhouni on the NIH public access policy [3] (Subscribe/Join AAAS to View Full Text), another on “making data maximally available” [4] (Subscribe/Join AAAS to View Full Text) and another on a high profile advocate of open science [5] (Buy Access to This Article to View Full Text) Irony of ironies.

From Nature Publishing Group comes a fascinating paper about harnessing the wisdom of the crowds to predict protein structures [6]. Not only have members of the tax-paying public funded this work, they actually did some of the work too! But unfortunately they have to pay to see the paper describing their results. Ironic? Also, another published in Nature Medicine proclaims the “delay in sharing research data is costing lives” [1] (instant access only $32!)

From the British Medical Journal (BMJ) comes the worrying news of dodgy American laws that will lock up valuable scientific data behind paywalls [7] (please subscribe or pay below). Ironic? *

The “green” road to Open Access publishing involves authors uploading their manuscript to self-archive the data in some kind of public repository. But there are many social, political and technical barriers to this, and they have been well documented [8]. You could find out about them in this paper [8], but it appears that the author hasn’t self-archived the paper or taken the “gold” road and pulished in an Open Access journal. Ironic?

Last, but not least, it would be interesting to know what commercial publishers make of all this text-mining magic in Science [9], but we would have to pay $24 to find out. Ironic?

For added ironic amusement, take a screenshot of the offending article and post it to the Flickr group. Sometimes the shame is too much, and articles are retrospectively made open access so a screenshot will preserve the irony.

Join us in poking fun at the crazy business of academic publishing, while making a serious point about the lack of Open Access to scientific data.

* Please note, some research articles in BMJ are available by Open Access, but news articles like [7] are not. Thanks to Trish Groves at BMJ for bringing this to my attention after this blog post was published. Also, some “articles” here are in a grey area for open access, particularly “journalistic” stuff like news, editorials and correspondence, as pointed out by Becky Furlong. See tweets below…

After centuries with little change, scientific libraries have recently experienced massive upheaval. From being almost entirely paper-based, most libraries are now almost completely digital. This information revolution has all happened in less than 20 years and has created many novel opportunities and threats for scientists, publishers and libraries.

Today, we are struggling with an embarrassing wealth of digital knowledge on the Web. Most scientists access this knowledge through some kind of digital library, however these places can be cold, impersonal, isolated, and inaccessible places. Many libraries are still clinging to obsolete models of identity, attribution, contribution, citation and publication.

Based on a review published in PLoS Computational Biology, http://pubmed.gov/18974831 this talk will discuss the current chilly state of digital libraries for biologists, chemists and informaticians, including PubMed and Google Scholar. We highlight problems and solutions to the coupling and decoupling of publication data and metadata, with a tool called http://www.citeulike.org. This software tool exploits the Web to make digital libraries “warmer”: more personal, sociable, integrated, and accessible places.

Finally issues that will help or hinder the continued warming of libraries in the future, particularly the accurate identity of authors and their publications, are briefly introduced. These are discussed in the context of the BBSRC funded REFINE project, at the National Centre for Text Mining (NaCTeM.ac.uk), which is linking biochemical pathway data with evidence for pathways from the PubMed database.

Date: Monday 16th March 2008, Time: 12.00 midday, Location: Michael Smith Building, Main lecture theatre, Faculty of Life Sciences, University of Manchester (number 71 on google map of the Manchester campus). Please come along if you are interested…