Half the DNA on the New York City Subway Matches No Known Organism

The results of a massive new DNA sequencing project on the New York City subway have just been published. And, perhaps unsurprisingly, there's a lot of bacteria down there — though most of it is known to be harmless. What's really important, though, is what we don't know about it.

The PathoMap project, which involved sampling ticket barriers, benches, and keypads at 466 stations, found 15,152 life-forms in total, half of which were bacterial. The Wall Street Journal has created a fun, interactive microbial map of the underground out of the data, showing where on the lines the bacteria "associated with" everything from mozzarella cheese to staph infections was found.

But "associated with" is a pretty fuzzy term that runs up against the limits of science. In the past few years, genetic sequencing has become vastly more powerful and cheap, making metagenomic (multiple source) analyses possible. This means we can take all of the DNA in an environmental sample — human, plant, bacteria, cockroach, whatever — and sequence the hell out of it.

The problem, though, is that our genetic libraries are still incomplete. For example, if I don't know what the DNA sequences of a cockroach looks like, how can I know my DNA sequence belongs to a cockroach? That's how half of the DNA found in the project matched no known organism.

This is especially true when it comes to bacteria that are being discovered for the first time in these new metagenomic analyses. And what does "associated with," when it comes to bacteria, really mean? Maybe we found a certain bacterium on cheese once, but maybe we never sampled its true native habitat?

Even the best technologies we have now are ultimately crude tools to grope at a vast, unseen world. It maybe be hard to intuit whether it makes sense for Acinetobacter or Enterococcusto be on the subway, but the research team found plenty of non-microbial DNA, too, and a lot of it didn't make sense. According to the WSJ, human DNA was prevalent, as were beetle and fly DNA. Those make sense. (We actually don't know about cockroaches because their genome hasn't been sequenced yet.)

The next most prevalent type of DNA, though, was cucumber. Cucumber? When was the last time you saw someone gnawing on a cucumber on the subway? More likely, the computer program grouped all plant material under cucumber.

Initial database searches with subway DNA, for instance, turned up false matches to the Tasmanian devil, the Himalayan yak and the Mediterranean fruit fly—all creatures highly unlikely to be found in a New York transit system.

That's why some of the microbial DNA found that sounds alarming — anthrax, for instance — is no reason to be calling Homeland Security just yet.

None of this detracts, of course, from how incredibly interesting urban microbiology really is. Cities are organisms, and we're slowly beginning to understand its microbiome. Studying how bacteria — be they pathogens or scavengers or simply harmless microbes — spread through the urban environment could yield new insights into how a city works.

I'm just impatiently waiting for the future, when we're beyond the simple cataloguing of microbes, to truly understanding how they live and function together. How long before these guys get down on the London Underground? I wonder if fried chicken has its own DNA sequence. Would you want to know what germs are lurking on your morning commute on the Northern Line? [WSJ, Cell Systems]