From Quanta Magazine (find original story here).

It used to be that to find new forms of life, all you had to do was take a walk in the woods. Now it’s not so simple. The most conspicuous organisms have long since been cataloged and fixed on the tree of life, and the ones that remain undiscovered don’t give themselves up easily. You could spend all day by the same watering hole with the best scientific instruments and come up with nothing.

Maybe it’s not surprising, then, that when discoveries do occur, they sometimes come in torrents. Find a different way of looking, and novel forms of life appear everywhere.

A team of microbiologists based at the University of California, Berkeley, recently figured out one such new way of detecting life. At a stroke, their work expanded the number of known types—or phyla—of bacteria by nearly 50 percent, a dramatic change that indicates just how many forms of life on earth have escaped our notice so far.

“Some of the branches in the tree of life had been noted before,” said Chris Brown, a student in the lab of Jill Banfield and lead author of the paper. “With this study we were able to fill in many gaps.”

Life’s finest net
As an organizational tool, the tree of life has been around for a long time. Lamarck had his version. Darwin had another. The basic structure of the current tree goes back 40 years to the microbiologist Carl Woese, who divided life into three domains: eukaryotes, which include all plants and animals; bacteria; and archaea, single-celled microorganisms with their own distinct features. After a point, discovery came to hinge on finding new ways of searching.

“We used to think there were just plants and animals,” said Edward Rubin, director of the U.S. Department of Energy’s Joint Genome Institute. “Then we got microscopes, and got microbes. Then we got small levels of DNA sequencing.”

DNA sequencing is at the heart of this current study, though the researchers’ success also owes a debt to more basic technology. The team gathered water samples from a research site on the Colorado River near the town of Rifle, Colo. Before doing any sequencing, they passed the water through a pair of increasingly fine filters—with pores 0.2 and 0.1 microns wide — and then analyzed the cells captured by the filters. At this point they already had undiscovered life on their hands, for the simple reason that scientists had not thought to look on such a tiny scale before.

“Most people assumed that bacteria were bigger, and most bacteria are bigger,” Rubin said. “[Banfield] has shown that there are whole populations that are very small.”

The researchers extracted the DNA from the cellular material and sent it to the Joint Genome Institute for sequencing. What they got back was a mess. Imagine being handed a box of pieces from thousands of different jigsaw puzzles and having to assemble them without knowing what any of the final images look like. That’s the challenge researchers face when performing metagenomic analysis—sequencing scrambled genetic material from many organisms at once.

The Berkeley team began the reassembly process with algorithms that assembled bits of the sequenced genetic code into slightly longer strings called contigs.

“You no longer have tiny pieces of DNA, you have bigger pieces,” Brown said. “Then you figure out which of these larger pieces are part of a single genome.”

This part of the process, in which contigs are combined to reconstruct the genome sequence, is called genome binning. To execute it, the researchers relied on another set of algorithms, customized for the task by Itai Sharon, a co-author of the study. They also assembled some of the genomes manually, making decisions about what goes where based on the fact that some characteristics are consistent for a given genome. For example, the percentage of Gs and Cs will be similar on any part of an organism’s DNA.

When the assembly was complete, the researchers had eight full bacterial genomes and 789 draft genomes that were roughly 90 percent complete. Some of the organisms had been glimpsed before; many others were completely new.

The reason no one had found these organisms before is that the traditional method used to search for small forms of life doesn’t work for everything. That method involves the 16S rRNA gene, which is often compared to a fingerprint because the genetic code it contains is unique for every organism. When confronted with a DNA stew, like the one from the water samples in Rifle, scientists use substances called primers to draw out and amplify all the 16S rRNA genes. The problem is, not all 16S rRNA genes react with the primers, rendering some organisms effectively invisible.

“The primers don’t work as well as people would like them to,” Brown said. “We showed that many of the sequences we reconstructed would have been missed by the traditional 16S amplification-type method.”

By reconstructing complete or nearly complete genomes, Brown and his collaborators were able to locate 16S rRNA genes and identify organisms without relying on primers. The group published their results in the July 9 issue of Nature.

The fuller genomic picture they created also allowed them to tease out traits of the life forms they’d discovered. All the organisms they found have very short genomes, about one million base pairs (compare that to E. coli, which has about five million), and they all have minimal metabolic function, requiring them to use fermentation to generate energy. They are also missing many basic biosynthetic pathways and need help making nucleotides and amino acids.

“They must be dependent on other organisms in some capacity to survive. This also explains why no one’s been able to grow them in the lab,” Brown said.

A new domain?
The discovery of new organisms is fairly cut and dried: Either you’ve found one or you haven’t. Cataloging organisms, fitting them into the tree of life, involves more judgment calls.

The researchers divided the 789 organisms into 35 phyla—28 of which were newly discovered—within the domain bacteria. They based the sorting on the organisms’ evolutionary history and on similarities in the code on the organisms’ 16S rRNA genes—those with at least 75 percent of their code in common went into the same phylum.

With these new additions, there are now roughly 90 identified bacterial phyla. This is a lot more than there were a year ago, but also far fewer than the 1,300 to 1,500 phyla that microbiologists estimate we’ll have once a complete accounting is finished. Recent advances in genetic sequencing and genome binning make Brown and Banfield optimistic, though, that it won’t be long before we’ve mapped them all.

“I think that much of the tree of life will come into view in the next few years,” Banfield wrote in an email.

Of course, no sooner do we think we’ve seen everything than we come up with a new way to see. Rubin thinks that the development of tools like the ones used in the new study make the search for life “a growth industry,” and he thinks it’s likely that growth will occur in surprising ways.

“Looking at things from a different angle may offer that possibility of a fourth domain,” he said—an equal partner to bacteria, archaea and eukaryotes. “There will always be novel stuff that will teach us foundational info about how life operates.”

Reprinted with permission from Quanta Magazine, an editorially independent publication of the Simons Foundation whose mission is to enhance public understanding of science by covering research developments and trends in mathematics and the physical and life sciences.