World Repository of Human Genetics Will Move to Amazon's Cloud

The 200-terabyte 1,000 Genomes Project data will now be stored for free, although analytic computing resources will come at a price

Join Our Community of Science Lovers!

The U.S. National Institutes of Health announced Friday (March 30) that it'll be hosting data from its 1,000 Genomes Project for free on Amazon's cloud service. The 1,000 Genomes Project is the world's largest database of human genetics. It was created to act as a "reference population," including people of different ethnicities around the world, and it captures all the major ways in which humankind varies genetically. Now that they are hosted on Amazon's servers, the data in 1000 Genomes will be easier and cheaper for scientists to obtain and analyze.

"[The Amazon hosting] makes the data available to researchers in a way that is more useful and that avoids the researcher having to spend lots of money on storing the data themselves, on their local systems," Eric Schadt, director of the genomics institute at the Mount Sinai School of Medicine in New York, wrote to InnovationNewsDaily in an email. "This is definitely cool."

In spite of its name, the project actually holds genetic information from 1,700 anonymous people, with 900 more to come this year. The main difficulty with the database is that it's so large — 200 terabytes, an amount that would fill 30,000 DVDs. The information in the database has always been freely available at 1000genomes.org, but before the Amazon hosting deal, scientists had to pay for the Internet bandwidth and storage space to download the data, Schadt explained. People who did not have access to the powerful computers needed to store 1,000 Genome's data couldn't read the data at all. 


On supporting science journalism

If you're enjoying this article, consider supporting our award-winning journalism by subscribing. By purchasing a subscription you are helping to ensure the future of impactful stories about the discoveries and ideas shaping our world today.


Amazon Web Services also offers its superpowered computing resources to researchers who want to do calculations on the enormous genetics database. For that, Amazon will charge. The company charged one pharmaceutical client $1,279 an hour to run very large calculations, the New York Times' Bits blog reported. Yet researchers may still find it to be worth the price. "Many will be willing to bear this cost because it is far less expensive than buying 500 terabytes of disk storage and a modest-sized computer cluster to analyze those data locally," Schadt wrote. 

By making this genomics data more accessible and affordable to researchers, the Amazon deal may ultimately help scientists predict diseases more reliably, based on a person's genetics, Schadt wrote. 

The deal is a part of a new initiative from the Obama administration that will invest $200 million to researching better ways to store, analyze and find interesting points in extremely large datasets such as 1,000 Genomes. 

Copyright 2012 InnovationNewsDaily, a TechMediaNetwork company. All rights reserved. This material may not be published, broadcast, rewritten or redistributed.

It’s Time to Stand Up for Science

If you enjoyed this article, I’d like to ask for your support. Scientific American has served as an advocate for science and industry for 180 years, and right now may be the most critical moment in that two-century history.

I’ve been a Scientific American subscriber since I was 12 years old, and it helped shape the way I look at the world. SciAm always educates and delights me, and inspires a sense of awe for our vast, beautiful universe. I hope it does that for you, too.

If you subscribe to Scientific American, you help ensure that our coverage is centered on meaningful research and discovery; that we have the resources to report on the decisions that threaten labs across the U.S.; and that we support both budding and working scientists at a time when the value of science itself too often goes unrecognized.

In return, you get essential news, captivating podcasts, brilliant infographics, can't-miss newsletters, must-watch videos, challenging games, and the science world's best writing and reporting. You can even gift someone a subscription.

There has never been a more important time for us to stand up and show why science matters. I hope you’ll support us in that mission.

Thank you,

David M. Ewalt, Editor in Chief, Scientific American

Subscribe