
1,000 GENOMES: By sequencing several genomes in depth and hundreds lightly, scientists will create a map of human genetic diversity, which may provide a road map for uncovering the potential genetic roots of disease.
Image: © RGBSPACE/FOTOLIA.COM
-
The Best Science Writing Online 2012
Showcasing more than fifty of the most provocative, original, and significant online essays from 2011, The Best Science Writing Online 2012 will change the way...
Read More »
The number of sequenced human genomes will soon swell to more than 1,000 as part of a new international research consortium's effort to trace the potential genetic origins of disease. But first the mother, father and adult child of a European-ancestry family from Utah and a Yoruba-ancestry family from Nigeria will join an anonymous individual as well as famous geneticists Craig Venter and James Watson as part of the handful of humans to have on record a complete readout of their roughly three billion pairs of DNA. And these six will also each have their genetic codes examined at least 20 times, providing 10 times the accuracy of existing genetic sequences as well as paving the way for the ambitious effort dubbed the 1,000 Genomes Project, which will comprehensively map humanity's genetic variation.
"The reference sequence that we obtained in 2003 [from the anonymous individual] is just a human genome sequence, but there are six billion humans and it is the sequence of all of us that is important," says project co-chair Richard Durbin of the Wellcome Trust Sanger Institute in Cambridge, England. "We can't get that, but the output of the 1,000 Genomes Project will be a lot closer."
The project will proceed in three steps, according to the consortium. The first, currently underway and expected to be completed by year's end, is the detailed scanning of the six individuals. This will be followed by less detailed genome scans of 180 anonymous people from around the world and then partial scans of an additional 1,000 people. "If we look at about 1,000 individuals, we'll get genetic variants in those samples that are somewhere around 1 percent or lower frequency" in the human population, says geneticist Lisa Brooks, director of the Bethesda, Md.–based National Human Genome Research Institute's Genetic Variation Program.
The researchers plan to use common genetic sequences from the initial six individuals to allow less rigorous scanning as the project accelerates. For example, a shared stretch of DNA from one of the detailed individual scans could fill in the blanks for a less rigorously scanned later individual. "This all needs to be tested," Brooks says. "Is doing these scans 2x [twice] sufficient?"
Blood samples for the final project have already been collected from the first six candidates as well as randomly selected populations from throughout the world: Japanese from Tokyo, Chinese from Beijing, Luhya and Masai from Kenya, Toscani from Italy, Gujarati Asian Indians from Houston, Chinese from Denver, Mexican-Americans from Los Angeles and African-Americans from the southwestern U.S. "We wanted to look at a couple of populations, two or three from each of the major Old World continents. That gives you the variety of people but no one population is needed," Brooks notes. "We don't know exactly who these people are."
The researchers collected no identifying individual information, such as medical histories or basic height and weight data, because the study is not looking for specific diseases but rather the range of human genetic variation. "How many individuals with autism would be in 100 samples from a community?" Brooks says. "It's not a very good disease study but this is going to support a zillion disease studies that are properly designed."
For example, once complete, the 1,000-genome resource will allow researchers to pinpoint genes, structural variants in chromosomes and other individual genomic variations that are associated with diseases ranging from autism to cancer. Researchers using the incomplete genetic information presently available have already identified at least 100 regions of the genome associated with various diseases by comparing the DNA variation between healthy and ill subjects. "This project is talking about a hundred times as much data" as existing genetic resources, Durbin says.




See what we're tweeting about





3 Comments
Add CommentI've wondered this since Biology class in high school and the thought's come back to haunt me now with this article: How is it "clear that 99 percent of DNA is the same in all humans" when not all humans have been sequenced? Which part of the DNA was used as the *approximation*?
Reply | Report Abuse | Link to thisThat's a good question! You are right, it can't be absolutely clear, but it turns out that the probability of picking two humans whose genomes are 99% the same by pure chance is nearly zero. Let's see why.
Reply | Report Abuse | Link to thisSuppose we have a species whose genome is only 100bp long. Suppose the first individual of this species, whose genome we just sequenced, has genome of 100 As: AAAAA...AAAAA. Then we sequence the genome of another individual of this species. This one has TAAAA...AAAAA, that is 1 T and 99 As. Apparently these two are 99% similar.
One may say, "so what? maybe it's just a coincidence." Well, it may be coincidence, but what the chances of that are?
There are only 300 genomes that are 1 bp different from all As. These are TAA...AA, CAA...AA, GAA...AA, ATA...AA, ACA...AA, AGA...AA, and so on. 3 possible differences at each of 100 positions. 3*100=300.
Now the total number of all possible genomes is astronomical: 4^100. Any of the four nucleotides (A, T, C, G) at each of 100 positions. The probability to blindly pick an individual whose genome is 99% similar to a given genome is 300/4^100 that is basically zero.
That's why having sequenced genomes of only 2 individuals and observing a 99% similarity one may safely conclude that "99 percent of DNA is the same in all humans."
I hope this explanation helps. Have a good day!
So, let me see if I have this right. If you were to check two people's phone numbers, and they both live in, say, California, then all phones on the planet are located in the U.S.?
Reply | Report Abuse | Link to thisDNA can prove you DID NOT do something.
Proving you DID do something has an error rate.
ecarturo is right. DNA statistics have a way to go to become irrationally true.
Oh, eye witnesses are often wrong too.