Complete Genomics Sequences 14 Human Genomes
Complete Genomics has sequenced, analyzed and delivered 14 human genomes since March 2009. Considering that fewer than 20 genomes have been sequenced and published to date, this represents a significant advance for medical research.
Complete Genomics’ current customers are using human genome sequencing technology to conduct small pilot projects, each comprised of five to 10 genomes. These customers represent a mix of academic research institutions and biopharmaceutical companies and include Pfizer, the Flanders Institute for Biotechnology (VIB), Duke University, Brigham & Women’s Hospital, the HudsonAlpha Institute for Biotechnology, and the Ontario Institute for Cancer Research in addition to the Institute for Systems Biology and Broad Institute of MIT and Harvard. These customers send DNA samples to the company and receive their requested genome data. Because they do not need to purchase instruments or reagents, it greatly reduces the cost and complexity of sequencing complete human genomes.
The pilot projects are being used to evaluate the technology and also to conduct small-scale disease studies to investigate conditions as diverse as cancer (breast, lung, colorectal and melanoma), HIV and schizophrenia.
In the cancer studies, researchers are comparing patients’ tumor genome with their non-cancerous genome to identify variations that may provide insights into the cause or spread of their disease. The goal is to use this previously unavailable whole genome data to accurately characterize the tumor and identify its vulnerabilities. These data can then be used to design more effective therapies for patients.
Dr. George M. Church, professor of genetics at Harvard Medical School, director of the Center for Computational Genetics, and member of Complete Genomics’ Scientific Advisory Board, described his experience: “As part of the Personal Genome Project, we have had a single human genome sequenced by Complete Genomics. We have cross-validated Complete Genomics’ resulting data set, including a list of variants, to gauge its technical accuracy. I am pleased with the quality of the data provided. Complete Genomics’ technology can clearly deliver high-quality genomic data, which compare favorably with other published results, and at a low cost. I look forward to continuing to work with the company as it scales up the process to sequence thousands of genomes next year.”
Complete Genomics’ Chairman, President and CEO Dr. Clifford Reid agrees: “It’s all about scale. Sequencing one human genome is a scientific curiosity. We need to sequence thousands of them to be able to make meaningful discoveries about the genetic basis of disease. To that end, Complete Genomics plans to sequence 10,000 human genomes in 2010.”
Scaling up
Complete Genomics is conducting its customer pilot projects using its existing R&D sequencing instruments. However, the company is also in the process of scaling up its genome center and transitioning to a commercial operation. After its commercial launch in January 2010, Complete Genomics will conduct much larger sequencing projects and employ new commercial-scale instruments that will be able to read well over one terabase (1012 bases) per run.
Complete Genomics will achieve this goal by increasing the efficiency of its technology on multiple levels. The company is building higher-density DNA nanoarrays (DNBs) that will contain 2.85 billion spots of DNA arranged in a grid with 70 bases per spot, allowing it to sequence an entire human genome on a single array. It is also developing new sequencing instruments that will be able to read the sequences from multiple DNA nanoarrays concurrently, thereby sequencing well over one terabase in a single run. These advances will dramatically increase the company’s sequencing throughput to one instrument-day per genome.
The company’s high volume genomic sequencing will naturally generate massive amounts of data, which will be managed and analyzed in its data center using its proprietary software on a high-performance computing cluster. The key to success will be sharp focus on providing only complete human genome sequencing in a massively-parallel process, optimizing all of its systems for this one task.