Research & Development World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • 2025 R&D 100 Award Winners
    • 2025 Professional Award Winners
    • 2025 Special Recognition Winners
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
  • Resources
    • Research Reports
    • Digital Issues
    • Educational Assets
    • R&D Index
    • Subscribe
    • Video
    • Webinars
    • Content submission guidelines for R&D World
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE

Collaboration yields new genetic variant data set for 1000 Genomes Project

By R&D Editors | December 5, 2013

DNAnexus, which develops cloud-based solutions for large-scale DNA data management and analysis, today announced a collaboration with Stanford University that has resulted in a new 1000 Genomes Project data set of genetic variation. Through this collaboration, scientists have analyzed and identified genetic variants in more than 80 terabytes (TB) of raw BAM files—a binary format for storing DNA sequence data—using the DNAnexus Platform-as-a-Service (PaaS) and have made these data publicly available for follow-on biomedical research.

Launched in January 2008, the 1000 Genomes Project was the first international effort to sequence a large number of individual genomes with the goal of developing a comprehensive and freely accessible resource on human genetic variation. The project has grown to include genomic data from more than 2,500 individuals across 26 separate ethnic populations and is expected to conclude in the spring of 2014.

One of many international teams contributing to the 1000 Genomes Project is the lab of Carlos D. Bustamante, PhD, professor of genetics at Stanford Univ. School of Medicine. The data generated by the project is expected to support a deeper understanding of genetic variation patterns in underserved populations, including African-Americans and Hispanic-Latinos. These data will also enable the development of rich catalogs of DNA variants that could help researchers develop new medical tools such as tests for evaluating disease susceptibility in different populations.

“We believe that many genetic variants exist in present-day admixed populations that have never been seen in typical European-centric biomedical studies. By looking at these groups more closely, we hope not only to increase the overall understanding of haplotype, nucleotide, and structural variation diversity, but also to bring these underserved communities into the fold of medical genetics research,” said Andrew Carroll, PhD, lead DNAnexus scientist on this collaboration.

In this project, Stanford scientists performed variant calling on low-coverage (4-5x) whole-genome sequencing data from 2,535 individuals across 26 different global populations. The variant-calling pipeline was developed by Real Time Genomics and ported to the DNAnexus platform, which is built on top of Amazon Web Services. After computational filtering, some 56 million single-nucleotide polymorphisms (SNPs) and 5.6 million inserts or deletions (indels) were identified across the samples. The call set shows high sensitivity for standard variant sets and considerable variation was observed in the number of polymorphisms across populations. Principal component analysis shows that these variants capture genetic variation at continental and sub-continental levels.

“The size and scope of DNA sequencing projects is rapidly moving toward an era where the analysis of data from thousands of human genomes is the norm. To realize the promise of these projects will necessitate IT infrastructures that exceed the in-house capabilities of most research labs and require bursts of computational resources that would be prohibitively expensive and time-consuming to deploy,” said Richard Daly, CEO of DNAnexus. “DNAnexus pioneered the cloud-based genomics platform and has successfully demonstrated its capacity to cost effectively perform in a number of very large, high-value projects.”

DNAnexus provides an enterprise-focused API-based PaaS that enables clinical and research enterprises to efficiently move their analysis pipelines into the cloud, using their own algorithms alongside industry-recognized tools and reference resources to create customized workflows in a secure, cost-effective and compliant environment. With DNAnexus, labs of any size can build and run their data analysis applications and workflows from anywhere in the world, and work securely with research and clinical collaborators.

1000 Genomes Project

Source: DNAnexus

Related Articles Read More >

Lab automation is “vaporizing”: Why the hottest innovation is invisible
Google on how AI will extend researchers
Kythera Labs’ Wayfinder remasters incomplete medical data for AI analysis
Adviser Labs raises $1M to simplify cloud HPC for in AI and scientific computing
rd newsletter
EXPAND YOUR KNOWLEDGE AND STAY CONNECTED
Get the latest info on technologies, trends, and strategies in Research & Development.
RD 25 Power Index

R&D World Digital Issues

Fall 2024 issue

Browse the most current issue of R&D World and back issues in an easy to use high quality format. Clip, share and download with the leading R&D magazine today.

Research & Development World
  • Subscribe to R&D World Magazine
  • Sign up for R&D World’s newsletter
  • Contact Us
  • About Us
  • Drug Discovery & Development
  • Pharmaceutical Processing
  • Global Funding Forecast

Copyright © 2025 WTWH Media LLC. All Rights Reserved. The material on this site may not be reproduced, distributed, transmitted, cached or otherwise used, except with the prior written permission of WTWH Media
Privacy Policy | Advertising | About Us

Search R&D World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • 2025 R&D 100 Award Winners
    • 2025 Professional Award Winners
    • 2025 Special Recognition Winners
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
  • Resources
    • Research Reports
    • Digital Issues
    • Educational Assets
    • R&D Index
    • Subscribe
    • Video
    • Webinars
    • Content submission guidelines for R&D World
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE