Research & Development World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • Call for Nominations: The 2025 R&D 100 Awards
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
    • Explore the 2024 R&D 100 award winners and finalists
  • Resources
    • Research Reports
    • Digital Issues
    • R&D Index
    • Subscribe
    • Video
    • Webinars
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE

High-Performance Storage Powers Sanger Institute Research to Reduce Global Health Burden

By R&D Editors | October 2, 2013

Clara, CA — To accelerate advancements in biomedical research, the Wellcome Trust Sanger Institute has deployed over 22 petabytes of high-performance storage from DataDirect Networks (DDN). As one of the top five scientific institutions in the world specializing in DNA sequencing, Sanger Institute embraces the latest technologies to research the genetic basis of global health problems, including cancer, malaria, diabetes, obesity and infectious diseases.

In order to manage the massive surge in the volume of data required to evaluate genetic sequences, Sanger Institute, a charitably funded genomic research center based in the United Kingdom, selected DDN’s SFA high-performance storage engine and EXAScaler Lustre file system appliance to deliver unprecedented levels of throughput and scalability to support tens of thousands of data sequences requiring up to 10,000 CPU hours of computational analysis. With more than 2,000 scientists around the world, DDN SFA storage will also help facilitate data access and sharing including for those who access data through the Sanger Institute’s website, which results in 20 million hits and 12 million impressions each week.

As the 27 DNA sequencers in Sanger Institute’s Illumina Production Sequencing core facility each pump out about one terabyte of data daily, with DDN technology the Sanger Institute has an easy-to-manage, integrated system that offers unparalleled scalability to address both complex computing problems and ever-changing collaboration requirements associated with its leading-edge research.

DDN’s experience serving some of the world’s fastest computers ensures that the Sanger Institute can deliver the highest levels of compute performance and throughput, as well as maximum system uptime, to optimize the latest sequencing technologies. This is critical, as today’s sequencers produce a million times more data than those used a decade ago.

Moreover, the institute now can provide its diverse scientific community with an essential tool for leveraging its approximately £80 million research budget to the fullest in order to further the exploration of groundbreaking scientific and medical discoveries.

To accommodate demands for increased bandwidth, Sanger Institute is upgrading its 10GbE network to 40GbE and plans to scale its current DDN storage to support expanded network capacity.

“If you need 10,000 cores to perform an extra layer of analysis in an hour, you have to scale a significant cluster to get answers quickly. You need a real solution that can address everything from very small to extremely large data sets,” Tim Cutts, acting head of scientific computing, Wellcome Trust Sanger Institute, said. “We have to explore emerging technologies that could play a significant role in our future architecture. We need solutions that give us a much better way to provide storage to our expanding user community with good access controls through iRODS.”

“The sequencing machines that run today produce a million times more data than the machine used in the human genome project. We produce more sequences in one hour than we did in our first 10 years. For instance, a single cancer genome project sequences data that requires up to 10,000 CPU hours for analysis and we’re doing tens of thousands of these at once. The sheer scale is enormous and the computational effort required is huge,” Phil Butcher, director of information communications technology, Wellcome Trust Sanger Institute, said. “Our storage strategy gives us incredible scaling. If we need to add a new sequencer, we can expand quickly and without disruption.”

Related Articles Read More >

QED-C outlines road map for merging quantum and AI
Quantum computing hardware advance slashes superinductor capacitance >60%, cutting substrate loss
Hold your exaflops! Why comparing AI clusters to supercomputers is bananas
Why IBM predicts quantum advantage within two years
rd newsletter
EXPAND YOUR KNOWLEDGE AND STAY CONNECTED
Get the latest info on technologies, trends, and strategies in Research & Development.
RD 25 Power Index

R&D World Digital Issues

Fall 2024 issue

Browse the most current issue of R&D World and back issues in an easy to use high quality format. Clip, share and download with the leading R&D magazine today.

Research & Development World
  • Subscribe to R&D World Magazine
  • Enews Sign Up
  • Contact Us
  • About Us
  • Drug Discovery & Development
  • Pharmaceutical Processing
  • Global Funding Forecast

Copyright © 2025 WTWH Media LLC. All Rights Reserved. The material on this site may not be reproduced, distributed, transmitted, cached or otherwise used, except with the prior written permission of WTWH Media
Privacy Policy | Advertising | About Us

Search R&D World

  • R&D World Home
  • Topics
    • Aerospace
    • Automotive
    • Biotech
    • Careers
    • Chemistry
    • Environment
    • Energy
    • Life Science
    • Material Science
    • R&D Management
    • Physics
  • Technology
    • 3D Printing
    • A.I./Robotics
    • Software
    • Battery Technology
    • Controlled Environments
      • Cleanrooms
      • Graphene
      • Lasers
      • Regulations/Standards
      • Sensors
    • Imaging
    • Nanotechnology
    • Scientific Computing
      • Big Data
      • HPC/Supercomputing
      • Informatics
      • Security
    • Semiconductors
  • R&D Market Pulse
  • R&D 100
    • Call for Nominations: The 2025 R&D 100 Awards
    • R&D 100 Awards Event
    • R&D 100 Submissions
    • Winner Archive
    • Explore the 2024 R&D 100 award winners and finalists
  • Resources
    • Research Reports
    • Digital Issues
    • R&D Index
    • Subscribe
    • Video
    • Webinars
  • Global Funding Forecast
  • Top Labs
  • Advertise
  • SUBSCRIBE