DataDirect Networks Scalable, High-Performance Storage Powers Wellcome Trust Sanger Institute’s Worldwide Research Efforts to Reduce Global Health Burden
DDN’s Unrivaled Performance and Throughput Help Leading Research Institute Evaluate Surge of DNA Sequencing Data to Uncover Causes of Genetic and Infectious Diseases
- To accelerate advancements in biomedical research, the Wellcome Trust Sanger Institute, a charitably funded genomic research center based in the United Kingdom, has deployed DataDirect Networks (DDN) high-performance storage as part of a 22 petabyte genomic storage environment.
- As one of the top five scientific institutions in the world specializing in DNA sequencing, Sanger Institute embraces the latest technologies to research the genetic basis of global health problems, including cancer, malaria, diabetes, obesity and infectious diseases.
- To manage the massive surge in the volume of data required to evaluate genetic sequences, Sanger Institute selected DDN’s SFA® high-performance storage engine and EXAScaler™ Lustre® file system appliance. Together they deliver unprecedented levels of throughput and scalability to support tens of thousands of concurrent sequencing analyses, each of which can require up to 10,000 CPU hours of computation.
- DDN SFA storage will also help facilitate data access and sharing for more than 2,000 scientists around the world, including those who access data through the Sanger Institute’s website, which receives 20 million hits and 12 million impressions each week.
- The 30 DNA sequencers in Sanger Institute’s Illumina Production Sequencing core facility each produce about one terabyte of data daily. With DDN technology, the Sanger Institute has an easy-to-manage, integrated system that offers unparalleled scalability to address both the complex computing problems and the ever-changing collaboration requirements associated with its leading-edge research.
- DDN’s proven experience serving some of the world’s fastest computers ensures that the Sanger Institute can deliver the highest levels of compute performance and throughput, as well as maximum system uptime, to optimize the latest sequencing technologies. This is critical as today’s sequencers produce a million times more data than those used a decade ago.
- Moreover, the institute can now give its diverse scientific community an essential tool for making full use of its approximately £80 million research budget in pursuit of groundbreaking scientific and medical discoveries.
DDN Flexible, High-Performance Infrastructure Supports Diverse Research Workloads
- With DDN storage, the Sanger Institute can achieve its goal of supporting different research workloads with a wide range of computational analysis and storage requirements while being able to expand quickly and without disruption.
- Since installing its initial SFA storage platform, the Sanger Institute has kept pace with ever-increasing computational and analytical demands by taking advantage of DDN’s ongoing performance increases, achieving speeds of up to 20 GBps, enough to meet the needs of its most demanding workloads.
- To accommodate demands for increased bandwidth, Sanger Institute is upgrading its 10GbE network to 40GbE and plans to scale its current DDN storage to support expanded network capacity.
- Additionally, Sanger Institute is exploring DDN’s WOS® distributed object storage platform, which could be well suited to increased collaboration and data sharing as part of a private cloud.
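For readers who want to put the figures above in proportion, the following is a rough back-of-the-envelope sketch, not a description of the institute's actual configuration. It uses only the numbers quoted in this release (30 sequencers, about one terabyte each per day, 10GbE and 40GbE links, 20 GBps of storage throughput) and assumes decimal units, evenly spread sequencer output, and no protocol overhead. The function names are illustrative.

```python
# Figures quoted in the release; unit conventions and even spreading
# of sequencer output over the day are simplifying assumptions.
SEQUENCERS = 30
TB_PER_SEQUENCER_PER_DAY = 1.0   # "about one terabyte", treated as decimal TB
STORAGE_GB_PER_S = 20.0          # "up to 20 GBps"
SECONDS_PER_DAY = 24 * 60 * 60


def sustained_ingest_gb_per_s(sequencers: int, tb_per_day_each: float) -> float:
    """Average raw sequencer output in GB/s if spread evenly over a day."""
    return sequencers * tb_per_day_each * 1000 / SECONDS_PER_DAY


def link_gb_per_s(gigabits_per_s: float) -> float:
    """Theoretical line rate of a network link in GB/s (8 bits per byte)."""
    return gigabits_per_s / 8


ingest = sustained_ingest_gb_per_s(SEQUENCERS, TB_PER_SEQUENCER_PER_DAY)
print(f"Average sequencer ingest: {ingest:.3f} GB/s")
print(f"One 10GbE link:           {link_gb_per_s(10):.2f} GB/s")
print(f"One 40GbE link:           {link_gb_per_s(40):.2f} GB/s")
print(f"40GbE links to match 20 GB/s storage: "
      f"{STORAGE_GB_PER_S / link_gb_per_s(40):.0f}")
```

Under these assumptions, raw sequencer output averages well under the line rate of a single 10GbE link, which suggests that the demand for bandwidth comes mainly from analysis jobs reading and rewriting that data many times over rather than from ingest alone.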
Tim Cutts, acting head of scientific computing, Wellcome Trust Sanger Institute:
- “If you need 10,000 cores to perform an extra layer of analysis in an hour, you have to scale a significant cluster to get answers quickly. You need a real solution that can address everything from very small to extremely large data sets.”
- “We have to explore emerging technologies that could play a significant role in our future architecture. We need solutions that give us a much better way to provide storage to our expanding user community with good access controls through iRODS.”
Phil Butcher, director of information communications technology, Wellcome Trust Sanger Institute:
- “The sequencing machines that run today produce a million times more data than the machine used in the human genome project. We produce more sequences in one hour than we did in our first 10 years. For instance, a single cancer genome project sequences data that requires up to 10,000 CPU hours for analysis and we’re doing tens of thousands of these at once. The sheer scale is enormous and the computational effort required is huge.”
- “Our storage strategy gives us incredible scaling. If we need to add a new sequencer, we can expand quickly and without disruption.”
About DataDirect Networks
DataDirect Networks (DDN) is the world leader in massively scalable storage. Our data storage and processing solutions and professional services enable content-rich and high-growth IT environments to achieve the highest levels of systems scalability, efficiency and simplicity. DDN enables enterprises to extract value and deliver business results from their information. Our customers include the world’s leading online content and social networking providers, high-performance cloud and grid computing, life sciences, media production, and security and intelligence organizations. Deployed in thousands of mission-critical environments worldwide, DDN’s solutions have been designed, engineered and proven in the world’s most scalable data centers to ensure competitive business advantage for today’s information-powered enterprise. For more information, go to datadirect.wpengine.com or call 1-800-837-2298.
©2013 All rights reserved. DDN, Storage Fusion Architecture, SFA12K, WOS and Information In Motion are trademarks owned by DataDirect Networks. All other trademarks are the property of their respective owners.