DNAnexus Case Study

2014

DNAnexus, based in Mountain View, California, has created the global network for genomics by providing an API-based platform for the sharing and management of data and tools that accelerate genomic research. On DNAnexus, enterprises scale genomic analysis workloads, deploy tools consistently, control data provenance, and facilitate global collaboration in a secure and compliant environment. The DNAnexus Platform enables scientists and clinicians worldwide to accelerate medical advances, improve patient care, and advance research and development in areas such as cancer, heart disease, Alzheimer’s disease, noninvasive prenatal testing, and enhanced agricultural production.

start a python tutorial
kr_quotemark

With AWS, DNAnexus enables enterprises worldwide to perform genomic analysis and clinical studies in a secure and compliant environment at a scale not previously not possible.”

Richard Daly
CEO, DNAnexus

The Challenge

The rapid pace of development in genomics is revolutionizing scientific research, causing a seismic shift in areas like precision medicine and improving the world food supply. The increase in speed and decrease in cost of next-generation sequencing (NGS) technologies has outpaced Moore’s law. The $1,000 genome is now a reality. The bottlenecks are now genome informatics and data portability; researchers don’t have the compute power nor the infrastructure to manage the volume of raw sequence data they are creating. Additionally, the global scope of research and clinical studies requires a secure and compliant environment within which researchers can share and collaborate on datasets and tools in real time.

“The management and analysis of genomic data at the scale needed to power large-scale genome studies requires computational and storage infrastructure that exceeds the capacity of most institutions,” said Richard Daly, CEO of DNAnexus. “We sought a way to provide enterprises performing research and clinical work the resources to focus on the science, and the ability to collaborate with peers in a secure and compliant environment.” To accomplish this, DNAnexus developed a cloud-based genome informatics and data management platform.

Why Amazon Web Services

DNAnexus carefully examined the available cloud-service options to find a provider that could meet the company’s requirements while also supporting clinical compliance standards. The company chose Amazon Web Services (AWS) as its cloud provider because AWS could deliver tens of thousands of compute cores, store petabytes of data, and offer the reliability, security, and compliance required for highly complex and sensitive research activities.

DNAnexus deploys its customers’ genomic pipelines on Amazon Elastic Compute Cloud (Amazon EC2), using Amazon Simple Storage Service (Amazon S3) for high-performance, high-durability storage and Amazon Glacier for low-cost data archiving. AWS also provided DNAnexus with a Business Associates Agreement (BAA), allowing DNAnexus to offer best-in-class security and compliance with US and international healthcare laws. 

The Benefits

On AWS, the DNAnexus domain-specific data management platform offers tailored compliance and security and provides fine-grained data management for transparency, reproducibility, and data provenance for consistent bioinformatics pipelines and results.

Using AWS, DNAnexus has scaled to more than 56,000 simultaneous compute cores, multiple petabytes of storage, and tens of millions of core-hours of analysis. DNAnexus provides both a GUI and a command-line tool, allowing customers to upload raw DNA data directly from sequencing instruments to the cloud and eliminating the need for costly on-premises computational and storage infrastructure.

Amazon EC2 hosts platform services for DNAnexus customers’ genomic analysis. For DNA analysis, the company developed a custom queuing system that operates on Amazon EC2 instances and robustly handles any interruptions in data processing. DNAnexus leverages the wide variety of instance types and multiple pricing plans for Amazon EC2, including Reserved and Spot Instances. Running on AWS allows DNAnexus to provide visibility into resource consumption, and bill by the minute rather than the hour, offering a compelling pricing model for the DNAnexus customers.

Amazon Web Services' demonstrated compliance to standards such as HIPAA, ISO27001, SOC 1/2/3, ISO9001, FedRamp, and FISMA allows DNAnexus to provide a platform that lets its customers pursue clinically compliant projects of enormous scale and scope, confident in the quality and efficiency of analysis and the security and ease of collaboration.

“AWS allows us to deploy a cloud-based infrastructure that supports our customers’ research and clinical needs,” according to Omar Serang, chief cloud officer at DNAnexus. “The combination of the AWS infrastructure and the DNAnexus platform controls and certified compliance allows us to meet the demanding requirements of HIPAA, CAP/CLIA, GxP, and other privacy laws and regulations.”


About DNAnexus

DNAnexus, based in Mountain View, California, has created the global network for genomics by providing an API-based platform for the sharing and management of data and tools that accelerate genomic research.

Benefits of AWS

  • Scaled to more than 56,000 simultaneous compute cores, multiple petabytes of storage, and tens of millions of core-hours of analysis
  • Eliminated the need for on-premises computational and storage infrastructure
  • Allows customers to pursue clinically compliant projects

AWS Services Used

Amazon EC2

Amazon Elastic Compute Cloud (Amazon EC2) is a web service that provides secure, resizable compute capacity in the cloud. It is designed to make web-scale cloud computing easier for developers.

Learn more »

Amazon S3

Amazon Simple Storage Service (Amazon S3) is an object storage service that offers industry-leading scalability, data availability, security, and performance.

Learn more »

Amazon Glacier

Amazon S3 Glacier and S3 Glacier Deep Archive are a secure, durable, and extremely low-cost Amazon S3 cloud storage classes for data archiving and long-term backup.

Learn more »


Get Started

Companies of all sizes across all industries are transforming their businesses every day using AWS. Contact our experts and start your own AWS Cloud journey today.