This document discusses how genomics and DNA sequencing data can be analyzed using big data technologies like Hadoop. It describes how early DNA sequencing efforts faced bottlenecks but scaling out storage and computing using Hadoop helped overcome these issues. Large-scale analysis of sequencing data allows for variant calling and genome-wide association studies to identify disease causes at a large scale. Recent efforts like India's biometric identity system Aadhaar show how similar techniques can be applied to very large biological data sets to gain health insights.
1 of 34
More Related Content
Hadoop and Genomics - What You Need to Know - London - Viadex RCC - 2015.03.17