Big data: astronomical or genomical?

Monday, July 13, 2015

Big Data: Astronomical or Genomical?

Zachary D. Stephens, Skylar Y. Lee, Faraz Faghri, Roy H. Campbell, Chengxiang Zhai, Miles J. Efron, Ravishankar Iyer, Michael C. Schatz, Saurabh Sinha, Gene E. Robinson

Published: July 7, 2015. DOI: 10.1371/journal.pbio.1002195

Source: iStock


Genomics is a Big Data science and is going to get much bigger, very soon, but it is not known whether the needs of genomics will exceed other Big Data domains. Projecting to the year 2025, we compared genomics with three other major generators of Big Data: astronomy, YouTube, and Twitter. Our estimates show that genomics is a “four-headed beast”—it is either on par with or the most demanding of the domains analyzed here in terms of data acquisition, storage, distribution, and analysis. We discuss aspects of new technologies that will need to be developed to rise up and meet the computational challenges that genomics poses for the near future. Now is the time for concerted, community-wide planning for the “genomical” challenges of the next decade.




One gram of DNA can store about 455 exabytes (one exabyte equals 10^18 bytes). A CD-ROM holds about 700 million (7 x 10^8) bytes of data, so one gram of DNA stores as much data as more than 600 billion CD-ROMs. If an ordinary book requires 1 megabyte of storage, one gram of DNA can hold 455 trillion books!
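
The comparison above is plain division, and a minimal Python sketch makes it easy to check. The constant and function names here are illustrative only, and the sketch assumes decimal units (1 megabyte = 10^6 bytes), which the text does not state explicitly.

```python
# Figures taken from the paragraph above; decimal units are an assumption.
EXABYTE = 10**18                    # bytes in one exabyte
DNA_BYTES_PER_GRAM = 455 * EXABYTE  # about 455 exabytes per gram of DNA
CD_ROM_BYTES = 7 * 10**8            # about 700 million bytes per CD-ROM
BOOK_BYTES = 10**6                  # 1 megabyte per ordinary book (assumed decimal)


def equivalents(total_bytes: int, unit_bytes: int) -> int:
    """Return how many whole units of unit_bytes fit into total_bytes."""
    return total_bytes // unit_bytes


cd_roms = equivalents(DNA_BYTES_PER_GRAM, CD_ROM_BYTES)
books = equivalents(DNA_BYTES_PER_GRAM, BOOK_BYTES)

print(f"CD-ROMs per gram of DNA: {cd_roms:.2e}")
print(f"Books per gram of DNA:   {books:.2e}")
```

With these inputs the CD-ROM count comes out at 6.5 x 10^11, slightly above the rounded figure of 600 billion, and the book count at 4.55 x 10^14, that is, 455 trillion.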

