{"id":10831,"date":"2016-09-30T13:00:48","date_gmt":"2016-09-30T12:00:48","guid":{"rendered":"https:\/\/blogs.nature.com\/naturejobs\/?p=10831"},"modified":"2016-09-26T11:03:00","modified_gmt":"2016-09-26T10:03:00","slug":"how-is-the-rise-of-data-intensive-research-changing-what-it-means-to-be-a-scientist","status":"publish","type":"post","link":"https:\/\/blogs.nature.com\/naturejobs\/2016\/09\/30\/how-is-the-rise-of-data-intensive-research-changing-what-it-means-to-be-a-scientist\/","title":{"rendered":"How is the rise of data-intensive research changing what it means to be a scientist?"},"content":{"rendered":"<h2>Data-intensive research requires a new breed of scientist: interdisciplinary analysts who enjoy swimming in data, says Atma Ivancevic.<\/h2>\n<p>There has always been an emphasis on the generation of novel data in science. Being a scientist involves progressing from observation to hypothesis to experiment to output. In the past, a combination of scarce data to look at and low throughput machinery to make more has led to limited experimental outcomes.<\/p>\n<div id=\"attachment_10841\" style=\"width: 1878px\" class=\"wp-caption alignright\"><a class=\"wpn-image-link\" href=\"https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/2016-09-12-Atma-Ivancevic-04-smaller-cropped.jpg\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-10841\" class=\"wpn-image wp-image-10841 size-full\" title=\"2016-09-12-Atma Ivancevic 04-smaller-cropped\" src=\"https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/2016-09-12-Atma-Ivancevic-04-smaller-cropped.jpg\" alt=\"2016-09-12-Atma Ivancevic 04-smaller-cropped\" width=\"1868\" height=\"1913\" srcset=\"https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/2016-09-12-Atma-Ivancevic-04-smaller-cropped.jpg 1868w, https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/2016-09-12-Atma-Ivancevic-04-smaller-cropped-293x300.jpg 293w, https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/2016-09-12-Atma-Ivancevic-04-smaller-cropped-1000x1024.jpg 1000w\" sizes=\"auto, (max-width: 1868px) 100vw, 1868px\" \/><\/a><p id=\"caption-attachment-10841\" class=\"wp-caption-text\">Atma Ivancevic<\/p><\/div>\n<p><!--more-->As a result of the rise of computational power, scientists are facing a paradigm shift from data production to data interpretation. Advances in technology have revolutionised the field of science. Now, our experiments are high output, data-intensive projects that come with their own problems. We find ourselves with the ability to capture and store vast amounts of data \u2013 and not enough people to meaningfully interpret it.<\/p>\n<p>The problem is that many scientists are not expected to have the skills needed for data-intensive research.<\/p>\n<p>Consider the following scenario: a biologist in the early 2000s wants to explore the evolution of a particular gene in animals. Using a template DNA sequence, she runs lab experiments to extract the gene from each species and align the sequences. Species differences are assessed, a conclusion is drawn and the results are published, with appropriate analysis on the potential function of the gene and how it has evolved over time. The requirement for publication here is the production of data.<\/p>\n<p>A few years go by, and suddenly the cost of sequencing drops dramatically. Companies and consortia are publishing genomes at such an alarming rate that the number of publicly available species is in the thousands. The biologist wants to see how the previous hypotheses hold up against a larger subset of species. Where to start? She will need high-performance computing machines to store and process the genomes. Manual inspection of the data is too laborious: she needs an automated workflow. She could hire a programmer, but they might lack the background knowledge to find biologically significant phenomena.<\/p>\n<p>Even publishing is a challenge \u2013 printing the raw results would make <em>Nature<\/em> a lot thicker than anyone would want to read. The biologist finds herself at a loss because she cannot perform the computation needed for large datasets, and does not know how to convert the data into publishable format.<\/p>\n<p>What went wrong? The original approach was reproducible and true to the scientific method \u2013 for small datasets. But it can\u2019t be adapted to the immensely complex datasets we see today. The rise of data affects all scientific disciplines: we have next-generation sequencing machines in biology, the Large Hadron Collider in physics, and satellite data collection devices in climate sciences. Scientific practice now requires complete familiarity with a wide set of computational tools and algorithms.<\/p>\n<p>Suppose our biologist decides to embrace the data revolution. There are plenty of open source resources online: she starts by enrolling in a course for programming. She posts questions on message boards and forums. At conferences, she learns how to visualise her results. Eventually she starts offering her own advice to others. Because she recognised her limitations and worked to address them, the biologist is ready to begin her journey as a data scientist. \u00a0<a class=\"wpn-image-link\" href=\"https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/pic-smaller.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-10835 wpn-image alignright\" title=\"pic-smaller\" src=\"https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/pic-smaller.jpg\" alt=\"pic-smaller\" width=\"269\" height=\"476\" srcset=\"https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/pic-smaller.jpg 1520w, https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/pic-smaller-170x300.jpg 170w, https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/pic-smaller-579x1024.jpg 579w\" sizes=\"auto, (max-width: 269px) 100vw, 269px\" \/><\/a><\/p>\n<p>The fundamental characteristics of being a scientist have not changed. We still need systematic, logical researchers with robust methodologies (and substantial funding). But computational experience is now a prerequisite. The future belongs to <a href=\"https:\/\/blogs.nature.com\/naturejobs\/2016\/08\/29\/transferable-skills-what-are-scientists-good-at-other-than-science\/\">all-rounders<\/a>: computer-literate scientists who can quickly transform data into results, and effectively present their results to a broad scientific audience.<\/p>\n<p>&nbsp;<\/p>\n<p><em>Atma Ivancevic is an amateur writer and soon-to-be PhD graduate in bioinformatics at the University of Adelaide, Australia. In her spare time, she enjoys binging on Netflix and spending lazy days at the beach. <\/em><\/p>\n<p><em>This piece was selected as one of the winning entries for the Publishing Better Science through Better Data writing competition. <a href=\"https:\/\/www.eventbrite.co.uk\/e\/publishing-better-science-through-better-data-2016-scidata16-tickets-16695868793\">Publishing Better Science through Better Data<\/a> is a free, full day conference focussing on how early career researchers can best utilise and manage research data. The conference will run on October 26<sup>th<\/sup> at Wellcome Collection Building, London.<\/em><\/p>\n<p>&nbsp;<\/p>\n<p><strong>Suggested posts<\/strong><\/p>\n<p><a href=\"https:\/\/blogs.nature.com\/naturejobs\/2016\/08\/29\/transferable-skills-what-are-scientists-good-at-other-than-science\/\">What are scientists good at (other than science?)<\/a><\/p>\n<p><a href=\"https:\/\/blogs.nature.com\/naturejobs\/2016\/09\/09\/big-data-jobs-are-out-there-are-you-ready\/\">Big data jobs are out there \u2013 are you ready?<\/a><\/p>\n<p><a href=\"https:\/\/blogs.nature.com\/naturejobs\/2016\/08\/08\/scientific-data-effective-communication-big-changes\/\">Scientific data + effective communication = big changes<\/a><\/p>\n<p><a href=\"https:\/\/blogs.nature.com\/naturejobs\/2013\/03\/18\/so-you-want-to-be-a-data-scientist\/\">So you want to be a data scientist?<\/a><\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>There has always been an emphasis on the generation of novel data in science. Being a scientist involves progressing from observation to hypothesis to experiment to output. In the past, a combination of scarce data to look at and low throughput machinery to make more has led to limited experimental outcomes.&nbsp; <a href=\"https:\/\/blogs.nature.com\/naturejobs\/2016\/09\/30\/how-is-the-rise-of-data-intensive-research-changing-what-it-means-to-be-a-scientist#more-10831\" class=\"more-link\">Read more<\/a> <a href=\"https:\/\/blogs.nature.com\/naturejobs\/2016\/09\/30\/how-is-the-rise-of-data-intensive-research-changing-what-it-means-to-be-a-scientist\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":90925,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1021,190,186,865,199,200],"tags":[4015373,4015371,4015361,4015363,429,72611,306,865,364485,4015369,1059,3382713,497,4015365,3282481,441,38453,256,2911587,563,3115289,4015367,286,321],"class_list":["post-10831","post","type-post","status-publish","format-standard","hentry","category-scientistonthemove-2","category-academia-2","category-communication-2","category-data","category-research-2","category-technology-2","tag-analyst","tag-analysts","tag-atma-ivancevic","tag-big","tag-big-data","tag-bioinformatics","tag-competition","tag-data","tag-development","tag-gene","tag-genetics","tag-information","tag-london","tag-production","tag-professional","tag-publication","tag-publishing-better-science-through-better-data","tag-science","tag-scientific","tag-scientific-data","tag-scientist","tag-sequencing","tag-skills","tag-writing"],"_links":{"self":[{"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/posts\/10831","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/users\/90925"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/comments?post=10831"}],"version-history":[{"count":0,"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/posts\/10831\/revisions"}],"wp:attachment":[{"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/media?parent=10831"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/categories?post=10831"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/tags?post=10831"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}