{"id":11491,"date":"2016-12-05T13:00:43","date_gmt":"2016-12-05T13:00:43","guid":{"rendered":"https:\/\/blogs.nature.com\/naturejobs\/?p=11491"},"modified":"2016-11-29T12:48:20","modified_gmt":"2016-11-29T12:48:20","slug":"boost-research-and-avoid-embarrassing-retractions-by-working-openly-and-reproducibly","status":"publish","type":"post","link":"https:\/\/blogs.nature.com\/naturejobs\/2016\/12\/05\/boost-research-and-avoid-embarrassing-retractions-by-working-openly-and-reproducibly\/","title":{"rendered":"#scidata16: Boost research and avoid embarrassing retractions by working openly and reproducibly"},"content":{"rendered":"<h2>Experiments fail to be reproduced, research data from others is hard to come by, and steps between data and figure are described as <a href=\"https:\/\/www.sciencecartoonsplus.com\/gallery\/math\/math07.gif\">\u2018here, a miracle happens\u2019<\/a>.<\/h2>\n<p><em>Speakers at the <a href=\"https:\/\/blogs.nature.com\/scientificdata\/2016\/06\/03\/announcing-the-scidata16-draft-programme-and-call-for-lightning-talks\/\">Publishing Better Science through Better Data<\/a> (#scidata16) conference addressed these issues and more.<\/em><\/p>\n<p><em>Publishing Better Science through Better Data journalism competition winner R\u00e9ka Nagy.<\/em><\/p>\n<p>Most research happens behind closed doors, and the results can only be gleaned once they\u2019ve been published. The raw data that lead to results, however, are rarely made public, and the steps taken to get from data to figures in a publication is not always clear, which has led to the <a href=\"https:\/\/www.nature.com\/news\/reproducibility-1.17552\">reproducibility crisis<\/a> currently facing research. It\u2019s clear that something needs to be done to address this, and the ever-inventive collective mind of science is finding inventive solutions.<\/p>\n<div id=\"attachment_11497\" style=\"width: 1577px\" class=\"wp-caption alignright\"><a class=\"wpn-image-link\" href=\"https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/11\/Network_Visualization.png\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-11497\" class=\" wpn-image wp-image-11497\" title=\"Network_Visualization\" src=\"https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/11\/Network_Visualization.png\" alt=\"Network_Visualization\" width=\"1567\" height=\"1044\" srcset=\"https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/11\/Network_Visualization.png 1280w, https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/11\/Network_Visualization-300x200.png 300w, https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/11\/Network_Visualization-1024x682.png 1024w\" sizes=\"auto, (max-width: 1567px) 100vw, 1567px\" \/><\/a><p id=\"caption-attachment-11497\" class=\"wp-caption-text\">The steps taken to get from data to figures in a publication is not always clear {credit}SlvrKy\/Wikimedia Commons CC-BY-SA-4.0 {\/credit}<\/p><\/div>\n<p><!--more--><\/p>\n<p><a href=\"https:\/\/www.markowetzlab.org\/\">Florian Markowetz<\/a> is a strong advocate of working reproducibly. In his papers, he provides raw data and programming that show how he gets from data to figure. With this information, anyone can re-run the whole analysis \u2014 members of his lab, reviewers, or anyone reading his paper, which makes the correction of mistakes easy, catalogues the whole analysis process, and allows other scientists to re-create the result files quickly and easily should the input data change.<\/p>\n<p>This works really well when your entire analysis is done using one freely-available statistics program, and if you\u2019re free and able to publish your raw data. But if your research involves many programs, some of which need a paid license, and your dataset is in the terabytes and not freely distributable, this approach starts hitting some barriers.<\/p>\n<p>These hurdles don\u2019t stop the average scientist from working reproducibly, though \u2014 by meticulously cataloguing experiments, and keeping track of changes made to data, scientists will know exactly how each other\u2019s results were obtained and don\u2019t have to start from scratch.<\/p>\n<p>Rachel Harding, who is studying Huntington\u2019s Disease at the University of Toronto, has been quick to grasp the benefits of sharing her research far and wide. She created <a href=\"https:\/\/labscribbles.com\/\">Lab Scribbles<\/a> \u2014 an online platform where she shares her work, rationale, methods and raw data. This helps her to keep track of the science, and lets others interact, collaborate and learn from her successes (and failures).<\/p>\n<p>Sometimes, it\u2019s not your own data that are the issue. You can be the most meticulous person ever, but if your research relies on others\u2019 data, you can be in for a headache. Antica \u010culina and Nathan Golightly have spent many sleepless nights searching for data they can use in their work. She needs to bring together information collected globally to gain deeper insights into her work on divorce in birds (yes, that\u2019s <a href=\"https:\/\/www.newscientist.com\/article\/mg22429934-400-why-bird-divorces-are-good-news-for-the-females\/\">a real thing<\/a>). He compares the performance of machine learning algorithms to predict cancer based on freely available gene expression data. Finding these data is only the first challenge \u2014 when I pressed them during a Q&amp;A session, they admitted to throwing away potentially valuable datasets because these were not properly annotated, contained the wrong data, or were in some arcane file format. \u201cThey made me doubt science a bit, made me doubt the quality of these studies\u201d, says \u010culina. They strongly advocate that people who share their data make it FAIR \u2014 findable, accessible, interoperable and reusable.<\/p>\n<p>In addition to individual efforts to make data open and reproducible, there are examples of large-scale research initiatives doing the same, such as the Human Genome Project, or, more recently, the <a href=\"https:\/\/exac.broadinstitute.org\/about\">Exome Aggregation Consortium (ExAC)<\/a>. Anyone can download, modify, use and share their data, as long as they attribute it accordingly and keep any redistributed versions open. Wikipedia enjoys the same mentality of openness, which has also been translated to software development. Linux, for example, is open source \u2014 it can be freely modified and distributed by anyone.<\/p>\n<p>The Montreal Neurological Institute has taken this one step further, making all research data and samples obtained at the institute freely available after publication (within the bounds of sample availability and patient confidentiality), and foregoing the pursuit of patents. Guy Rouleau, the mastermind behind this bold step, says that research should be used to <a href=\"https:\/\/www.nature.com\/news\/corporate-culture-has-no-place-in-academia-1.20724\">accelerate science for the benefit of helping people instead of making money<\/a>. He hopes that this will speed up drug discovery for the treatment of neurological disorders, and that other institutes follow suite.<\/p>\n<div id=\"attachment_10781\" style=\"width: 472px\" class=\"wp-caption aligncenter\"><a class=\"wpn-image-link\" href=\"https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/RekaNagy-smaller.jpg\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-10781\" class=\" wp-image-10781 wpn-image\" title=\"RekaNagy-smaller\" src=\"https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/RekaNagy-smaller.jpg\" alt=\"R\u00e9ka Nagy\" width=\"462\" height=\"316\" srcset=\"https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/RekaNagy-smaller.jpg 2322w, https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/RekaNagy-smaller-300x205.jpg 300w, https:\/\/blogs.nature.com\/naturejobs\/files\/2016\/09\/RekaNagy-smaller-1024x699.jpg 1024w\" sizes=\"auto, (max-width: 462px) 100vw, 462px\" \/><\/a><p id=\"caption-attachment-10781\" class=\"wp-caption-text\">R\u00e9ka Nagy<\/p><\/div>\n<p>There are still many challenges that need to be addressed before researchers and institutions adopt open, reproducible science. But ten years ago, the open data movement was barely learning how to crawl. Today, we\u2019re hearing the starter pistol. In another ten years, we may see it run.<\/p>\n<p>&nbsp;<\/p>\n<p><em>R\u00e9ka Nagy is a PhD student at the MRC Institute of Genetics and Molecular Medicine at the University of Edinburgh, unravelling how genetics shapes our health using large family-based datasets. When she\u2019s not busy writing scripts and analysing data, she can be found communicating science or using a computer to play video games and design anything from posters to dream homes. You can find her on\u00a0<a href=\"https:\/\/www.linkedin.com\/in\/Reka-Nagy\">LinkedIn<\/a>\u00a0and\u00a0<a href=\"https:\/\/twitter.com\/RekaGene\">Twitter<\/a>.<\/em><\/p>\n<p><em>You can access all the slides and videos from <\/em>Publishing Better Science through Better Data 2016<em>, as well as the great visual summary of the day, on the <a href=\"https:\/\/www.nature.com\/openresearch\/scidata16\/\">event website<\/a>.<\/em><\/p>\n<p>&nbsp;<\/p>\n<p><strong>Suggested posts<\/strong><\/p>\n<p><a href=\"https:\/\/blogs.nature.com\/naturejobs\/2016\/09\/26\/the-era-of-big-data-is-coming-scientists-need-to-step-out-of-their-comfort-zone\/\">The era of big data is coming: Scientists need to step out of their comfort zone<\/a><\/p>\n<p><a href=\"https:\/\/blogs.nature.com\/naturejobs\/2016\/10\/31\/why-should-we-work-so-hard-to-make-our-work-reproducible\/\">Why should we work so hard to make our work reproducible?<\/a><\/p>\n<p><a href=\"https:\/\/blogs.nature.com\/naturejobs\/2016\/10\/19\/what-are-the-benefits-of-reproducibility-in-science\/\">What are the benefits of reproducibility in science?<\/a><\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Speakers at the Publishing Better Science through Better Data (#scidata16) conference addressed these issues and more.&nbsp; <a href=\"\/naturejobs\/2016\/12\/05\/boost-research-and-avoid-embarrassing-retractions-by-working-openly-and-reproducibly#more-11491\" class=\"more-link\">Read more<\/a> <a href=\"https:\/\/blogs.nature.com\/naturejobs\/2016\/12\/05\/boost-research-and-avoid-embarrassing-retractions-by-working-openly-and-reproducibly\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":90925,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[192,323,385,865,65,359,199],"tags":[3163831,4334729,865,3382713,1293,3789011,38453,4633663,3994549,1539,4105227,256,292,1373],"class_list":["post-11491","post","type-post","status-publish","format-standard","hentry","category-blog-2","category-career-paths","category-careers-articles","category-data","category-phd","category-publishing-2","category-research-2","tag-scidata16","tag-code","tag-data","tag-information","tag-journalism-competition","tag-programming","tag-publishing-better-science-through-better-data","tag-publishing-data","tag-reka-nagy","tag-reproducibility","tag-scidata","tag-science","tag-working","tag-writing-competition"],"_links":{"self":[{"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/posts\/11491","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/users\/90925"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/comments?post=11491"}],"version-history":[{"count":0,"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/posts\/11491\/revisions"}],"wp:attachment":[{"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/media?parent=11491"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/categories?post=11491"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.nature.com\/naturejobs\/wp-json\/wp\/v2\/tags?post=11491"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}