Open Science

What Do We Mean By Open Science?

While there are numerous definitions of Open Science, many folks tend to emphasize the importance of open computational research practices and open access publishing. I like to remind myself that open science is and should be broader than that. Open science at its best is science that is not only transparent, reproducible, and freely available, but it is also science that is inclusive and open to other views. Below I’ve given some description of these various aspects of Open Science and places in my career where I’ve intersected with them.

In this lab, we aim to practice open science in its broadest sense to the best of our ability.

Data and Code Availability

For most research in the STEM disciplines, there has been a big push to train scientists to use reproducible approaches to managing their data (Hart et al. 2016) and analyses (Hampton et al. 2015). Part of this is ensuring, as many journals now require, that the data and code used to generate the analyses in a published article are made available freely with the manuscript or in a public archival repository.

Computational Reproducibility

Even if all code and data are freely available, being able to take the publicly available code and data and use them to re-run the analyses the authors conducted is often quite challenging. This is because the computational environment required to run the code may be quite specific to the individual scientist’s machine and not fully documented. To help mitigate this, analysis code can be made reusable and publicly available as a software package (e.g. an R package I helped write for analysis of terrestrial LiDAR data, Atkins et al. 2018).

Another important approach is having the pipeline or order of code execution well-documented or even better, automated. In all of my undergraduate courses at USF, I introduce students to the use of R and Rmarkdown to generate reproducible reports for their projects in class. For my graduate and undergraduate Bioinformatics courses, we go a step futher and learn the use of the program make for automating complex computational workflows in an efficient manner.

Here’s an example of these approaches in practice. Emma Gibson was an honors undergraduate student in my lab at USF. She completed an honors thesis looking at communities of foliar endophytes in leaves of one of the most common street trees in San Francisco, Metrosideros excelsa. She conducted her analyses and wrote her thesis using R and Rmarkdown, and when we were ready to submit it for publication, we added the required computational environments to reproduce all analyses using Docker and renv. This project is available as a public repository on the Lab GitHub organization.

Preprints

Preprints are completed scientific manuscripts that are publicly posted and assigned a permanent Digital Object Identifier (DOI) through preprint archive servers like bioRxiv or EcoEvoRxiv. These have become much more popular in recent years, because they enable formal scientific exchanges and communication to happen much more rapidly than the months to years that it takes for a manuscript to work its way through peer review. Generally scientists will submit their complete manuscript both to a preprint server and to a journal for formal peer review at the same time, and then once the peer-reviewed article is accepted, the preprint is updated to reflect this fact. As preprints are freely available, posting them also helps advance the next goal, which is free and open access to research articles.

Open Access Publishing

Open access publishing (at its core) is the idea that journal articles should not be locked behind paywalls. As much of the research in the United States (and elsewhere) is publicly funded, it doesn’t seem right to have the results of this work locked up and unavailable to most of the public (and even to many scientists who don’t have the means to afford access). This has become a key issue in the European Union and the US in the last several years, and soon I imagine this will become the status quo.

Open access publishing discussions also often involve a discussion of more open peer review. While it varies between disciplines, peer review in many journals is what is called ‘single blind’, which means that the reviewers know the identity of the author(s), but the author(s) doesn’t know the identities of the reviewers. This opacity can be both good and bad — it could protect more junior scientists from retaliation if they submit a negative review, but it also clearly results in a number of power imbalances that may not lead to optimal outcomes (such as reviewers writing overly critical reviews instead of offering constructive evaluation and feedback). When I was a graduate student, we worked with the editors of many of the journals published by the Ecological Society of America to think about ways that graduate students could become more involved in the review process and to build training programs that would help inculcate best practices at an early stage (Zimmerman, Salguero-Gomez, and Ramos 2011).

International Connections

During the time when I was a graduate student and postdoc, I became quite involved with the Ecological Society of America Student Section, eventually becoming chair of the section. One of the topics that repeatedly came up in discussions at that time was the lack of professional networks for students outside of countries with large professional societies in Ecology (e.g. the US and UK). With a team of other students and collaboration with the International Association for Ecology (INTECOL) we authored a number of articles articulating this problem and proposing some steps forward (Peter Sogaard Jørgensen et al. 2011; White et al. 2015; Peter Søgaard Jørgensen et al. 2015).

Social Equity and Inclusion

In the last several years, I have become involved with a collection of folks called the Microbes and Social Equity Working Group (Ishaq et al. 2021). Through this highly interdisciplinary group, I have helped to organize several sessions at ESA meetings over the years; these sessions were focused on widening the focus of microbial research to include the broader social context of the research as well as its implications for human communities.

References

Atkins, Jeff W., Gil Bohrer, Robert T. Fahey, Brady S. Hardiman, Timothy H. Morin, Atticus E. L. Stovall, Naupaka Zimmerman, and Christopher M. Gough. 2018. “Quantifying Vegetation and Canopy Structural Complexity from Terrestrial Li DAR Data Using the Forestr r Package.” Edited by Sarah Goslee. Methods in Ecology and Evolution 9 (10): 2057–66. https://doi.org/10.1111/2041-210x.13061.
Hampton, Stephanie E., Sean S. Anderson, Sarah C. Bagby, Corinna Gries, Xueying Han, Edmund M. Hart, Matthew B. Jones, et al. 2015. “The Tao of Open Science for Ecology.” Ecosphere 6 (7): art120. https://doi.org/10.1890/ES14-00402.1.
Hart, EM, P Barmby, D LeBauer, F Michonneau, S Mount, P Mulrooney, T Poisot, KH Woo, NB Zimmerman, and JW Hollister. 2016. “Ten Simple Rules for Digital Data Storage.” PLoS Comput Biol 12 (10): e1005097. https://doi.org/10.1371/journal.pcbi.1005097.
Ishaq, Suzanne L., Francisco J. Parada, Patricia G. Wolf, Carla Y. Bonilla, Megan A. Carney, Amber Benezra, Emily Wissel, et al. 2021. “Introducing the Microbes and Social Equity Working Group: Considering the Microbial Components of Social, Environmental, and Health Justice.” Edited by Jack A. Gilbert. mSystems 6 (4). https://doi.org/10.1128/msystems.00471-21.
Jørgensen, Peter Søgaard, Frederic Barraquand, Vincent Bonhomme, Timothy J. Curran, Ellen Cieraad, Thomas G. Ezard, Laureano A. Gherardi, et al. 2015. “Connecting People and Ideas from Around the World: Global Innovation Platforms for Next-Generation Ecology and Beyond.” Ecosphere 6 (4): 1–11. https://doi.org/10.1890/ES14-00198.1.
Jørgensen, Peter Sogaard, Vincent Bonhomme, Thomas HG Ezard, R Andrew Hayes, Timothee Poisot, Roberto Salguero-Gomez, Salvatrice Vizzini, and Naupaka Zimmerman. 2011. “A Global Network of Next Generation Ecologists.” INTECOL e-Bulletin 5 (2): 3–5. https://www.researchgate.net/publication/228328646_A_global_network_of_next_generation_ecologists.
White, Rachel L, Alexandra E Sutton, Roberto Salguero-Gómez, Timothy C Bray, Heather Campbell, Ellen Cieraad, Nalaka Geekiyanage, et al. 2015. “The Next Generation of Action Ecology: Novel Approaches Towards Global Ecological Research.” Ecosphere 6 (8): art134. https://doi.org/10.1890.
Zimmerman, Naupaka B., Rob Salguero-Gomez, and Jorge Ramos. 2011. “The Next Generation of Peer Reviewing.” Frontiers in Ecology and the Environment 9: 199. https://doi.org/10.1890/1540-9295-9.4.199.