Presentation topics

The following are presentation topics for Day 1 and Day 2.

The hands-on topics offered have been shaped by the preferences of prospective participants, as indicated in a May 2023 survey.

Theme: What is open science? Open science @ PSU

What is open science? Why do open science? (Rick Gilmore)

This talk will describe the main practices and principles of open science and describe how these practices and principles can strengthen and enrich the scholarship of researchers at every career stage.

Talk slides: HTML

Open science @ PSU

What initiatives and resources are available at Penn State related to open science?

Keynote (Tim Errington)

Replication of preclinical cancer biology: Challenges or opportunities?

The Reproducibility Project: Cancer Biology was a large-scale systematic investigation into the replicability of high-impact, preclinical cancer biology experiments. Overall, there were challenges at every stage of the research process to design and conduct the replications. It was hard to understand what was originally done, there was not always access to the original data or reagents to conduct the experiments, and model systems frequently did not behave as originally reported. The limited transparency and incomplete reporting made the efforts to replicate the findings much harder than was necessary. Of the replication experiments able to be completed, the evidence was much weaker on average than the original findings even though all the replications underwent peer review before conducting the experiments to maximize their quality and rigor. These findings suggest that there is unnecessary friction in the research process that may be slowing the advancement of knowledge, solutions, and treatments.

Good enough practices for data and project management (Alaina Pearce)

Best practices can be hard to implement, so what open science practices are both useful and “good enough” (Wilson et al., 2017) to have an impact on your work? What are the first steps you can take to make your data more FAIR (Wilkinson et al., 2016)? How can these practices improve your data lifecycle and/or pipeline (e.g., data use and re-use within and between labs)?

How to do open science?

These are some possible presentations for Day 2. Day 2 would be conducted in 3-5 breakout sessions.

Preregistration & registered reports (Frank Hillary & [Hollie Mullin])

What is preregistration? What are registered reports? When and why would you want to use these tools?

Prerequisites: None

Skill level: Novice

Time required: 50 min

Offered: Thu AM Session 2

Replication studies (Kevin McManus)

This session will provide an introduction and overview of the replication process in social science research, focusing on three components: (1) what is replication and why is it important? (2) considerations for selecting a study to replicate (3) recommended practices for executing and writing up a replication study. Taken together, this session seeks to address several questions on the conceptual and practical aspects of doing replication research.

Prerequisites: None

Skill level: Novice

Time required: 50 min

Offered: Thu AM Session 2

Responsible and robust research (Nicole Lazar & Jennifer Valcin)

What are questionable research practices? Why are they questionable? How do we avoid them?

Prerequisites: None

Skill level: Novice

Time required: 50 min

Offered: Thu AM Session 1

Reproducible research with R Markdown and Quarto (Rick Gilmore & Jennifer Valcin)

This workshop will demonstrate how to create reproducible research workflows with R Markdown and its successor, Quarto. These tools can be used to analysis reports, create slides, papers, and web sites (like this one) using the same source materials.

Prerequisites: None

Skill level: Novice

Time required: 50 min

Offered: Thu AM Session 2.

Talk slides: HTML

Data management: Policies (Briana Wham & Alaina Pearce)

How to manage data to meet funder requirements and make sharing with collaborators and other researchers less burdensome.

Prerequisites: None

Skill level: All

Time required: 50 min

Offered: Thurs AM Session 1.

Data management: Practicalities (Alaina Pearce & Briana Wham)

How can you manage data to make sharing with collaborators and other researchers less burdensome? Data management is often learned through trial and error. Therefore, this workshop will provide a set of practices that can make data management and data sharing easier. After a brief overview, this will be a Choose-Your-Own-Adventure workshop so we can dive into the details of data management that will be most helpful for you. Examples include discussing file naming conventions, directory organization for large lab projects, standardized data architectures (e.g., BIDS; MIxS), how to make a data dictionary, and data manipulation/creation of new variables.

Prerequisites: None

Skill level: All

Time required: 50 min

Offered: Thurs PM Session 3.

Where to start? Open science for early career researchers (Daisy Lei, [Hollie Mullin], & Sai Koneru)

How can early career researchers start their open science journey (Allen & Mehler, 2019; Kathawalla et al., 2021)?

Prerequisites: None

Skill level: All

Time required: 50 min

Offered: Thu PM Session 3

Reproducible research with Jupyter notebooks (Mike Burnham)

Jupyter notebooks are another literate programming tool, and also another research super-power.

Prerequisites: None

Skill level: Novice

Time required: 50 min

Offered: Thu PM Session 4.

Talk materials Sample Jupyter notebook

Data & materials sharing: Why, when, where, & how (Rick Gilmore & Bruce Desmarais)

Where can you share data and materials? Where should you share data and materials? When can and should you share data and materials? This workshop will demonstrate how researchers can use Dataverse, the Open Science Framework (OSF), Databrary, and other data and materials repositories both to share data and materials and to re-purpose data and materials shared by others. The workshop will also discuss recent initiatives in some fields that require data and code sharing prior to the publication of an article so that statistical results and figures can be verified.

Prerequisites: None

Skill level: Novice

Time required: 50 min

Offered: Thu PM Session 4

Talk slides: PDF