Survey language and questions

Email to prospective participants

Read the latest version of the recruiting email text (survey_recruiting_email.txt):

survey_recruiting_email <- readr::read_file("survey_recruiting_email.txt")

And then display it:

Dear Penn Stater:

We are a group of faculty who are interested in knowing more about what students, postdocs, staff, faculty, and adminstrators know about open science research and whether they incorporate open practices into their research.

Our research goal is to determine whether there is a need for an initiative at Penn State that provides training and support for open research practices and serves as a focal point for open research here.

To that end, we have devised a short survey that should take you no more than 10 min to complete. The survey is created using a Google Form. Participation is voluntary. You may start the survey and choose not to complete it, or you may choose not to complete specific questions. Unless you decide to provide us with your name and email for follow up, no identifiable information will be collected. There is no compensation for participating and no penalty to you for withdrawing once you have started completing the survey.

Survey link: https://forms.gle/BpK1Zsyw1k3tVnSE7

Once we have analyzed the results, we plan to share them with the Penn State community.

If you have any questions about this research, please contact Rick Gilmore (; 814-865-3664).

Thank you for your consideration.

Yours truly,

Rick Gilmore, Department of Psychology

Frank Hillary, Department of Psychology

Nicole Lazar, Department of Statistics

Briana Wham, University Libraries

Survey introduction

Read the lastest version of the introduction to the survey (survey_intro.txt):

survey_intro <- readr::read_file("survey_intro.txt")

Display the survey intro:

We are asking you to be in a research study. Whether or not you take part is up to you. You can choose not to take part. You can agree to take part and later change your mind. Your decision will not be held against you, and there will be no penalty.

The study involves a taking a brief survey. The survey focuses on data management and sharing practices and principles of open science. You may choose to answer some questions and not others or start the survey and choose not to complete it. Unless you provide us your name and email, your responses are anonymous. If you provide us your name or contact information we may contact you at a later time.

The target audience for the survey is Penn State students, faculty, and staff who are over the age of 18. If you are not 18 and not a Penn State student, faculty member, or staff member, please do not take this survey.

Questions about the survey may be directed to Rick Gilmore (; 814-865-3664). If you have questions regarding your rights as a research subject or concerns regarding your privacy, you may contact the Human Research Protection Program at 814-865-1775.

Survey questions

Note: Here is a useful reference for what language to use for Likert scale questions https://www.extension.iastate.edu/documents/anr/likertscaleexamplesforsurveys.pdf

Read the survey questions from the most recent copy of the downloaded dataset.

suppressPackageStartupMessages(library("tidyverse")) # for pipe %>% operator

# Read local copy of survey
survey <- readr::read_csv("csv/open-science-survey-2022-fall.csv")
## Rows: 104 Columns: 35
## ── Column specification ────────────────────────────────────────────
## Delimiter: ","
## chr (35): Timestamp, What Penn State campus do you represent?, W...
## 
## ℹ Use `spec()` to retrieve the full column specification for this data.
## ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
# Drop the Timestamp since it is provided by Google Forms automatically
survey_questions <- names(survey)[2:dim(survey)[2]]

survey_qs_df <- tibble::tibble(q_num = seq_along(survey_questions), question = survey_questions)

survey_qs_df %>%
  kableExtra::kbl(.) %>%
  kableExtra::kable_classic(.)
q_num question
1 What Penn State campus do you represent?
2 What is your primary department/unit?
3 What is your position at Penn State?
4 How many years have passed since you completed that degree?
5 What are the primary types of digital data that are used in your research? (choose all that apply)
6 Do you collect data that have legal or ethical restrictions governing who may access it or how it may be used?
7 Where do you store data for active projects where data collection and analysis is still ongoing?
8 How important to you is sharing data from active projects with research collaborators at Penn State or outside of Penn State?
9 How convenient is it for you to share data from active projects with research collaborators at Penn State or outside of Penn State?
10 What are the main barriers to sharing data from active projects with research collaborators?
11 How important to you is sharing data from completed projects with the broader research community (i.e., not direct collaborators)?
12 Which of the following obstacles make sharing data with the research community harder for you? Mark all that apply.
13 Do research funders in your field require data sharing?
14 Do journals in your field require data sharing?
15 If you have shared data with the research community, where have you shared it?
16 How well-equipped do you feel you, your colleagues, and trainees are to meet data management and sharing requirements of sponsors/funders or journals?
17 How often do you create computer scripts or data analysis code in the conduct of your research?
18 How often do you share computer scripts or data analysis code with direct research collaborators ?
19 Do you create other kinds of software in the conduct of your research?
20 How often do you use open source code sharing tools (e.g., GitHub, GitLab, BitBucket)?
21 Do funders in your field require code sharing?
22 Do journals in your field require code sharing?
23 How often do you openly share other materials related to your research (protocols, reagents, samples, apparatus, designs, etc.) with other researchers?
24 What is your experience with/knowledge of open science practices?
25 Describe your awareness of the FAIR (findable, accessible, interoperable, reusable) principles pertaining to research data.
26 Do you apply FAIR principles in your own data management and sharing practices?
27 Have you heard of the “reproducibility crisis” in science?
28 Is there a reproducibility crisis in your area of research?
29 How much benefit would you derive from a center at Penn State focused on supporting the adoption of best practices in data management and sharing, code sharing, open science, and reproducible research?
30 Select the services that would most benefit your research if offered by such a center.
31 Any final comments about data management, data sharing, and open science?
32 (Optional) Provide us with your contact information if you would like us to follow up.
33 What is the highest post-secondary degree you have earned?
34 How often do you share computer scripts or data analysis code openly?

Let’s export these questions for transparency and to use as an attachment for IRB submissions:

# Write as text file
fileConn3 <- file("survey_questions.txt")
writeLines(survey_questions, fileConn3)
close(fileConn3)

# Save as CSV
readr::write_csv(survey_qs_df, "csv/open-science-survey-2022-fall-questions.csv")

TODO: Use the Google API to upload and download questions.