These slides available at: https://arcus.github.io/first_steps_in_r_rstudio_skills_series/session_2.html
Arcus is an initiative by the Research Institute aimed at promoting data discovery and reuse and increasing research reproducibility.
Among the many teams in Arcus, I represent Arcus Education!
Arcus Education provides data science training to researchers … (and often this is useful to non-researchers too!).
https://arcus.chop.edu/i-want-to/arcus-education
Email us! arcus-education@chop.edu
Arcus Education provides “Skills Series” for the entire CHOP community.
This Skills Series is a 5-session series aimed at helping you take your first steps in R and RStudio!
Projects and File Ingestion
Goals:
When to use install.packages() and when to use library()
https://posit.cloud is a great place for learning or practice with public (NOT CHOP!) datasets.
Please open your First Steps in R and RStudio Exercises project in Posit.cloud now.
If you did not already set up a Posit.cloud account with the exercise files in the first session of this Series, please do the following, now:
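Work in your own copy of the exercise file (for example, session_2_exercise_janedoe.qmd) rather than in session_2_exercise.qmd. That way, if session_2_exercise.qmd changes, you can get the new version and not mess up your “janedoe” version.
In Posit.cloud, “New Project” is a big blue button: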
In RStudio Desktop on your computer, you have to go to the File menu and choose “New Project”:
tidyverse instead.
Data can be ingested into R from lots of sources:
We’re supplying you with .csv data.
(You probably did this already!)
install.packages("tidyverse") downloads the package (do once)
library(tidyverse) loads the package (do once per session)
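The "once" versus "once per session" distinction can be sketched in code. This is a common pattern rather than something from the slides: it assumes an internet connection and a configured CRAN mirror, and it uses base R's requireNamespace() to skip the download when the package is already installed.

```r
# Download tidyverse only if it is not installed yet (the "do once" step).
# Assumption: a CRAN mirror is configured and the machine is online.
if (!requireNamespace("tidyverse", quietly = TRUE)) {
  install.packages("tidyverse")
}

# Load the package into the current session (the "once per session" step).
library(tidyverse)
```

After restarting R, only the library() line needs to run again; the installed package stays on disk.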
data_frame <- read_csv(file_name)
read_csv ingests a file, creating an object that exists in your R environment.
read_csv is the function name.
file_name is an argument passed to the function.
data_frame is a named object that will receive the output of the function.
<- is the assignment operator that makes what’s on the right be assigned to the named object on the left.
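As a small illustration of the pieces named above, here is a sketch that can run anywhere. It does not use the Series' exercise data: it writes a tiny made-up .csv to a temporary file, so the column names and values are purely hypothetical. It assumes the readr package (part of the tidyverse) is installed.

```r
library(readr)  # read_csv() comes from readr, which library(tidyverse) also loads

# Hypothetical example data, written to a temporary file so the sketch is self-contained.
file_name <- tempfile(fileext = ".csv")
writeLines(c("id,age", "1,34", "2,29"), file_name)

# The function (read_csv) takes the argument (file_name); the assignment
# operator (<-) stores its output in the named object on the left (data_frame).
data_frame <- read_csv(file_name, show_col_types = FALSE)

# The object now exists in the R environment and can be inspected.
nrow(data_frame)
names(data_frame)
```

In your own project, file_name would instead be a path to a real file, written as a quoted string.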
/ means “go into a child directory” (\ in Windows)
/ as the first symbol means “start at the root”
. means “this directory”
.. means “the parent directory of this directory”
~ means “my home directory”
In our team, we like to measure our effectiveness.
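Returning to the path symbols (/, ., .., and ~): the sketch below tries each one from R. The folder and file names (a temporary folder, data, example.csv) are made up for illustration and are not part of the exercise project.

```r
# Build a throwaway folder containing data/example.csv (hypothetical names).
base <- tempfile("paths_demo_")
dir.create(file.path(base, "data"), recursive = TRUE)
writeLines("x", file.path(base, "data", "example.csv"))

# Work from the throwaway folder, remembering where we started.
old <- setwd(base)

in_child <- file.exists("data/example.csv")     # "/" goes into a child directory
with_dot <- file.exists("./data/example.csv")   # "." is this directory

setwd("data")
via_parent <- file.exists("../data/example.csv")  # ".." is the parent directory

home <- path.expand("~")  # "~" expands to your home directory

# Go back to the original working directory.
setwd(old)
```

All three file.exists() calls should return TRUE, because each path points at the same file from a different starting point.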
Goals:
When to use install.packages() and when to use library()
Exploring Data Visually, Using ggplot2
Arcus Education, Children’s Hospital of Philadelphia