02:00
These slides available at: https://arcus.github.io/first_steps_in_r_rstudio_skills_series/session_2.html
Arcus is an initiative by the Research Institute aimed at promoting data discovery and reuse and increasing research reproducibility.
Among the many teams in Arcus, I represent Arcus Education!



Arcus education provides data science training to researchers …
(and often this is useful to non-researchers too!).
https://arcus.chop.edu/i-want-to/arcus-education
Email us! arcus-education@chop.edu
Arcus Education provides “Skills Series” for the entire CHOP community.
This Skills Series is a 5-session series aimed at helping you take your first steps in R and RStudio!
Projects and File Ingestion
Goals:
install.packages() and when to use library()https://posit.cloud is a great place for learning or practice with public (NOT CHOP!) datasets.
Please open your First Steps in R and RStudio Exercises project in Posit.cloud now.
If you did not already set up a Posit.cloud account with the exercise files in the first session of this Series, please do the following, now:
02:00
session_2_exercise_janedoe.qmd)session_2_exercise.qmd you can get the new version and not mess up your “janedoe” versionIn Posit.cloud, “New Project” is a big blue button: 
In RStudio Desktop on your computer, you have to go to the File menu and choose “New Project”: 
tidyverse instead.Data can be ingested into R from lots of sources:
We’re supplying you with .csv data.


(You probably did this already!)

01:00

install.packages("tidyverse") downloads the package (do once)
library(tidyverse) loads the package (do once per session)
data_frame <- read_csv(file_name)
read_csv ingests a file, creating an object that exists in your R environmentdata_frame <- read_csv(file_name)
read_csv is the function namedata_frame <- read_csv(file_name)
read_csv is the function namefile_name is an argument passed to the function.data_frame <- read_csv(file_name)
read_csv is the function namefile_name is an argument passed to the function.data_frame is a named object that will receive the output of the function.data_frame <- read_csv(file_name)
read_csv is the function namefile_name is an argument passed to the function.data_frame is a named object that will receive the output of the function.<- is the assignment operator that makes what’s on the right be assigned to the named object on the right

12:00
/ means “go into a child directory” (\ in Windows)/ as the first symbol means “start at the root”. means “this directory”.. means “the parent directory of this directory”~ means “my home directory”In our team, we like to measure our effectiveness.
Goals:
install.packages() and when to use library()Exploring Data Visually, Using ggplot2

Arcus Education, Children’s Hospital of Philadelphia