This one-hour online training equips participants with powerful data wrangling techniques using R and the tidyverse ecosystem. The tidyverse is a cohesive ecosystem of R packages designed to make data science workflows more intuitive and efficient through consistent syntax and design principles. Designed for both beginners and those looking to refine their skills, this training tackles the challenges of messy datasets.
By the end of this training, attendees will be able to:
- Demonstrate how to clean messy clinical data using R
- Implement methods for standardizing text, dates, and numerical values
- Discuss the different ways to automate data transformations and aggregations using tidyverse functions
- Transform and organize data using the dplyr and tidyr packages
- Reshape data, handle missing values, create calculated fields, and prepare clean datadsets ready for visualization and analysis
Requirements
Attendees are expected to have a basic understanding of R and RStudio. To proceed, attendees should have done the following:
- Installed R and RStudio.
- Have a basic understanding of R and RStudio.
- Reviewed our R basics training on the NIH Data Services: On Demand Content YouTube Playlist, if you are new to R.
Register Here