Taming Messy Data: Practical R Wrangling with the Tidyverse

Event Description

This one-hour online training equips participants with data wrangling techniques using R and the tidyverse, a cohesive collection of R packages designed to make data science workflows more intuitive and efficient through consistent syntax and design principles. Designed for both beginners and those looking to refine their skills, the training tackles the challenges of messy datasets.

By the end of this training, attendees will be able to:

  • Demonstrate how to clean messy clinical data using R
  • Implement methods for standardizing text, dates, and numerical values
  • Discuss the different ways to automate data transformations and aggregations using tidyverse functions
  • Transform and organize data using the dplyr and tidyr packages
  • Reshape data, handle missing values, create calculated fields, and prepare clean datasets ready for visualization and analysis

Requirements

Attendees are expected to have a basic understanding of R and RStudio. Before the training, attendees should have:

  • Installed R and RStudio.
  • Gained a basic understanding of R and RStudio.
  • Reviewed our R basics training on the NIH Data Services: On Demand Content YouTube Playlist, if new to R.

Register Here

This page last reviewed on September 23, 2025