Data Wrangling Workshop
When: May 13, 2024, 10:00 am - 12:00 pm
Learning Level: Any
To Know
About this Class
This in-person workshop focuses on data wrangling using tidy data principles. Tidy data is a standard way of storing data that facilitates analysis and visualization within the tidyverse ecosystem. We will discuss what makes data "tidy" and cover methods for reshaping your data using dplyr and tidyr functions. Prior to attending this class, you will need to have:
- Installed R and RStudio
- Taken the Introduction to R and RStudio class. If not, here are some resources for getting started:
By the end of this class, attendees will be able to describe the purpose of the dplyr and tidyr packages, select certain columns in a data frame, select certain rows in a data frame according to filtering conditions, and add new columns to a data frame that are functions of existing columns.
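As a rough preview of these three skills, here is a minimal sketch using dplyr. The data frame and its column names (patients, weight_kg, height_m) are hypothetical and made up for illustration; they are not taken from the class materials, and the class itself may use different data and examples.

```r
library(dplyr)

# Hypothetical example data (an assumption, not from the workshop materials)
patients <- tibble(
  id        = 1:4,
  site      = c("A", "B", "A", "B"),
  weight_kg = c(60, 82, 75, 91),
  height_m  = c(1.60, 1.80, 1.75, 1.70)
)

result <- patients %>%
  select(id, weight_kg, height_m) %>%    # keep only certain columns
  filter(weight_kg > 70) %>%             # keep rows that meet a condition
  mutate(bmi = weight_kg / height_m^2)   # add a column computed from existing ones

print(result)
```

Each verb takes a data frame and returns a new one, which is why the steps can be chained with the pipe operator.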
Note on Technology
The NIH Library has 24 pre-configured Windows laptops that you are welcome to use during this training on a first-come, first-served basis. You are also welcome to bring your own laptop (PC or Mac). NIH staff bringing their own NIH laptop can connect to the staff Wi-Fi. Participants bringing a personal laptop must use the NIH-Guest-Network Wi-Fi.
Registrants will receive an email with information and instructions to install and verify access to R and RStudio before the class. If you register the day before the class, you may not have time to download and properly install the necessary software. If you do not have the software installed, you will only be able to follow this training as a demonstration.