This repository provides materials for a session that is part of the I2DS Tools for Data Science workshop run at the Hertie School, Berlin in November 2022. The student-run workshop is part of the course Introduction to Data Science taught by Simon Munzert at the Hertie School, Berlin, in Fall 2022.
This session will introduce you to the modern data wrangling workflow with R and dplyr. Data wrangling is one of the core steps in the data science workflow. dplyr is a grammar of data manipulation, providing a consistent set of verbs that help you solve the most common data manipulation challenges, including the manipulation of datasets and variables.
The goals of this session are to (1) equip you with conceptual knowledge about the dplyr package and data wrangling workflow, (2) show you the three key verbs of the pacakge, and (3) provide you with practice material as well as some further readings.
- dplyr overview at dplyr.tidyverse.org
- Hands-on dplyr tutorial by Data School on YouTube
- R for Data Science book - part on data wrangling
The material in this repository is made available under the MIT license.
Simon Munzert prepared the practice material and post-processed the recording.
Kermit the Frog prepared the presentation slides and recording. He also provided an example to the practice material.