R MASTERCLASS | Q3 2024 (Jul-Sep)
-
WEEK 1 (PREWORK DUE BEFORE FIRST CLASS)
Welcome to the Masterclass2 Topics|2 Quizzes -
Using ChatGPT1 Quiz
-
R basics revision3 Quizzes
-
Learn to Type Fast! (optional)
-
Workshop 1 | Tidyverse and R markdown revision | RMC Q3 2024
-
WEEK 2 (PREWORK DUE WED, 17 JULY)Cohort Notes on the Git Lesson
-
Git & GitHub for R Users1 Topic
-
Workshop 2 | GitHub Practice | RMC Q3 2024
-
WEEK 3 (PREWORK DUE WED, 24 JULY)HTML Documents & Quarto1 Topic
-
Publishing HTMLs with GitHub Pages1 Topic
-
Creating a Personal Website (optional)
-
Workshop 3 | Interactive Plots and HTML with GitHub Pages | RMC Q3 2024
-
WEEK 4 (PREWORK DUE WED, 31 JULY)Joining 1: Intro to Joining Datasets1 Topic
-
Joining 2: Mismatched Values, One-to-Many & Multi-Key Joins1 Topic
-
Workshop 4 | Joining Datasets | RMC Q3 2024
-
WEEK 5 (PREWORK DUE WED, 7 AUG)For Loops1 Topic
-
Functions and Conditionals1 Topic
-
Workshop 5 | For Loops & Functions | RMC Q3 2024
-
WEEK 6 (PREWORK DUE WED, 14 AUG)Dates 1: Recognizing and Formatting Dates1 Topic
-
String Manipulation1 Topic
-
Workshop 6 | Dates & Strings | RMC Q3 2024
-
WEEK 7 (PREWORK DUE WED, 21 AUG)Snippet | The dplyr::across function
-
Data Cleaning 1: Data Diagnostics1 Topic
-
Data Cleaning 2: Fixing Inconsistencies1 Topic
-
Workshop 7 | Data Cleaning | RMC Q3 2024
-
WEEK 8 (PREWORK DUE WED, 28 AUG)Snippet | Intro to Simple Features Data
-
Thematic Maps1 Topic
-
Physical Features1 Topic
-
Density Maps1 Topic
-
Workshop 8 | Geospatial Visualization | RMC Q3 2024
-
WEEK 9 (PREWORK DUE WED, SEP 4)Snippet | Sources of Map Data
-
Shapefile Data1 Topic
-
Boundary Data1 Topic
-
Layers1 Topic
-
Workshop 9 | Advanced Geospatial Visualization | RMC Q3 2024
-
WEEK 10 (PREWORK DUE WED, SEP 11)Dashboards with Quarto1 Topic
-
Workshop 10 | Dashboards with Quarto | RMC Q3 2024
-
FINAL PROJECT (DUE SUN, SEP 29)Final Project | RMC Q3 2024
-
Workshop 11 | Progress Check | RMC Q3 2024
Joining 1: Intro to Joining Datasets
🚗 Video too fast or too slow? Click the gear icon ⚙️ at the bottom-right to change the speed!
Lesson code
Please download the zip folder below to code along with the instructor!
After downloading, unzip the folder, open the RStudio Project file within, then open the “VIDEO_CODE_ALONG” script and type along with the instructor in the video above!
Lesson notes
The written guide below is an accompaniment to the lesson video. You may read through it if you prefer written tutorials to video content.
👋 Before you go, leave an anonymous rating & feedback
Average rating 4.9 / 5. Vote count: 19
No votes so far! Be the first to rate this post.
Please share any positive or negative feedback you may have.
Feedback is completely anonymous
In this code,
what role does selecting by name have? I am asking because I get similar results even when I don’t select by name,
Hello Rachel! In this case, it is indeed true that
by = "name"does not need to be specified, since “name” is the only column that appears in both the datasets that you are joining. In such cases, dplyr will automatically guess that it should join by the variables that appear in both datasets. If you don’t add the by argument, you will get a message saying “Joining withby = join_by(name)“, which is R’s way of telling you that it is automatically choosing a “key” column to join by. This is explain is section 5 of the lesson: https://the-graph-courses.github.io/fdar/FDAR_EN_joining_1/FDAR_EN_joining_1.html#5_Joining_syntax. The by argument becomes important when you have NO common names between two datasets, or if you have MULTIPLE columns in common but you only want to join by specific columns.