Back to Course

R MASTERCLASS | Q3 2024 (Jul-Sep)

0% Complete
0/0 Steps
  1. WEEK 1 (PREWORK DUE BEFORE FIRST CLASS)

    Welcome to the Masterclass
    2 Topics
    |
    2 Quizzes
  2. Using ChatGPT
    1 Quiz
  3. R basics revision
    3 Quizzes
  4. Learn to Type Fast! (optional)
  5. Workshop 1 | Tidyverse and R markdown revision | RMC Q3 2024
  6. WEEK 2 (PREWORK DUE WED, 17 JULY)
    Cohort Notes on the Git Lesson
  7. Git & GitHub for R Users
    1 Topic
  8. Workshop 2 | GitHub Practice | RMC Q3 2024
  9. WEEK 3 (PREWORK DUE WED, 24 JULY)
    HTML Documents & Quarto
    1 Topic
  10. Publishing HTMLs with GitHub Pages
    1 Topic
  11. Creating a Personal Website (optional)
  12. Workshop 3 | Interactive Plots and HTML with GitHub Pages | RMC Q3 2024
  13. WEEK 4 (PREWORK DUE WED, 31 JULY)
    Joining 1: Intro to Joining Datasets
    1 Topic
  14. Joining 2: Mismatched Values, One-to-Many & Multi-Key Joins
    1 Topic
  15. Workshop 4 | Joining Datasets | RMC Q3 2024
  16. WEEK 5 (PREWORK DUE WED, 7 AUG)
    For Loops
    1 Topic
  17. Functions and Conditionals
    1 Topic
  18. Workshop 5 | For Loops & Functions | RMC Q3 2024
  19. WEEK 6 (PREWORK DUE WED, 14 AUG)
    Dates 1: Recognizing and Formatting Dates
    1 Topic
  20. String Manipulation
    1 Topic
  21. Workshop 6 | Dates & Strings | RMC Q3 2024
  22. WEEK 7 (PREWORK DUE WED, 21 AUG)
    Snippet | The dplyr::across function
  23. Data Cleaning 1: Data Diagnostics
    1 Topic
  24. Data Cleaning 2: Fixing Inconsistencies
    1 Topic
  25. Workshop 7 | Data Cleaning | RMC Q3 2024
  26. WEEK 8 (PREWORK DUE WED, 28 AUG)
    Snippet | Intro to Simple Features Data
  27. Thematic Maps
    1 Topic
  28. Physical Features
    1 Topic
  29. Density Maps
    1 Topic
  30. Workshop 8 | Geospatial Visualization | RMC Q3 2024
  31. WEEK 9 (PREWORK DUE WED, SEP 4)
    Snippet | Sources of Map Data
  32. Shapefile Data
    1 Topic
  33. Boundary Data
    1 Topic
  34. Layers
    1 Topic
  35. Workshop 9 | Advanced Geospatial Visualization | RMC Q3 2024
  36. WEEK 10 (PREWORK DUE WED, SEP 11)
    Dashboards with Quarto
    1 Topic
  37. Workshop 10 | Dashboards with Quarto | RMC Q3 2024
  38. FINAL PROJECT (DUE SUN, SEP 29)
    Final Project | RMC Q3 2024
  39. Workshop 11 | Progress Check | RMC Q3 2024
Lesson 13 of 39
In Progress

Joining 1: Intro to Joining Datasets

🚗 Video too fast or too slow? Click the gear icon ⚙️ at the bottom-right to change the speed!


Lesson code

Please download the zip folder below to code along with the instructor!

After downloading, unzip the folder, open the RStudio Project file within, then open the “VIDEO_CODE_ALONG” script and type along with the instructor in the video above!

Lesson notes

The written guide below is an accompaniment to the lesson video. You may read through it if you prefer written tutorials to video content.

👋 Before you go, leave an anonymous rating & feedback

Average rating 4.9 / 5. Vote count: 19

No votes so far! Be the first to rate this post.

Please share any positive or negative feedback you may have.

Feedback is completely anonymous

2 Comments
Newest
Oldest Most Voted
Inline Feedbacks
View all comments
Rachel

In this code,

left_join(x = test_info_xavier, y = demographic, by = "name")

what role does selecting by name have? I am asking because I get similar results even when I don’t select by name,

Joy

Hello Rachel! In this case, it is indeed true that by = "name" does not need to be specified, since “name” is the only column that appears in both the datasets that you are joining. In such cases, dplyr will automatically guess that it should join by the variables that appear in both datasets. If you don’t add the by argument, you will get a message saying “Joining with by = join_by(name)“, which is R’s way of telling you that it is automatically choosing a “key” column to join by. This is explain is section 5 of the lesson: https://the-graph-courses.github.io/fdar/FDAR_EN_joining_1/FDAR_EN_joining_1.html#5_Joining_syntax. The by argument becomes important when you have NO common names between two datasets, or if you have MULTIPLE columns in common but you only want to join by specific columns.

2
0
Questions or comments?x