What is the correct way to join two dataframes using dplyr?

I'm a bit perplexed by this. Why does the code below not work as I expect? I want the codes in 'data' to replace the NAs in 'data2'. What am I doing wrong?

(I want to use dplyr)

library(dplyr)

data <- data.frame(
  Name = c("Alice", "Bob", "Charlie", "David"),
  Code = c(1234, 5678, 9012, 3456)
)

data2 <- data.frame(
  Name = c("Alice", "Bob", "Charlie", "David", "Alice", "Bob"),
  Code = c(1234, 5678, 9012, 3456, NA, NA)
)


left_join(data2, data, join_by(Name))

Solution

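A join will not replace any missing values. Because both data frames have a Code column, left_join() keeps both and suffixes them as Code.x (from data2) and Code.y (from data). You can achieve your desired result by adding a mutate() step that uses coalesce() to fill the NAs in Code.x with the values from Code.y: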

library(dplyr, warn.conflicts = FALSE)

left_join(data2, data, join_by(Name)) |>
  mutate(
    Code = coalesce(Code.x, Code.y),
    .keep = "unused"
  )
#>      Name Code
#> 1   Alice 1234
#> 2     Bob 5678
#> 3 Charlie 9012
#> 4   David 3456
#> 5   Alice 1234
#> 6     Bob 5678
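
To see why the plain join leaves the NAs in place, it can help to look at the columns it returns. The sketch below rebuilds the example data from the question, inspects the joined column names, and then tries rows_patch() as a possible alternative to the join plus coalesce(). It assumes dplyr 1.1.0 or later (already needed for join_by()) and that each Name appears only once in 'data'. The names 'joined' and 'patched' are just illustrative.

```r
library(dplyr, warn.conflicts = FALSE)

# Same example data as in the question
data <- data.frame(
  Name = c("Alice", "Bob", "Charlie", "David"),
  Code = c(1234, 5678, 9012, 3456)
)
data2 <- data.frame(
  Name = c("Alice", "Bob", "Charlie", "David", "Alice", "Bob"),
  Code = c(1234, 5678, 9012, 3456, NA, NA)
)

# The plain join keeps both Code columns, which is why the NAs survive
joined <- left_join(data2, data, join_by(Name))
names(joined)
#> [1] "Name"   "Code.x" "Code.y"

# Alternative (assumes dplyr >= 1.1.0 and unique Name values in `data`):
# rows_patch() only overwrites NA values in rows matched by the key
patched <- rows_patch(data2, data, by = "Name")
patched
```

With this data, 'patched' should contain the same six rows as the coalesce() result above. Note that rows_patch() raises an error if 'data' contains a Name that is missing from 'data2', unless you pass unmatched = "ignore".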
