Joining data frames with recurring variable

The problem I'm having is that inner_join() creates a new row with all the associated values.

An example:

zip_code <- c("1000", "1000", "1001")
village <- c("village_x", "village_y", "village_z")
villages <- data.frame(cbind(zip_code, village))

zip_code <- c("1000", "1000", "1001")
case <- c("case1", "case2", "case3")
cases <- data.frame(cbind(zip_code, case))

data <- inner_join(villages, cases, by="zip_code")

This solution increases the number of cases, as there are several villages with the same ZIP code.

How can I make it so that villages with the same ZIP code are in the same cell?

Or that the merge only pairs the cases with the first found value?

Solution

@ConnerSexton's solution worked:

data <- inner_join(villages, cases, by="zip_code") %>% group_by(zip_code, case) %>% summarize(village = paste(village, collapse = ', '), .groups = 'drop')

Thanks a lot!

Assign groups in dataframe using vectors containing start and end indexes
Is there a product operator (or work around) in SQLite?
Loading Magick package after tidyverse
Make silhouette icons in tmap
Ignoring NA cases when getting column index of lowest value in row
plm object to LaTeX table
Why is my bslib theme not coloring the navbar as expected?
Adding a footnote to a single row label in a gtsummary table
How to order a data frame by one descending and one ascending column?
dplyr is turning dates to doubles when mutating dataframe
How to display headers from a quarto with a for loop and plotly graphs?
mixed date converted from default mm/dd/yy in excel to dd/mm/yy
Git lfs version https://git-lfs.github.com/spec/v1 oid error
Generate list of lead values
How to find the date a categorical variable was last active?
R - could not find function "cld"
Overlay sf object on ggmap - not aligned, specific to region
Using ggplot and alleffects in tandem
Assign categories from JSON column in data table
Opposite of tidyr::separate, concatenating multiple columns into one
How to Stack Painted Phylogenetic Trees in R Like ggdensitree but with Colored Regimes
Counting number of viable pathways in a network diagram from a specific node input
Report p-value from lme model (random intercept) with gtsummary using tbl_summary() and add_p()
dynamically remove all nav panel from tabsetPanel
If x and y have the same length, do qqplot(x,y) and plot(sort(x),sort(y)) yield the same plot in R?
Applying SPEI::hargreaves function to time series from each pixel of SpatRaster using terra package R?
Turn dataframe with frequency into a frequency table for chi-square test
Print result of a function in R
How to require user authentication in R Shiny before users see any part of the app using shinyauthr?
Determine foreground / background colour in bslib app with dark/light mode