How to keep only unique rows but ignore a column?

If I have this data:

df1 <- data.frame(name = c("apple", "apple", "apple", "orange", "orange"),
       ID = c(1, 2, 3, 4, 5),
       is_fruit = c("yes", "yes", "yes", "yes", "yes"))

and I want to keep only the unique rows, but ignore the ID column such that the output looks like this:

df2 <- data.frame(name = c("apple", "orange"),
       ID = c(1, 4),
       is_fruit = c("yes", "yes"))

df2
#    name ID is_fruit
#1  apple  1      yes
#2 orange  4      yes

How can I do this, ideally with dplyr?

Solution

You can use distinct function; By specifying the variables explicitly, you can retain unique rows just based on these columns; And also from ?distinct:

If there are multiple rows for a given combination of inputs, only the first row will be preserved

distinct(df1, name, is_fruit, .keep_all = T)
#    name ID is_fruit
#1  apple  1      yes
#2 orange  4      yes

Calculate average distance to coastline in R
How to use purrr:map() and rlang to emulate a pipe chain
magick annotate picture with arrows
Import png files and convert to animation(.mp4) in R
Convert a column with text files into separate images in R
How to calculate RSE_Var from SE_var/mean_Var row-wise for many variables, Var, using pivot() in R?
ggplot2 x-axis with many hours for each of many days. Is there a way to span the dates in the x-axis over the hours for that day?
gtsummary - Wilcoxon on ordered factor
R odbc::odbcListDrivers() does not list dirver in /opt/homebrew/etc/odbcinst.ini
Change size and aspect ratio without distortion
How can I get an R environment via Sys.getenv() with GitHub Actions using secrets?
ggplot geom_point color based on both x and y axis values
Perform a random binomial draw for each row in R without rowwise()
Scrape the university name (in QS World University Rankings website) with R
ggplot for linear-log regression model?
How to join (merge) data frames (inner, outer, left, right)
Avoid rescaling while binning using scale_*_steps
How can I mock a function globally using testthat?
R - could not find function "cld"
Create multiple lagged variables with different offsets
Expanding dataframe to include non existing values
Split string to columns based on paragraph ending from ocr'd image
from magick-image to rasterBrick
How to remove repeated elements in a vector, similar to 'set' in Python
Rename multiple variables at once using dplyr
Reading large multi-part table from file and combing its parts into one tibble
Processing multiple images with Magick (in R) with transformations
R: Convert/Read 3D Matrix into a 'magick' object and vice versa
Error using magick R to import PDF
Method in R to crop whitespace on svg file