r dataframe sorting duplicates multiple-columns

Create numerical discrete values if values in a column equal in R

I have a column of IDs in a dataframe that sometimes has duplicates, take for example,

ID
209
315
109
315
451
209

What I want to do is take this column and create another column that indicates what ID the row belongs to. i.e. I want it to look like,

ID	ID Category
209	1
315	2
109	3
315	2
451	4
209	1

Essentially, I want to loop through the IDs and if it equals to a previous one, I indicate that it is from the same ID, and if it is a new ID, I create a new indicator for it.

Does anyone know is there a quick function in R that I could do this with? Or have any other suggestions?

Solution

Convert to factor with levels ordered with unique (order of appearance in the data set) and then to numeric:

data$IDCategory <- as.numeric(factor(data$ID, levels = unique(data$ID)))

#> data
#   ID IDCategory
#1 209          1
#2 315          2
#3 109          3
#4 315          2
#5 451          4
#6 209          1

Create multiple lagged variables with different offsets
Expanding dataframe to include non existing values
Split string to columns based on paragraph ending from ocr'd image
from magick-image to rasterBrick
How to remove repeated elements in a vector, similar to 'set' in Python
Rename multiple variables at once using dplyr
Reading large multi-part table from file and combing its parts into one tibble
Processing multiple images with Magick (in R) with transformations
R: Convert/Read 3D Matrix into a 'magick' object and vice versa
Error using magick R to import PDF
Method in R to crop whitespace on svg file
Read table from a website into R Studio and create a dataframe with the info
Perspective transformation using R and magick
R magick: Square crop and circular mask
r piping image_annotate doesn't work as expected
reading text portion from list of images and saving in R, using magick
Difference betweeen Fix and Edit in R
Conditional coloring in the Flextable in R
Fast NMF in R on sparse matrices
cannot find -lMagick++-6.Q16
Suppressing output from ImageMagick when calling function from the R Animation package
How to identify relevant strings in this emmeans() output
Why is message() a better choice than print() in R for writing a package?
How can I prevent my computer from crashing when running R-script on large dataset
Crop out circle from image and lay over second image
R: Swap two variables without using a third
Literal curly brackets in gtsummary
Hide "Panel" row from rbinded modelsummary tables
Ignoring NA cases when getting column index of lowest value in row
Is there a way to change the color of the tab_spanner when creating a gt table?