Create column representing instance rather than total count

Let's say I have the following dataframe:

ID <- c(15, 25, 90, 1, 23, 543)

animal <- c("fish", "dog", "fish", "cat", "dog", "fish")

df <- data.frame(ID, animal)

How can I create a third column giving the running instance (from top to bottom) of each repeated animal? For example, a column "Instance" with the values (1, 1, 2, 1, 2, 3). I know I can use group_by to get the total count, but that is not what I'm after. Thanks.

Solution

You need row_number() by groups of animal.

library(dplyr)

df %>%
  mutate(Instance = row_number(), .by = animal)

#    ID animal Instance
# 1  15   fish        1
# 2  25    dog        1
# 3  90   fish        2
# 4   1    cat        1
# 5  23    dog        2
# 6 543   fish        3
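
The .by argument of mutate() is only available from dplyr 1.1.0 onward. On an older version, the same idea can be sketched with an explicit group_by() and ungroup(). This is a minimal sketch that rebuilds the example data from the question; the name result is only illustrative.

```r
library(dplyr)

# Example data from the question
df <- data.frame(
  ID = c(15, 25, 90, 1, 23, 543),
  animal = c("fish", "dog", "fish", "cat", "dog", "fish")
)

result <- df %>%
  group_by(animal) %>%               # one group per animal
  mutate(Instance = row_number()) %>% # running count within each group
  ungroup()                          # drop the grouping, keep original row order
```

Unlike .by, group_by() leaves the data grouped until ungroup() is called, so the final step matters if later operations should run on the whole table.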

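With base R, you can use ave. Pass a numeric vector as the first argument so the result is numeric; with df$animal as the first argument, the counts come back as character strings ("1", "1", "2", ...).

ave(seq_along(df$animal), df$animal, FUN = seq_along)

# [1] 1 1 2 1 2 3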
