Extracting number of unique observations using a string

I have a column like this:

data.frame(x = c("ABC1","ABD1","ABE1","ABF1","ABG1","ABC2","ABC2","ABF2","ABE2"))

I want to find out how many unique observations there are which contain "AB" and a letter. So ABC1 and ABC2 are not unique but ABC1 and ABD1 are.

In this example, there would be 5 unique observations.

Solution

You can select only the first 3 characters for each word. Then count the number of unique occurrences.

df = data.frame(x = c("ABC1","ABD1","ABE1","ABF1","ABG1","ABC2","ABC2","ABF2","ABE2"),stringsAsFactors = FALSE)

length(unique(substr(df$x,1,3)))
5

Calculate transition probabilities
How to subset R dataframe based on specific values in several columns?
Change colon to period in figure caption in Quarto
Applying different functions to different groups using 2 data frames
Disable button for 5 seconds with different label and change after it
How to group columns with an odd index, based on columns with an even index?
Different plots of marginal effects with interaction term and offset when using different packages
Animate a MWE GIF in R
populating a matrix by rows
How to relevel in ROC curve in R/plotROC?
R - glmer different results on different machines (non-deterministic)
Dynamically operate on column (inside :=)
Random change in obj_addr() output when including the objects into a list and vectorizing over them
Add to curly-curly argument
h2o predict error: Test/Validation dataset has no columns in common with the training set
Is is possible to convert a dataframe object to a tribble constructor?
encoding issues in R
How to Scrape NBA stats page using rvest
Conversion of Decimal Minutes Seconds into Decimal Degrees when cells within the Columns Contains the Factor 'Missed' in R
Difficulty using case_when() to add column that, conditionally, pastes value from another column
Flip the matrix
Creating test for date columns incorrectly throwing error
Exists() and getSymbols() can't find symbol listed in yahoo
spearman's rho plot
Use OpenMP on M2 Mac with R and data.table
How to find the connected instances from a minimum spanning trees model in R
Hide a column in shiny datatable but keep it searchable
How to extract data from an edited datatable in shiny?
Render Data Table with double header
filter dataTables in Shiny Dashboad based on selectInput Values