Randomly assign different values to rows using different probabilities in R

I have a data frame like this:

ID var
1  NA
2  NA
3  NA
4  NA
...

I need to randomly set var to A for 20% of the rows, B for 30% of the rows, and C for the remaining 50%.

Is there an efficient way to do this?

Solution

Suppose your data frame is named df. Then you can write:

randvar = sample(c('A','B','C'),size = nrow(df),prob = c(0.2,0.3,0.5),replace = TRUE)
df$var = randvar
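
Because each row is drawn independently, the observed shares will only be close to 20/30/50, not exactly equal to them. A quick way to check this, as a sketch on a hypothetical 1000-row df created here purely for illustration (the seed and the row count are assumptions, not part of the question):

```r
# Hypothetical example data; replace with your own df.
set.seed(42)  # only to make this illustration reproducible
df <- data.frame(ID = 1:1000, var = NA)

# Same call as above: one independent draw per row.
df$var <- sample(c("A", "B", "C"), size = nrow(df),
                 prob = c(0.2, 0.3, 0.5), replace = TRUE)

# Observed share of each label; near, but usually not exactly, 0.2/0.3/0.5.
print(prop.table(table(df$var)))
```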

If you want exactly 20% of the rows to be "A", 30% to be "B", and 50% to be "C", a single line is not enough. Assuming every element of c(0.2,0.3,0.5)*nrow(df) is an integer, my answer is:

n = nrow(df)
df$var = "C"  # initialize every value to "C"
index = 1:n
# round() guards against floating-point error in 0.2*n and 0.3*n
indexa = sample(index, round(0.2*n))  # pick 20% of the indices for "A"
indexb = sample(setdiff(index, indexa), round(0.3*n))  # pick 30% for "B", excluding the indices already used for "A"
df$var[indexa] = "A"  # assign "A" to df$var at indexa
df$var[indexb] = "B"  # assign "B" to df$var at indexb
# the remaining 50% stay "C"
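
An alternative way to get exact counts is to build the full pool of labels first and then shuffle it. This is only a sketch, not part of the answer above: the 10-row df and the names labels, probs and counts are made up for illustration, and the rule of giving any rounding remainder to the last label is an assumption about how you want to handle sizes that do not divide evenly.

```r
# Hypothetical example data with 10 rows; replace with your own df.
df <- data.frame(ID = 1:10, var = NA)

n <- nrow(df)
labels <- c("A", "B", "C")
probs <- c(0.2, 0.3, 0.5)

# Round the counts for all labels but the last, then give the remainder
# to the last label so the counts always sum to n (an assumed convention).
counts <- round(probs[-length(probs)] * n)
counts <- c(counts, n - sum(counts))

# Build the exact pool of labels, then shuffle it across the rows.
df$var <- sample(rep(labels, times = counts))

print(table(df$var))
```

This avoids the index bookkeeping and extends to more than three labels by lengthening labels and probs.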
