Convert data frame of snp genotypes in numeric matrix

snp1 <- c("AA", "AT", "AA", "TT", "AA", "AT", "AA", "AA", "AA", "AT")
snp2 <- c("GG", "GC", "GG", "CC", "CC", "GC", "GG", "GG", "GG", "GC")
df1 <- data.frame(snp1, snp2)

num1 <- c(1, 2, 1, 3, 1, 2, 1, 1, 1, 2)
num2 <- c(1, 2, 1, 3, 3, 2, 1, 1, 1, 2)
df2 <- data.frame(num1, num2)

This is done in R. I have an object df1, which I want to convert to df2. For each column in df1, the most common value is converted to 1, the second most common value to 2, etcetera. How do I do this efficiently?

Solution

Variation on a theme:

lapply(df1, function(x) match(x, levels(x)[order(-table(x))]) )
#$snp1
# [1] 1 2 1 3 1 2 1 1 1 2
#
#$snp2
# [1] 1 2 1 3 3 2 1 1 1 2

R Language - Extracting the correct Data Type in a PDF Table
Comparing the values of a certain number previous rows with the current row
rpart package installation in R
An efficient way to assign value based on a min-max range and category
Change output of the `purrr::map` function
osmdata_sf returns failed to perform HTTP request curl::curl_fetch_memory() error in R?
Comparing nls() to nls2() - what am I doing wrong
How to add "variables grid" below ggplot
How can I use predefined code snippets outside of code chunks in Quarto within RStudio/Posit?
Wrap text for collapse rows in KableExtra for a long table in R
Implementation of Breusch-Pagan test for random effects in plm with unbalanced panels
Finding a value of a dataset in different ones
Replicate matrix
Unexpected results after converting raster data from geographic to projected coordinate system using the terra package
How to remove rows by condition in R?
How do I add an alias for magrittr pipe from R in vscode
Package ‘neuralnet’ in R, rectified linear unit (ReLU) activation function?
Sub-subtitle in a graph made with `ggplot2`
How can I execute a statement and ignore warnings with tryCatch?
Enumerate events where n consecutive values are not NA
Serialize/deserialize a column with R and DuckDB
Putting multiple plots on the same page in R?
NA values in a non-editable date column in a datatable in a shiny app change to "Invalid Date" when clicked on
How to enable/disable checkboxInput when certain panel is selected
Writing robust R code: namespaces, masking and using the `::` operator
Replacing with conditional value in dplyr case_when()
How to assign pre-determined RGB values to polygons
python/pandas equivalent to dplyr 1.0.0 summarize(across())
Calculating moving average
Estimating non-monotonic bi-exponential curve fit