Search code examples
rdataframesapply

R: Apply function on specific columns preserving the rest of the dataframe


I'd like to learn how to apply functions on specific columns of my dataframe without "excluding" the other columns from my df. For example i'd like to multiply some specific columns by 1000 and leave the other ones as they are.

Using the sapply function for example like this:

    a<-as.data.frame(sapply(table.xy[,1], function(x){x*1000}))

I get new dataframes with the first column multiplied by 1000 but without the other columns that I didn't use in the operation. So my attempt was to do it like this:

    a<-as.data.frame(sapply(table.xy, function(x) if (colnames=="columnA") {x/1000} else {x}))

but this one didn't work.

My workaround was to give both dataframes another row with IDs and later on merge the old dataframe with the newly created to get a complete one. But I think there must be a better solution. Isn't it?


Solution

  • If you only want to do a computation on one or a few columns you can use transform or simply do index it manually:

    # with transfrom:
    df <- data.frame(A = 1:10, B = 1:10)
    df <- transform(df, A = A*1000)
    
    # Manually:
    df <- data.frame(A = 1:10, B = 1:10)
    df$A <- df$A * 1000