Search code examples
rmicrosoft-r

Adding a new column to an xdf file


I would like to add a new column to an xdf file using rxDataStep. To achieve this aim, I have written this code:

    rxDataStep(nyc_jan_xdf,transformFunc = CashVsCard )

CashVsCard<-function(dataList)
{

  if(dataList$payment_type==1)
  {
    dataList$cash_vs_Card="Card"
  }
  else
  {
    if(dataList$payment_type==2)
    {
      dataList$cash_vs_Card="Cash"
    }
  }

  return(dataList)
}

then I get this error:

   The variable 'cash_vs_Card' has a different number of rows than other columns in the data: 1 vs. 10
Caught exception in file: CxAnalysis.cpp, line: 3830. ThreadID: 7288 Rethrowing.
Caught exception in file: CxAnalysis.cpp, line: 5347. ThreadID: 7288 Rethrowing.
Error in doTryCatch(return(expr), name, parentenv, handler) : 
  The variable 'cash_vs_Card' has a different number of rows than other columns in the data: 1 vs. 10
In addition: Warning messages:
1: In if (dataList$payment_type == 1) { :
  the condition has length > 1 and only the first element will be used
2: In if (dataList$payment_type == 2) { :
  the condition has length > 1 and only the first element will be used

Solution

  • Use ifelse for vectorised transformations based on the values of another variable.

    cashVsCard <- function(datalist)
    {
          datalist$cash_vs_card <- ifelse(datalist$payment_type == 1, "Card", "Cash")
          datalist
    }
    
    rxDataStep(*, transformFunc=cashVsCard)