Applying undersampling techniques to train and test data

I know that if you fit() a transformation on the training set, you then have to transform() both the training set and the test set.

Suppose you apply a targeted undersampling technique such as TomekLinks to your training data to help the model better identify/separate the classes.

Question: If you are going to use the model to predict against a test set, do you also apply the same undersampling technique to the test set? Or is the undersampling used only on the training set, to help the model clarify class boundaries, with the trained model then applied to the full test set?

Solution

I don't think you should undersample your test data. While it is perfectly reasonable to do it to the training data, doing it to the test data is unrealistic, because the test set is meant to stand in for the data the model will see in use. If the model is intended for any online application, it needs to be tested on the real, unbalanced dataset.
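
To make the workflow concrete, here is a minimal sketch using only the Python standard library. Everything in it is an assumption for illustration: the toy data, the 75/25 split, the 1-nearest-neighbour classifier, and the hand-rolled `tomek_link_undersample` helper, which is a simplified stand-in for a library routine such as imbalanced-learn's `TomekLinks` (used there via `fit_resample`). The point it shows is that resampling touches only the training split, while predictions are made on the untouched test split.

```python
import math
import random


def tomek_link_undersample(X, y, majority_label):
    """Drop majority-class points that take part in a Tomek link.

    Simplified illustration for binary labels, not a library implementation.
    """
    def nearest(i):
        # Index of the closest other point (Euclidean distance).
        return min((j for j in range(len(X)) if j != i),
                   key=lambda j: math.dist(X[i], X[j]))

    nn = [nearest(i) for i in range(len(X))]
    drop = set()
    for i, j in enumerate(nn):
        # Tomek link: mutual nearest neighbours with different labels.
        if nn[j] == i and y[i] != y[j]:
            drop.add(i if y[i] == majority_label else j)
    keep = [i for i in range(len(X)) if i not in drop]
    return [X[i] for i in keep], [y[i] for i in keep]


def predict_1nn(X_fit, y_fit, point):
    """Label of the nearest fitted point (hypothetical stand-in for a model)."""
    j = min(range(len(X_fit)), key=lambda k: math.dist(point, X_fit[k]))
    return y_fit[j]


random.seed(0)
# Hypothetical toy data: 200 majority (label 0) and 40 minority (label 1) points.
X = [(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(200)] + \
    [(random.gauss(1.5, 1), random.gauss(1.5, 1)) for _ in range(40)]
y = [0] * 200 + [1] * 40

# Split first, so the test set keeps its original class balance.
idx = list(range(len(X)))
random.shuffle(idx)
split = int(0.75 * len(idx))
X_train = [X[i] for i in idx[:split]]
y_train = [y[i] for i in idx[:split]]
X_test = [X[i] for i in idx[split:]]
y_test = [y[i] for i in idx[split:]]

# Undersample the training split only.
X_res, y_res = tomek_link_undersample(X_train, y_train, majority_label=0)

# Predict on the full, untouched test split.
y_pred = [predict_1nn(X_res, y_res, p) for p in X_test]
accuracy = sum(p == t for p, t in zip(y_pred, y_test)) / len(y_test)
print(f"train: {len(X_train)} -> {len(X_res)} after undersampling, "
      f"test: {len(X_test)} (unchanged), accuracy: {accuracy:.2f}")
```

Note that a resampler is not like a scaler here: it changes which rows exist rather than how each row is represented, so there is nothing to "transform" on the test set. On an imbalanced test set, accuracy alone can be misleading, so a per-class metric such as recall on the minority class is probably worth reporting as well.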
