Search code examples
cluster-analysisrapidminer

K-medioids with Dynamic Time Warping in RapidMiner


How to perform K-medioids clustering with Dynamic Time Warping as a distance measure in RapidMiner?

The idea with Dynamic Time Warping is to perform it on time series of different length. How can I do that in RapidMiner? I get this error message

The data contains missing values which is not allowed for KMediods

How can I cluster time series of different length?


Solution

  • You could fill the missing values with zeroes. The operator Replace Missing Values does this. I don't know the details of your data nor how RapidMiner calculates DTW distances so I therefore can't tell if this approach would yield valid results.

    Faced with this, I might use the R extension with the dtw and cluster packages to investigate how distances between different length time series could be used to make clusters. Once you have R working, you can call it from RapidMiner.