I have a data set with labels that were produced by a k-means clustering algorithm. Now there is some data (with the same data structure) from another source and I wonder what is the most sensible way to label this new, yet unseen data? I was thinking about either
Unfortunately, I couldn't find anything about this particular problem. There are only a few questions about the general use of k-means as a classification model:
Thanks in advance.
Uli
You dont need SVM thing.First way is more convenient.If you are using sklearn https://scikit-learn.org/stable/modules/generated/sklearn.cluster.KMeans.html there is an example here.predict
function will do your job.