Search code examples
pythonscikit-learnnmf

How to perform cross validation on NMF Python


I am trying to perform cross-validation on NMF to find the best parameters to use. I tried using the sklearn cross-validation but get an error that states the NMF does not have a scoring method. Could anyone here help me with that? Thank you all


Solution

  • A property of nmf is that it is an unsupervised (machine learning) method. This generally means that there is no labeled data that can serve as a 'golden standard'. In case of NMF you can not define what is the 'desired' outcome beforehand.

    The cross validation in sklearn is designed for supervised machine learning, in which you have labeled data by definition.

    What cross validation does, it holds out sets of labeled data, then trains a model on the data that is leftover and evaluates this model on the held out set. For this evaluation any metric can be used. For example: accuracy, precision, recall and F-measure, and for computing these measures it needs labeled data.