azure-machine-learning-service azureml-python-sdk

azureml.contrib.dataset vs azureml.data


It looks like the AzureML Python SDK exposes two Dataset packages:

  1. azureml.contrib.dataset
  2. azureml.data

The documentation doesn't clearly explain the difference between them or when to use which one, and that causes confusion. For example, there are two TabularDataset classes, and their APIs differ:

  1. azureml.data.TabularDataset
  2. azureml.contrib.dataset.TabularDataset

Any suggestions about when to use which package would be helpful.


Solution

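  • According to the package's PyPI page, azureml.contrib.dataset (distributed as azureml-contrib-dataset) has been deprecated, and azureml.data (distributed as part of azureml-core) should be used instead: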

    The azureml-contrib-dataset package has been deprecated and might not receive future updates and removed from the distribution altogether. Please use azureml-core instead.
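
In practice this means new code should import from azureml.data. If you have to support environments where only the old contrib package might be present, something like the following could work as a rough sketch. The helper names (module_available, choose_dataset_module) are made up for illustration and are not part of the AzureML SDK; the only SDK names used are the two module paths from the question.

```python
import importlib.util
import warnings
from typing import Callable, Optional

PREFERRED = "azureml.data"              # ships with azureml-core
DEPRECATED = "azureml.contrib.dataset"  # ships with azureml-contrib-dataset


def module_available(name: str) -> bool:
    """Return True if `name` can be located in the current environment."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        # find_spec raises ModuleNotFoundError when a parent package
        # (here: `azureml`) is not installed at all.
        return False


def choose_dataset_module(
    is_available: Callable[[str], bool] = module_available,
) -> Optional[str]:
    """Hypothetical helper: prefer azureml.data, fall back to the contrib package."""
    if is_available(PREFERRED):
        return PREFERRED
    if is_available(DEPRECATED):
        warnings.warn(
            f"{DEPRECATED} is deprecated; install azureml-core and use {PREFERRED}.",
            DeprecationWarning,
            stacklevel=2,
        )
        return DEPRECATED
    return None  # neither package is installed


if __name__ == "__main__":
    print("Dataset module to use:", choose_dataset_module())
```

If azureml-core is installed, the helper returns "azureml.data", and you can then simply write `from azureml.data import TabularDataset` and drop the contrib import entirely.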