Search code examples
metadatacatalog

What are the data catalog option available in open source


I would like to know which is the best data catalog available which serves the following requirements.

  • Open source
  • Highly available behind load balancer
  • Describes the dataset
  • Can describes the data inside the datasets, Ability to have option to describe the fields inside the data
  • Data set searchable

I looked at ckan and was really impressed but it did not offer describing fields inside the datasets

Is there any other good tool for the same


Solution

  • You can actually extend the CKAN and mold it as per your needs. CKAN is a very versatile and flexible product.

    e.g. for describing fields inside CKAN you can use https://github.com/ckan/ckanext-scheming

    or you can create your own extension by following the documentation. https://docs.ckan.org/en/2.8/extensions/