Search code examples
wikipediadbpediawikipedia-api

What is the difference between DBPedia and Wikipedia data?


May I define (or calculate) the difference between DBPedia and Wikipedia data concerning, for example, Ecuadorian People?

Does DBpedia contain the same Ecuadorian People existing in Wikipedia? If not - what is the difference (e.g. how to extract all Ecuadorian People from Wikipedia)?

I could run SPARQL to calculate the number of Ecuadorian People from DBPedia but do not know how to do it for Wikipedia(what approach?)


Solution

  • Wikipedia (specifically, mostly its infoboxes) is the source for the data in DBpedia. And that applies to each specific piece of information too. For example, take the first person on Wikipedia's List of Ecuadorians, Abdón Ubidia (Wikipedia, DBpedia):

    • DBpedia says that his birthPlace is Quito and Ecuador, because Wikipedia says that he was Born in Quito, Ecuador
    • DBpedia says that his nationality is Ecuadorian, because Wikipedia says that his Nationality is Ecuador

    But Wikipedia is not completely consistent. If you take the following lists from Wikipedia:

    • people that were Born somewhere in Ecuador
    • people whose Nationality is Ecuador
    • people in List of Ecuadorians
    • people in subcategories of Category:Ecuadorian people

    then it's very likely you will get 4 different lists of people.