What is the best way to index Wikipedia Articles (which contain Geo locations lon/lat) in a Solr Server?
E.g. I have a given lon/lat Position and want to index all Wikipedia Articles around a distance of 60 kilometers.
I could download the whole Wikipedia Dump and write a application which tries to fetch all data in the xml within the given distance of the point. But the dump is about 40GB and this could take a long time. And I have the following condition: I want to keep the data up to date (They should be updated every 48 hours). Is there a partial wiki dump available (e.g. for every country) or a API / Application to use for this case?
Special:Nearby you mentioned in comments used to be powered by Solr, but it now uses Elasticsearch. The extension that provides geospatial search - GeoData - also supports MySQL-based searches which is more practical for small datasets. If you're interested specifically in Solr, you can look how it was done before I killed it because Elasticsearch is ohhh so much nicer.