Hey Justin,
Please post them. A lot of people are asking for them.
Regards,
Devdatta Tengshe
Ph: 735-358-0782
Can I post shapefile data for India Villages?
--
Datameet is a community of Data Science enthusiasts in India. Know more about us by visiting http://datameet.org
---
You received this message because you are subscribed to the Google Groups "datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email to datameet+u...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Does anyone have a village dataset I can compare to for verification before I upload to github?
>>why people from India aren't posting the data when they have it
I can't answer for others, but I imagine this is because it is difficult to explain how the data came to be in your possession when the original source is usually the Survey of India or some other government department.
>>Can it be hosted on GitHub?
Even though I'm not a lawyer, my understanding is that it can be. In fact, given that you are not based in the country, you would be the best person in the group to do so.
We've also been really keen on publishing the datasets. To answer your question as to why we haven't done it yet:
- Yes, the fear of having to answer the question of how we got the files is one reason.
- Secondly, we're working on cleaning them up and adding census attribute data to them. Like Dev said, the Bhuvan shapefile isn't an exact match to the census list of villages. But we've found that with some cleaning we still get a very usable dataset.
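To make the mismatch between the shapefile and the census list concrete, here is a minimal sketch of the kind of comparison involved. It assumes the village codes have already been pulled out of the shapefile's attribute table and the census list into plain Python lists; the function names and the sample codes are made up for illustration and are not taken from the actual files.

```python
def normalize_code(code):
    """Treat codes as text and drop stray whitespace so '  802693 ' equals '802693'."""
    return str(code).strip()


def compare_codes(shapefile_codes, census_codes):
    """Split village codes into matched, shapefile-only, and census-only groups."""
    shp = {normalize_code(c) for c in shapefile_codes}
    cen = {normalize_code(c) for c in census_codes}
    return {
        "matched": sorted(shp & cen),
        "only_in_shapefile": sorted(shp - cen),
        "only_in_census": sorted(cen - shp),
    }


# Made-up codes, purely for illustration.
sample_shapefile_codes = ["100001", " 100002", "100003"]
sample_census_codes = [100002, "100003", "100004"]

result = compare_codes(sample_shapefile_codes, sample_census_codes)
for group, codes in result.items():
    print(group, len(codes), codes)
```

The two "only in" groups are the ones that need manual cleaning; the matched group is what can take census attributes straight away.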
Once we create thematic visualizations from these, it's easier to show their usefulness and get broader support from academics and civil society (rather than just tech groups) in asking for these datasets to be open.
Take the savethemap campaign, for example. It's hard to spread awareness when not many people understand what impact it could have.
One thing making the cleaning process cumbersome is that not all of the files have both the 2001 and 2011 village codes in their attribute tables. It seems that some of the files we downloaded were a work in progress. For example, when I first downloaded Maharashtra, it didn't have any 2011 census codes. When another of the DM Pune guys downloaded Maharashtra a few weeks later, it had both 2001 and 2011 census codes.
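A quick way to spot which downloads are incomplete is to audit the field names of each attribute table. The sketch below assumes the field names of every file have already been read into a dictionary; the field names `CEN_2001` and `CEN_2011` and the file names are hypothetical, so substitute whatever the real attribute tables use.

```python
# Hypothetical field names; the real attribute tables may name these differently.
REQUIRED_FIELDS = ("CEN_2001", "CEN_2011")


def missing_code_fields(field_names, required=REQUIRED_FIELDS):
    """Return the required census-code fields absent from one attribute table."""
    present = {name.strip().upper() for name in field_names}
    return [field for field in required if field not in present]


def audit_files(fields_by_file):
    """Map each file name to the code fields it lacks, listing only incomplete files."""
    report = {}
    for file_name, field_names in fields_by_file.items():
        missing = missing_code_fields(field_names)
        if missing:
            report[file_name] = missing
    return report


# Made-up example mirroring the two Maharashtra downloads described above.
example = {
    "maharashtra_early.shp": ["NAME", "CEN_2001"],
    "maharashtra_later.shp": ["NAME", "CEN_2001", "CEN_2011"],
}
print(audit_files(example))
```

Any file that shows up in the report is a candidate for re-downloading before cleaning starts.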
Having said all this, it's obviously no good if we do all the cleaning up and then don't have a plan for publishing it. It's about time already! I suggest we discuss the best way to do it within this group. Perhaps Thej and Nisha can comment?
Best
Craig