Query about the 2011 Districts shapefile

Ashim Kapoor

Dec 8, 2021, 12:48:32 AM
to data...@googlegroups.com
Dear All,

I am working with the Districts shapefile using R.

The file is located here:

https://github.com/datameet/maps/tree/master/Districts

Here are some records from this shapefile:

> districts.map.with.crs %>% head
Simple feature collection with 6 features and 5 fields
Geometry type: MULTIPOLYGON
Dimension: XY
Bounding box: xmin: 71.60734 ymin: 18.33075 xmax: 93.20723 ymax: 27.40883
CRS: EPSG:4326
    district          st_nm st_cen_cd dt_cen_cd censuscode
1   Adilabad Andhra Pradesh        28         1        532
2       Agra  Uttar Pradesh         9        15        146
3  Ahmadabad        Gujarat        24         7        474
4 Ahmadnagar    Maharashtra        27        26        522
5     Aizawl        Mizoram        15         3        283
6      Ajmer      Rajasthan         8        21        119
                           SHAPE
1 MULTIPOLYGON (((78.84972 19...
2 MULTIPOLYGON (((78.19803 27...
3 MULTIPOLYGON (((72.03456 23...
4 MULTIPOLYGON (((74.67333 19...
5 MULTIPOLYGON (((92.98749 24...
6 MULTIPOLYGON (((74.89253 26...
>

I have a question:

In the Districts shapefile:

st_cen_cd is the 2011 census code for States.
dt_cen_cd is the 2011 census code for Districts.
censuscode is the 2001 census code for Districts.

Is the above correct? In particular, is censuscode the 2001 census
code for a District? Is this documented anywhere?

Many thanks,
Ashim

Ashim Kapoor

Dec 8, 2021, 2:05:04 AM
to data...@googlegroups.com
Dear All,

I was a little confused in my previous message.

Extracts from the 2001 and 2011 shapefiles are included below as a snippet.

I think censuscode in the 2011 shapefile = Census 2011 code for a District.
I further think DT_CEN_CD in the 2001 or 2011 shapefiles = Census 2001
code for a District.

I would like to confirm this.

Best,
Ashim

----8<-----snippet---->8-----

> st_read("Census_2011/2011_Dist.shp") %>% filter(DISTRICT=="Ahmadnagar")
Reading layer `2011_Dist' from data source
`/home/ashim/Documents/my-workspace/space-to-verify-state-census-codes-from-districts-shapefile/maps/Districts/Census_2011/2011_Dist.shp'
using driver `ESRI Shapefile'
Simple feature collection with 641 features and 5 fields
Geometry type: MULTIPOLYGON
Dimension: XY
Bounding box: xmin: 68.18625 ymin: 6.755953 xmax: 97.41529 ymax: 37.07827
CRS: 4326
Simple feature collection with 1 feature and 5 fields
Geometry type: MULTIPOLYGON
Dimension: XY
Bounding box: xmin: 73.62021 ymin: 18.33075 xmax: 75.58884 ymax: 19.98771
CRS: 4326
    DISTRICT       ST_NM ST_CEN_CD DT_CEN_CD censuscode
1 Ahmadnagar Maharashtra        27        26        522
                        geometry
1 MULTIPOLYGON (((74.67333 19...
> st_read("Census_2001/2001_Dist.shp") %>% filter(DISTRICT=="Ahmadnagar")
Reading layer `2001_Dist' from data source
`/home/ashim/Documents/my-workspace/space-to-verify-state-census-codes-from-districts-shapefile/maps/Districts/Census_2001/2001_Dist.shp'
using driver `ESRI Shapefile'
Simple feature collection with 594 features and 4 fields
Geometry type: MULTIPOLYGON
Dimension: XY
Bounding box: xmin: 68.18625 ymin: 6.755953 xmax: 97.41529 ymax: 37.07827
CRS: 4326
Simple feature collection with 1 feature and 4 fields
Geometry type: MULTIPOLYGON
Dimension: XY
Bounding box: xmin: 73.62021 ymin: 18.33075 xmax: 75.58884 ymax: 19.98771
CRS: 4326
        ST_NM ST_CEN_CD DT_CEN_CD   DISTRICT                       geometry
1 Maharashtra        27        26 Ahmadnagar MULTIPOLYGON (((74.67333 19...
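
One way to test this for every district, rather than for Ahmadnagar alone, is to join the two attribute tables on state and district name and compare DT_CEN_CD row by row. The sketch below is base R on toy one-row tables copied from the snippet above. The names `d2011`, `d2001` and `same_dt_code` are made up for illustration. With the real files, the two data frames would presumably come from `st_read()` followed by `sf::st_drop_geometry()`.

```r
# Toy attribute tables: the single row in each is copied from the snippet above.
d2011 <- data.frame(DISTRICT = "Ahmadnagar", ST_NM = "Maharashtra",
                    ST_CEN_CD = 27L, DT_CEN_CD = 26L, censuscode = 522L)
d2001 <- data.frame(ST_NM = "Maharashtra", ST_CEN_CD = 27L,
                    DT_CEN_CD = 26L, DISTRICT = "Ahmadnagar")

# Inner join on state and district name; the other shared columns
# get a suffix naming the shapefile they came from.
both <- merge(d2011, d2001, by = c("ST_NM", "DISTRICT"),
              suffixes = c(".2011", ".2001"))

# TRUE where the 2011 shapefile carries the same district code as the 2001 one.
both$same_dt_code <- both$DT_CEN_CD.2011 == both$DT_CEN_CD.2001
print(both[, c("ST_NM", "DISTRICT", "DT_CEN_CD.2011",
               "DT_CEN_CD.2001", "same_dt_code")])
```

If the codes agree wherever the names match, that would suggest the 2011 file kept the 2001 district codes. Districts renamed or created between the two censuses drop out of this inner join and would need checking separately.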

Devdatta Tengshe

Dec 8, 2021, 3:16:54 AM
to data...@googlegroups.com
The 'censuscode' is just a sequential ID, going from 0 (for POK Areas) to 640.

As far as I know, it has nothing to do with either the 2001 or the 2011 Census codes.
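
A quick way to check the "sequential ID" reading is to test whether a code column is a gap-free run of integers with no repeats. This is a minimal base R sketch; `is_sequential_id` is a hypothetical helper, not part of sf or dplyr, and it would be called on the censuscode column of the attribute table.

```r
# Hypothetical helper: TRUE when `codes` has no missing values, no repeats,
# and covers every integer between its minimum and its maximum.
is_sequential_id <- function(codes) {
  codes <- as.integer(codes)
  if (length(codes) == 0 || anyNA(codes)) return(FALSE)
  anyDuplicated(codes) == 0 && length(codes) == diff(range(codes)) + 1
}

is_sequential_id(0:640)       # a gap-free run from 0 to 640
is_sequential_id(c(0, 1, 3))  # 2 is missing, so this is not a gap-free run
```

A gap-free run does not by itself rule out a Census code, since a census could also number its districts consecutively, so this check only narrows the question.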




Regards,
Devdatta


--
Datameet is a community of Data Science enthusiasts in India. Know more about us by visiting http://datameet.org
---
You received this message because you are subscribed to the Google Groups "datameet" group.
To unsubscribe from this group and stop receiving emails from it, send an email to datameet+u...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/datameet/CAC8%3D1eoNrTUkrKUWNUO67904wqR4qo1kzEYk319ZYQ2%2Bkez3nQ%40mail.gmail.com.

Ashim Kapoor

Dec 8, 2021, 4:19:39 AM
to data...@googlegroups.com
Dear Devdatta,

Many thanks for your reply.

The website:

https://lgdirectory.gov.in

has Census 2001, Census 2011 codes for ALL Districts of India.

Example on this website:

For the State of Maharashtra, the districts Ahmednagar, Akola and
Amrawati have Census 2011 codes 522, 501 and 503 respectively.

In the 2011 shapefile, the `censuscode` values 522, 501 and 503 correspond
to the Districts Ahmadnagar, Akola and Amrawati.

I have done an "outer join" of District names by
censuscode=Census2011.Code and I get 531 / 641 matches.

Please see the attached text file. I do not think this is happening by
"chance". I do think that the censuscode column in the 2011 Districts
shapefile holds the Census 2011 codes for Districts.
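
The join described above can be sketched in base R. Only the three code/name pairs quoted in this message are used; `shp`, `lgd` and `lgd_name` are hypothetical names. The matching rule behind the 531 / 641 figure is not stated, so the two rules below (case-insensitive equality, and at most one differing character via `adist()`) are assumptions.

```r
# Toy stand-ins for the shapefile attributes and the lgdirectory table.
shp <- data.frame(censuscode = c(522L, 501L, 503L),
                  district   = c("Ahmadnagar", "Akola", "Amrawati"))
lgd <- data.frame(Census2011.Code = c(522L, 501L, 503L),
                  lgd_name        = c("Ahmednagar", "Akola", "Amrawati"))

# Full outer join on the code: all = TRUE keeps codes found in only one table.
joined <- merge(shp, lgd, by.x = "censuscode", by.y = "Census2011.Code",
                all = TRUE)

# Rows where both sides of the join supplied a name.
ok <- !is.na(joined$district) & !is.na(joined$lgd_name)
a  <- tolower(joined$district)
b  <- tolower(joined$lgd_name)

# Exact match ignores case only; near match also allows one edited character.
joined$exact <- ok & a == b
joined$near  <- ok & mapply(function(x, y) adist(x, y)[1, 1], a, b) <= 1

n_exact <- sum(joined$exact)
n_near  <- sum(joined$near)
```

Spelling variants such as Ahmadnagar / Ahmednagar fail the exact comparison even though the codes agree, which may account for part of the gap between 531 and 641.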

Best,
Ashim
matching-names.csv

Devdatta Tengshe

Dec 8, 2021, 4:42:42 AM
to data...@googlegroups.com
When this data was created, the Census Tables were not available, and it was created by digitising the maps given in the Administrative Atlas PDF available on the Census Website.

So it includes only the numbers available in the Administrative Atlas, and not the actual codes that are available in the Census Tables.

Source: I was part of the India-WRIS team which had created this data.

Regards,
Devdatta


Ashim Kapoor

Dec 8, 2021, 5:17:20 AM
to data...@googlegroups.com
Dear Devdatta,

OK. I wonder why I get 531 / 641 matches if the codes do not have the
same meaning in both datasets (the map and the lgdirectory).

What about ST_CEN_CD and DT_CEN_CD in the 2001 / 2011 Districts
shapefiles? Are those both the 2001 Census codes for States and
Districts respectively?

I wish someone would write a README explaining the column names in the
2001 / 2011 Districts and States shapefiles.

Best,
Ashim