Data digitization using python

80 views
Skip to first unread message

Shiv Hastawala

unread,
Apr 9, 2024, 2:03:28 PM4/9/24
to data...@googlegroups.com
Hi data enthusiasts

I have a lot of publicly available data which are pdf scans of old publications. I wish to digitize them as a public service. I found that the following python package is pretty efficient at doing this job:



However, since I am python-illiterate, I was wondering if any of you python enthusiasts would be interested in writing the code for this project? Obviously, this is voluntary work. 

Please reply to me personally if you are interested. Thanks!

Thanks and regards.


Yours sincerely

Shiv Hastawala

(He/His/Him)
Doctoral Candidate
Department of Economics
Binghamton University (State University of New York)

Email ID: shastaw1[at]binghamton[dot]edu

Zoom ID: 201 717 2613

Pradeep Vanga

unread,
Apr 15, 2024, 7:57:40 PM4/15/24
to datameet
Hi Shiv,

Do you mind sharing a couple of sample pdfs? Do they contain structured data like tables or some other type of data?
Reply all
Reply to author
Forward
0 new messages