Merging bulk data

Pier Luigi Giardino

Aug 14, 2024, 2:13:06 PM
to OpenSecrets Open Data
Hello everyone,

I need to merge the lobbying and PAC data for all cycles from 1990 to this year, with the aim of assessing how much each firm contributed to political activities over the years.

Does anyone have an idea how to do this?

Thanks so much :)

Kamron Eck

Aug 15, 2024, 1:18:59 PM
to OpenSecrets Open Data
Because there is no universal identifier that links the data perfectly, you will have to do some string matching. I used a Python tool called Splink, and I've even left some code in another thread on this group's page. The formal name for this way of linking the data to firms is probabilistic matching. I found an 80% match probability to be the sweet spot for matching firm names from OpenSecrets and Capital IQ.
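
To make the idea concrete, here is a minimal sketch of threshold-based name matching. It uses Python's standard-library difflib rather than Splink, so it is a simplified stand-in and not the code from the other thread. The firm names, the suffix list, and the helper names (normalize, similarity, best_matches) are all made up for illustration. Note also that a difflib ratio is a string-similarity score, not a match probability, so the 0.8 cutoff here is only loosely analogous to the 80% figure above.

```python
import re
from difflib import SequenceMatcher

# Hypothetical list of legal-form suffixes to ignore; extend it for real data.
SUFFIXES = {"inc", "corp", "corporation", "co", "company", "llc", "ltd", "plc", "lp"}


def normalize(name: str) -> str:
    """Lowercase, replace punctuation with spaces, and drop legal-form suffixes."""
    tokens = re.sub(r"[^a-z0-9 ]", " ", name.lower()).split()
    return " ".join(t for t in tokens if t not in SUFFIXES)


def similarity(a: str, b: str) -> float:
    """String similarity in [0, 1] between two normalized firm names."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def best_matches(left, right, threshold=0.8):
    """For each name in `left`, keep its best-scoring candidate from `right`
    if the score reaches `threshold`; names without such a candidate are omitted."""
    matches = {}
    for name in left:
        scored = [(similarity(name, cand), cand) for cand in right]
        if not scored:
            continue
        score, cand = max(scored)
        if score >= threshold:
            matches[name] = (cand, score)
    return matches


# Illustrative names only, not taken from either dataset.
opensecrets_names = ["Exxon Mobil Corp", "Pfizer Inc", "Acme Widgets LLC"]
capital_iq_names = ["ExxonMobil Corporation", "Pfizer Inc.", "Boeing Co"]

matches = best_matches(opensecrets_names, capital_iq_names, threshold=0.8)
for source_name, (matched_name, score) in matches.items():
    print(f"{source_name} -> {matched_name} ({score:.2f})")
```

This compares every name against every candidate, which gets slow on the full bulk files. A dedicated record-linkage tool such as Splink is built to restrict comparisons to plausible candidate pairs and to estimate actual match probabilities, which is why it is the better fit for the real merge.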