Hello,
I am looking for the best way to identify duplicates in the crime incident data. dc_key seemed promising, but it is unique for every observation in the subset of data I am examining (DUIs), so it does not flag any duplicates. However, sorting the observations by dispatch_date_time and location_block shows a large number of potential duplicates.
Any thoughts?
Would it be reasonable to delete records that share the same dispatch_date_time and location_block?
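
To make the idea concrete, here is a rough sketch of what I have in mind, in plain Python. It assumes each record is a dict with dc_key, dispatch_date_time, and location_block fields. The function names and the sample rows are made up for illustration and are not taken from the real data. It first lists the groups that share both fields, so they can be inspected before anything is deleted, and then keeps only the first record of each group.

```python
from collections import defaultdict

# Columns assumed to identify a repeated incident (an assumption, not a confirmed rule).
KEYS = ("dispatch_date_time", "location_block")


def find_duplicate_groups(rows, keys=KEYS):
    """Group rows by the key columns and return only groups with 2+ rows."""
    groups = defaultdict(list)
    for row in rows:
        groups[tuple(row[k] for k in keys)].append(row)
    return {key: grp for key, grp in groups.items() if len(grp) > 1}


def drop_duplicates(rows, keys=KEYS):
    """Keep the first row seen for each combination of the key columns."""
    seen = set()
    kept = []
    for row in rows:
        key = tuple(row[k] for k in keys)
        if key not in seen:
            seen.add(key)
            kept.append(row)
    return kept


# Made-up sample rows for illustration only (not real incident data).
sample = [
    {"dc_key": "A1", "dispatch_date_time": "2015-01-01 02:10:00",
     "location_block": "100 BLOCK MAIN ST"},
    {"dc_key": "A2", "dispatch_date_time": "2015-01-01 02:10:00",
     "location_block": "100 BLOCK MAIN ST"},
    {"dc_key": "A3", "dispatch_date_time": "2015-01-01 03:45:00",
     "location_block": "200 BLOCK OAK ST"},
]

dupes = find_duplicate_groups(sample)   # groups to review by hand
deduped = drop_duplicates(sample)       # one record per date/time and block
```

My worry with this approach is that two people arrested at the same stop would share both fields without being true duplicates, so I would want to review the flagged groups before dropping anything.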
Best,
Pete