Finding duplicates


DrElectron

Sep 27, 2021, 12:40:15 PM
to ResourceSpace
Has anyone managed to successfully find duplicates using the special search term "!duplicates"?

I understand I need to enable checksum creation, which I've done by adding "$file_checksums = true;" to config.php.

However, newly uploaded files still don't seem to have any checksum information added to them (the "checksum" field is empty when I use 'Actions' to download CSV metadata), so the !duplicates search still fails.

Am I missing anything that needs setting up to use this feature?

Any suggestions (nice ones!) gratefully received :)

DrElectron

Sep 27, 2021, 3:17:54 PM
to ResourceSpace
OK, I've supplied my own answer - what I needed in config.php was:

$file_checksums_offline = false;

So that checksums are created on upload, rather than via a cron job.

Now the !duplicates search is working as expected :)

Fred

Nov 11, 2021, 5:42:40 PM
to ResourceSpace
How does it work?

Fred

Nov 15, 2021, 2:14:02 PM
to ResourceSpace
I need this. Can anyone explain the process?

DrElectron

Nov 15, 2021, 2:59:42 PM
to ResourceSpace
Hi Fred, can you explain more clearly what you need?
Are you wanting to find duplicates using "!duplicates" in search?
If I can, I am happy to explain the process if I know what you are trying to achieve. 
We can both use Google translate if necessary.

Peter

Fred

Nov 15, 2021, 3:01:16 PM
to ResourceSpace
What are the exact steps and prerequisites to start a duplicate search?

On Thursday, November 11, 2021 at 5:42:40 PM UTC-5, Fred wrote:

Fred

Nov 15, 2021, 3:11:48 PM
to ResourceSpace
I want to find duplicates and delete them. ;)

DrElectron

Nov 15, 2021, 4:36:02 PM
to ResourceSpace
Hi Fred

ResourceSpace identifies duplicates by comparing checksums of the files, so to use the "!duplicates" search you first need to create checksums for the files. Checksums can be created either a) by a 'cron' job (i.e. at set times) or b) as files are uploaded.
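
To make the idea of checksum comparison concrete, here is a rough standalone PHP sketch. It is not ResourceSpace's own code, and I am only assuming a full-file MD5 for illustration; the function name find_duplicates_by_checksum() is made up for this example. It computes a checksum per file, groups the files by checksum, and treats any group with more than one file as duplicates.

```php
<?php
// Illustration only: NOT ResourceSpace's implementation.
// find_duplicates_by_checksum() is a hypothetical helper name.

/**
 * Group files by checksum and return only the groups with 2+ files.
 *
 * @param string[] $paths Files to compare.
 * @return array<string, string[]> Checksum => list of paths sharing it.
 */
function find_duplicates_by_checksum(array $paths): array
{
    $groups = [];
    foreach ($paths as $path) {
        // Assumption: MD5 over the whole file content.
        $checksum = md5_file($path);
        if ($checksum === false) {
            continue; // unreadable file, skip it
        }
        $groups[$checksum][] = $path;
    }
    // Keep only checksums that occur more than once.
    return array_filter($groups, fn(array $files): bool => count($files) > 1);
}

// Usage: two temp files with identical bytes and one that differs.
$dir = sys_get_temp_dir();
$a = tempnam($dir, 'dup');
$b = tempnam($dir, 'dup');
$c = tempnam($dir, 'dup');
file_put_contents($a, 'same bytes');
file_put_contents($b, 'same bytes');
file_put_contents($c, 'different bytes');

$duplicates = find_duplicates_by_checksum([$a, $b, $c]);
foreach ($duplicates as $checksum => $files) {
    echo $checksum, ': ', implode(', ', $files), PHP_EOL;
}

// Clean up the temp files.
unlink($a);
unlink($b);
unlink($c);
```

The point of the sketch is that files without a stored checksum can never land in a group, which is why the search finds nothing until checksums exist.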

If you want your checksums to be created on upload (as I did) you need to modify config.php by adding two lines of code:
$file_checksums = true;
$file_checksums_offline = false;

This will cause checksums to be added to each file that you upload.
Of course, this won't affect files that you've already uploaded. To add checksums to those files, check out the help files here: