Hello everybody,
I am posting this here in case anybody is interested. I am also
curious whether the following could be implemented in iRODS in the
future, for content deduplication, versioning, provenance, etc.
Recently, the International Organization for Standardization
published ISO 24138 (https://www.iso.org/standard/77899.html), which
defines the International Standard Content Code, or ISCC. An ISCC is a
similarity-preserving fingerprint (a "soft hash") and identifier for
digital media assets. Unlike md5, sha256, etc., it also takes
metadata into account (among other things). Its applications are numerous:
https://iscc.codes/
https://core.iscc.codes
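
To illustrate what "similarity preserving" means compared to md5 or
sha256, here is a minimal sketch using a toy SimHash over word tokens.
This is NOT the ISCC algorithm from ISO 24138, only a stand-in technique
to show the idea; the function names (simhash, hamming) are made up for
this example.

```python
import hashlib


def simhash(text: str, bits: int = 64) -> int:
    """Toy SimHash over whitespace-separated, lowercased word tokens.

    Illustration only: this is a stand-in, not the ISCC algorithm.
    """
    counts = [0] * bits
    for token in text.lower().split():
        digest = hashlib.sha256(token.encode("utf-8")).digest()
        h = int.from_bytes(digest[: bits // 8], "big")
        # Each token votes +1 or -1 on every bit position.
        for i in range(bits):
            counts[i] += 1 if (h >> i) & 1 else -1
    # A bit is set in the fingerprint when the votes for it are positive.
    return sum(1 << i for i in range(bits) if counts[i] > 0)


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two fingerprints."""
    return bin(a ^ b).count("1")


original = "iRODS stores data objects together with their metadata"
reordered = "Metadata together with their data objects iRODS stores"

# A cryptographic hash changes completely on any change to the bytes.
sha_a = hashlib.sha256(original.encode("utf-8")).hexdigest()
sha_b = hashlib.sha256(reordered.encode("utf-8")).hexdigest()
print("sha256 equal: ", sha_a == sha_b)

# The toy soft hash ignores word order and case, so the distance is 0 here.
print("simhash distance:", hamming(simhash(original), simhash(reordered)))
```

With a soft hash of this kind, small edits to the content tend to flip
only a few bits, so near-duplicates can be found by comparing Hamming
distances instead of requiring exact equality.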
In principle, there is an ISCC Python library that could easily be
added to the PRC (python-irodsclient):
https://github.com/irods/python-irodsclient/issues/573
Cheers,
Leonardo
--
Sincerely yours,
Leonardo Lenoci, PhD
ICT Research and Security Advisor
Leiden University | Faculty of Science
Microsoft's Software is Malware:
https://www.gnu.org/proprietary/malware-microsoft.html
Personal Website:
https://social.edu.nl/@the_dr_leonardo_lenoci