Sherry Lake
--
You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-commu...@googlegroups.com.
To post to this group, send email to dataverse...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dataverse-community/c8ff83e4-a641-4e60-a9a1-19dc988433e3%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Duplicate file detection is based on checksums. Please see this (open) issue:
Update File Ingest Documentation To Explain Duplicate File Handling · Issue #2956 · IQSS/dataverse - https://github.com/IQSS/dataverse/issues/2956
On Tue, Mar 15, 2016 at 10:56 AM, Sherry Lake <shla...@gmail.com> wrote:
What is the criteria for duplicate file flagging on upload? A zip file with 30 items was uploaded to our UVa Dataverse and two were flagged as dups:
As far as I can tell these are not "really" dups. I am not sure exactly which files Dataverse thinks they are duplicates of, but I can guess based on filesize. Even if the contents are the same, I think it is safe to say that it is OK to have two files with the same contents and different file names in a dataset.
I've put the zip file here for someone to check out:
https://virginia.box.com/s/6jv4afrjjqssa10vktl7nifeeoffi7lb
Thanks
Sherry Lake
--
You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-community+unsub...@googlegroups.com.
To post to this group, send email to dataverse...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dataverse-community/c8ff83e4-a641-4e60-a9a1-19dc988433e3%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
I am thinking that I am not in the business of questioning why a researcher has "duplicate" files with different file names in their dataset. So is there no work around for dataverse to accept these files?
Another possible work around is to keep the zip file "zipped".
Is there a way for Dataverse NOT to unzip a zip file?
Thanks.
Sherry
On Tuesday, March 15, 2016 at 11:00:48 AM UTC-4, Philip Durbin wrote:
Duplicate file detection is based on checksums. Please see this (open) issue:
Update File Ingest Documentation To Explain Duplicate File Handling · Issue #2956 · IQSS/dataverse - https://github.com/IQSS/dataverse/issues/2956
On Tue, Mar 15, 2016 at 10:56 AM, Sherry Lake <shla...@gmail.com> wrote:
What is the criteria for duplicate file flagging on upload? A zip file with 30 items was uploaded to our UVa Dataverse and two were flagged as dups:
As far as I can tell these are not "really" dups. I am not sure exactly which files Dataverse thinks they are duplicates of, but I can guess based on filesize. Even if the contents are the same, I think it is safe to say that it is OK to have two files with the same contents and different file names in a dataset.
I've put the zip file here for someone to check out:
https://virginia.box.com/s/6jv4afrjjqssa10vktl7nifeeoffi7lb
Thanks
Sherry Lake
--
You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-commu...@googlegroups.com.
To post to this group, send email to dataverse...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dataverse-community/c8ff83e4-a641-4e60-a9a1-19dc988433e3%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
--Philip Durbin
Software Developer for http://dataverse.org
http://www.iq.harvard.edu/people/philip-durbin
--
You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-commu...@googlegroups.com.
To post to this group, send email to dataverse...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dataverse-community/3ea64612-5919-45e5-bf09-63f9249956b3%40googlegroups.com.
--
You received this message because you are subscribed to a topic in the Google Groups "Dataverse Users Community" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/dataverse-community/FLnm8-60sOs/unsubscribe.
To unsubscribe from this group and all its topics, send an email to dataverse-commu...@googlegroups.com.
To post to this group, send email to dataverse...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dataverse-community/CAPAYmDM%3DE2wZLHHCz29fMjZZAVseSi69FopgbqsDM_gpb%2BUGVQ%40mail.gmail.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dataverse-community/CADL9p-UdjDh-_mkR827dJ%3Dz%2BPDxMt4sydYB_RrXeGgT3G9Hqqg%40mail.gmail.com.
Duplicate file detection is based on checksums. Please see this (open) issue:
Update File Ingest Documentation To Explain Duplicate File Handling · Issue #2956 · IQSS/dataverse - https://github.com/IQSS/dataverse/issues/2956
On Tue, Mar 15, 2016 at 10:56 AM, Sherry Lake <shla...@gmail.com> wrote:
What is the criteria for duplicate file flagging on upload? A zip file with 30 items was uploaded to our UVa Dataverse and two were flagged as dups:
As far as I can tell these are not "really" dups. I am not sure exactly which files Dataverse thinks they are duplicates of, but I can guess based on filesize. Even if the contents are the same, I think it is safe to say that it is OK to have two files with the same contents and different file names in a dataset.
I've put the zip file here for someone to check out:
https://virginia.box.com/s/6jv4afrjjqssa10vktl7nifeeoffi7lb
Thanks
Sherry Lake
--
You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-community+unsub...@googlegroups.com.
To post to this group, send email to dataverse...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dataverse-community/c8ff83e4-a641-4e60-a9a1-19dc988433e3%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.