Hi,
Dataverse wonderfully provides the option to create Persistent IDentifiers for datasets and does that already for a very long time. Over time, the awareness to create PID's for other things than datasets / archival resources has grown, some even to believe that all things must be identified (Tim Berners-Lee "The Next Web", Wikidata). Dataverse supports this reasoning via the optional ID field for 'authors' of datasets, allowing authors to be identified via person identifiers such as ORCID's.
I'd like to raise four issues, that hamper the quality of data currently entered in Dataverse instances world wide.
1. Name of Dataset Author should not be obligatory
Currently, it is obligatory to enter the name of the author of a dataset. However, if I were to provide a ORCID for said author, the name would be a redundant characteristic as identification via PID supersedes identification via a string. Moreover, data entry of the name string is prone to inconsistencies, and is not friendly to author's changing their names over time (marriage, gender change). I don't want to make the case, here that no one should enter names anymore, but I don't want to be forced anymore to enter a author's name, when I provided that person's PID (e.g. ORCID).
So: please remove the forced entry of names (perhaps conditional on when a identifier is given).
2. PID's should be available for Point of Contact / Depositor
I don't need a lot of words to drive this point home as it follows from the previous. If we care about PID's for entities other than datasets, it should also be possible to add PID's for the Point of Contact / Depositor.
3. Point of Contact fields should not imply entry of person data
The point of contact data fields suggest that data on a person need to be entered: name ('Familyname,Givenname'), affiliation, email. However, many institutes use a 'helpdesk@institute' like email, to properly support Dataverse users (e.g. safeguarding non-response as a result of personnel having changed institutes). Thus, the point of contact could also be a organisation like entity. This is not just my opinion, it is also what the onMouseOver alt-text provides: "the name of the entity.. person ... or organisation". And this is also concurs with say:
https://schema.org/ContactPoint.
4. URI should be added as open identifier type
The implication of points 2 and 3, is that one would need to be able to add an identifier to a organisation or department. Currently, most identifier types relate to person's or controlled 'vocabulary' entities (e.g. VIAF). However, if an institute would like to work with the Identifier the community assigned to said institute, in our case,
https://www.wikidata.org/wiki/Q1667757, I feel institutes should be able to do so.
These points may not work for all. I would like to emphasise that I am not requesting for my points to be added as 'obligatory'. However, points 1-4 underline that the current data entry fields are restrictive in terms of adding homogenous yet flexible metadata on entities responsible for datasets and prevent the adoption of community driven identifiers for said entities.
I would like to ask you for your view points regarding points 1-4.
All the best,
Richard