Skip to first unread message

Leif Longva

unread,
Apr 8, 2014, 10:35:31 AM4/8/14
to dataverse...@googlegroups.com
I would like to see a language field in Dataverse, to record the language of the data. Our Dataverse-to-be-released will be used for linguistic data, and the language(s) as topic of the data is rather important. Is there a language field coming in 4.0, perhaps?

Eleni Castro

unread,
Apr 8, 2014, 12:14:04 PM4/8/14
to dataverse...@googlegroups.com
Thanks for your question, Leif. 

Before my team starts looking into supporting this -- just to be clear: are you looking for a language field to keep track of the primary language of the data (resource) rather than the language that the metadata is written in? 
Do you expect that a data set could have multiple languages? Or just one primary language would be assigned per data set?
Any information you might be able to offer would be helpful for me to see how closely this aligns with what Dublin Core and DataCite schemas use for their Language element for indicating the language of the resource.

Best regards,
Eleni

-- 

Eleni Castro

Research Coordinator, Data Acquisition and Archiving, Data Science

IQSS, Harvard University

http://www.iq.harvard.edu/people/eleni-castro 



On Tue, Apr 8, 2014 at 10:35 AM, Leif Longva <leif....@uit.no> wrote:
I would like to see a language field in Dataverse, to record the language of the data. Our Dataverse-to-be-released will be used for linguistic data, and the language(s) as topic of the data is rather important. Is there a language field coming in 4.0, perhaps?

--
You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-commu...@googlegroups.com.
To post to this group, send email to dataverse...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dataverse-community/697f8de0-a878-4986-8676-7183c8d5e5f3%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Leif Longva

unread,
Apr 9, 2014, 3:41:41 AM4/9/14
to dataverse...@googlegroups.com
Thank you Eleni. And yes, it is the language of the data - the resource. And data could have multiple languages, yes. Data based on comparing languages, for example. So it is actually language as a topic of the data.

Leif



kl. 18:14:04 UTC+2 tirsdag 8. april 2014 skrev Eleni Castro følgende:
Thanks for your question, Leif. 

Before my team starts looking into supporting this -- just to be clear: are you looking for a language field to keep track of the primary language of the data (resource) rather than the language that the metadata is written in? 
Do you expect that a data set could have multiple languages? Or just one primary language would be assigned per data set?
Any information you might be able to offer would be helpful for me to see how closely this aligns with what Dublin Core and DataCite schemas use for their Language element for indicating the language of the resource.

Best regards,
Eleni

-- 

Eleni Castro

Research Coordinator, Data Acquisition and Archiving, Data Science

IQSS, Harvard University

http://www.iq.harvard.edu/people/eleni-castro 

On Tue, Apr 8, 2014 at 10:35 AM, Leif Longva <leif....@uit.no> wrote:
I would like to see a language field in Dataverse, to record the language of the data. Our Dataverse-to-be-released will be used for linguistic data, and the language(s) as topic of the data is rather important. Is there a language field coming in 4.0, perhaps?

--
You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-community+unsub...@googlegroups.com.

Eleni Castro

unread,
Apr 9, 2014, 9:40:14 AM4/9/14
to dataverse...@googlegroups.com
Thanks for clarifying. I have created a ticket in Redmine https://redmine.hmdc.harvard.edu/issues/3836 for our team to review if this can be scheduled for 4.0. In the interim, since you mention that language is the topic of the data would it be suitable to list the languages under "Topic Classification"?

Best regards,
Eleni



To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-commu...@googlegroups.com.

To post to this group, send email to dataverse...@googlegroups.com.

Leif Longva

unread,
Apr 10, 2014, 6:31:40 AM4/10/14
to dataverse...@googlegroups.com
Using the Topic Classification is is likely how it will be done at the point of launching our Dataverse. The problem is that we would like to apply controlled vocabularies, and different sets of controlled vocabularies in this field by repeating the field. But repeating the field in the template does not give me the option of selecting a different set of controlled vocabulary. The ISO list of language codes could be one set of controlled vocabulary, while a predefined list of topic terms could be another. So I guesss for our need, the possibility to use different controlled vocabularies in repeated Topic Fields would be perfect.
 
Leif
 

kl. 15:40:14 UTC+2 onsdag 9. april 2014 skrev Eleni Castro følgende:
Thanks for clarifying. I have created a ticket in Redmine https://redmine.hmdc.harvard.edu/issues/3836 for our team to review if this can be scheduled for 4.0. In the interim, since you mention that language is the topic of the data would it be suitable to list the languages under "Topic Classification"?

Best regards,
Eleni

On Wed, Apr 9, 2014 at 3:41 AM, Leif Longva <leif....@uit.no> wrote:
Thank you Eleni. And yes, it is the language of the data - the resource. And data could have multiple languages, yes. Data based on comparing languages, for example. So it is actually language as a topic of the data.

Leif


kl. 18:14:04 UTC+2 tirsdag 8. april 2014 skrev Eleni Castro følgende:
Thanks for your question, Leif. 

Before my team starts looking into supporting this -- just to be clear: are you looking for a language field to keep track of the primary language of the data (resource) rather than the language that the metadata is written in? 
Do you expect that a data set could have multiple languages? Or just one primary language would be assigned per data set?
Any information you might be able to offer would be helpful for me to see how closely this aligns with what Dublin Core and DataCite schemas use for their Language element for indicating the language of the resource.

Best regards,
Eleni

-- 

Eleni Castro

Research Coordinator, Data Acquisition and Archiving, Data Science

IQSS, Harvard University

http://www.iq.harvard.edu/people/eleni-castro 

On Tue, Apr 8, 2014 at 10:35 AM, Leif Longva <leif....@uit.no> wrote:
I would like to see a language field in Dataverse, to record the language of the data. Our Dataverse-to-be-released will be used for linguistic data, and the language(s) as topic of the data is rather important. Is there a language field coming in 4.0, perhaps?

--
You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-community+unsubscribe...@googlegroups.com.

--
You received this message because you are subscribed to the Google Groups "Dataverse Users Community" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dataverse-community+unsub...@googlegroups.com.
To post to this group, send email to dataverse...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages