inconsistent language labels

3 views
Skip to first unread message

andre castro

unread,
Jan 14, 2026, 11:33:36 AM (22 hours ago) Jan 14
to Getty Vocabularies as Linked Open Data
Hi all,
my first issue here.

I have been looking closely at the AAT gvp:Concept instances and noted there are some consistences concerning  the (English) preferred labels.

There are 2 main property which link to label  xl:prefLabel and  gvp:prefLabelGVP. I was trying to see if there is more consistence in one or the other property values. But I encountered issues with both of them.

Examples:

     gvp:prefLabelGVP aat_term:1000799424-en
     xl:prefLabel  aat_term:1000799425-fr

Should xl:prefLabel include aat_term:1000799424-en as a value?


https://vocab.getty.edu/aat/300018434  Southern Song
 xl:prefLabel aat_term:1000018434-zh, aat_term:1000421665-es, aat_term:1000473914-nl, aat_term:1000644552-zh-Hant, aat_term:1000890492-zh-Latn-pinyin-x-notone, aat_term:1000890493-zh-Latn-pinyin-x-hanyu, aat_term:1000890494-zh-Latn-wadegile
 gvp:prefLabelGVP aat_term:1000018434-zh

No English language label no in either  xl:prefLabel or gvp:prefLabelGVP
And aat_term:1000018434-zh states: xl:literalForm Southern Song@zh 
Which is not Chinese but English


 xl:prefLabel aat_term:1000669433-la, aat_term:1000669433-nl
gvp:prefLabelGVP  aat_term:1000669433-nl

Why is this scientific name with Dutch "(soort)" and no English.
Shouldn't there just be 1 concept for Quercus ilex with language labels in Latin, English, Dutch, etc?


I discovered these by inspecting the results of the following SPARQL query:

SELECT *
WHERE
{
   ?concept a gvp:Concept .
   OPTIONAL {?concept gvp:prefLabelGVP ?gpvlabel } .
   FILTER ( !regex(STR(?gpvlabel), "-en.*$") )
}
ORDER BY ?concept
LIMIT 100

Let me know if you have any thoughts on this. If there are plans to address these labels inconsistencies.
Best
a

Vladimir Alexiev

unread,
Jan 14, 2026, 2:40:14 PM (19 hours ago) Jan 14
to andre castro, Dunn, Heather (PCH), Gregg Garcia, Getty Vocabularies as Linked Open Data
Hi Andre! I checked the first concept and i think you are right: "prize medal" should be a prefLabel.

You can see that at


Which was the source for this concept. 
I also see 2 altLabel that I don't see in the AAT data.

So it seems to me that some fixes are needed in the CHIN->AAT contribution (ingest).

Which of the other concepts  you investigated come from CHIN?

--
You received this message because you are subscribed to the Google Groups "Getty Vocabularies as Linked Open Data" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gettyvocablo...@googlegroups.com.
To view this discussion visit https://groups.google.com/d/msgid/gettyvocablod/8f6577fe-54b0-4a75-a189-d18296748357n%40googlegroups.com.

Getty Vocabularies LOD

unread,
Jan 14, 2026, 2:48:19 PM (19 hours ago) Jan 14
to Getty Vocabularies as Linked Open Data
Hi, Andre (and Vladimir!).

I have forwarded these records to the editorial staff along with your query. The issues you highlighted will be fixed in the next Vocabularies publish at the beginning of February. Thanks for the thorough review of the data.

Gregg Garcia
Software Architect, Getty Digital, J. Paul Getty Trust

andre castro

unread,
4:08 AM (6 hours ago) 4:08 AM
to Getty Vocabularies as Linked Open Data
Hi Vlado & Gregg,
Thank for your replies and commitment to fix these issues.

@Vladimir: I was not aware that "prize medal"came from CHIN and I did not looks specifically at CHIN concept. I was looking for concepts that fell out of the pattern
  ?concept gvp:prefLabelGVP aat:XXXXX-en (or ?concept gvp:prefLabelGVP aat:XXXXX-en-US) since that property states that "Term preferred by the Getty Vocabulary Program. The language is usually English."

Out of curiosity, I ran the following query for , which returned only https://vocab.getty.edu/aat/300427801 as gvp:prefLabelGVP value aat_term:1000880546-fr . In the original, it seems correct https://page.nomenclature.info/parcourir-browse.app?id=4189&lang=en&ws=INT&wo=N
It seems that someone already fixed the "prize medals" label :) thanks.

SELECT *
WHERE
{
?concept a gvp:Concept .
?concept dcterms:contributor <http://vocab.getty.edu/aat/contrib/10000159> .
?concept gvp:prefLabelGVP ?gpvlabel .
FILTER ( !regex(STR(?gpvlabel), "-en.*$") )
}
ORDER BY ?concept
LIMIT 100



I am happy to report I encounter some other issues with the language labels.

In any case, if I want to capture AAT concepts and their @en/@en-US language labels, is it safe to query the gvp:prefLabelGVP? By safe, I mean, @en language label nodes, will be the default, correct? 
Because, before diving deeper into this and noticing the gvp:prefLabelGVP property, I was querying the xl:prefLabel values, and its language nodes values for the @en one, so I could extract the @en label. Using the gvp:prefLabelGVP property property seems more straight forward.

Thank you for your replies and feedback.
Best
Andre

andre castro

unread,
4:46 AM (5 hours ago) 4:46 AM
to Getty Vocabularies as Linked Open Data
@Gregg,
The following query, will shows that are only a few  issues with gvp:prefLabelGVP values in aat:Concept instances. 
Including to the ones I already mentioned, there is:

All the others, seem correct that the gvp:prefLabelGVP nodes are not @en/@en-US.

SELECT *
WHERE
{
?concept a gvp:Concept .
?concept gvp:prefLabelGVP ?gpvlabel .
FILTER ( !regex(STR(?gpvlabel), "-en.*$") )
?gpvlabel skosxl:literalForm ?literal .
}
ORDER BY ?concept
Reply all
Reply to author
Forward
0 new messages