Named Entity lexica for Slavic Languages

34 views
Skip to first unread message

Jakub Piskorski

unread,
Sep 5, 2016, 11:00:34 AM9/5/16
to SIGSLAV

Hello,

I am looking for dictionaries of inflected variants of named entities (including i.a., most typical forenames, locations, organisations, etc.)
and named-entity components (i.e., NE triggers like month and day names, positions, etc.) for any Slavic languages except Polish.

In other words, something similar to what is available under http://clip.ipipan.waw.pl/Gazetteer for Polish.

Best,

Jakub


Josef Steinberger

unread,
Sep 5, 2016, 11:57:54 AM9/5/16
to sig...@googlegroups.com
Hi Jakub.
I have something for Czech. Basically I extended the JRC-Names by morphological variants of Czech names (+ there are some other names which were not in JRC-Names).

Josef

Dne 05.09.2016 v 17:00 'Jakub Piskorski' via SIGSLAV napsal(a):
--
You received this message because you are subscribed to the Google Groups "SIGSLAV" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sigslav+u...@googlegroups.com.
To post to this group, send email to sig...@googlegroups.com.
Visit this group at https://groups.google.com/group/sigslav.
For more options, visit https://groups.google.com/d/optout.


Katerina Zdravkova

unread,
Sep 5, 2016, 2:22:16 PM9/5/16
to sig...@googlegroups.com

 Hello Jakub,


In the electronic dictionary of Macedonian language, which is available via https://www.clarin.si/repository/xmlui/handle/11356/1042, you can find several toponyms, some numbers and dates. If they are not sufficient. Aleksandar and I will send you more NE.


Best regards,

Katerina


From: 'Jakub Piskorski' via SIGSLAV <sig...@googlegroups.com>
Sent: Monday, September 5, 2016 5:00:34 PM
To: SIGSLAV
Subject: [sigslav] Named Entity lexica for Slavic Languages
 

Христо Крушков

unread,
Sep 6, 2016, 5:04:37 AM9/6/16
to sig...@googlegroups.com
Hi Jakub,

You can find some info about Bulgarian in this paper:
Krushkov, Hr. “Automatic Morphological Processing of Bulgarian Proper Nouns”, Journal TAL (Traitement Automatique des Langues), Vol 41, No. 3, February 2001, pp 709-726.

available here:

https://www.researchgate.net/publication/252451836_AUTOMATIC_MORPHOLOGICAL_PROCESSING_OF_BULGARIAN_PROPER_NOUNS

or here:

https://www.academia.edu/18436763/Automatic_morphological_processing_of_Bulgarian_proper_nouns

If this info satisfies your needs, don't hesitate to contact me.

Best,
Hristo


Dne 05.09.2016 v 17:00 'Jakub Piskorski' via SIGSLAV napsal(a):

Marko Tadić

unread,
Sep 7, 2016, 6:27:31 AM9/7/16
to sig...@googlegroups.com
Hi Jakub,
please take a look at the Lexical Inflectional Database of Croatian First and Last Names, available for download through META-SHARE (http://meta-share.ffzg.hr/repository/browse/lexical-inflectional-database-of-croatian-first-and-last-names/11e503cc3d3f11e38a985ef2e4e6c59eaeb2fa3a711d40e7b740b9be76e2964c/).
Best,
MT

Jakub Piskorski

unread,
Sep 21, 2016, 4:41:38 AM9/21/16
to SIGSLAV

Hi Josef,

are these resources downloadable somewhere? We are trying to improve NER here at JRC, and for Slavic languages any resources like this would be very useful :-)

Jakub


On Monday, September 5, 2016 at 5:57:54 PM UTC+2, jstein wrote:
Hi Jakub.
I have something for Czech. Basically I extended the JRC-Names by morphological variants of Czech names (+ there are some other names which were not in JRC-Names).

Josef

Dne 05.09.2016 v 17:00 'Jakub Piskorski' via SIGSLAV napsal(a):

Hello,

I am looking for dictionaries of inflected variants of named entities (including i.a., most typical forenames, locations, organisations, etc.)
and named-entity components (i.e., NE triggers like month and day names, positions, etc.) for any Slavic languages except Polish.

In other words, something similar to what is available under http://clip.ipipan.waw.pl/Gazetteer for Polish.

Best,

Jakub


--
You received this message because you are subscribed to the Google Groups "SIGSLAV" group.
To unsubscribe from this group and stop receiving emails from it, send an email to sigslav+unsubscribe@googlegroups.com.

Pala

unread,
Sep 22, 2016, 2:55:27 AM9/22/16
to sig...@googlegroups.com
Dear Jakub,
 
    the following info can be interesting for you:

I recommend to get in touch with J. Straková: http://ufal.mff.cuni.cz/jana-strakova. I think she has access to the list of Czech NEs, possibly via

Clarin/Lindat repository located at ÚFAL.

With regards,

                                                     K. Pala



Dne 21.9.2016 v 10:41 'Jakub Piskorski' via SIGSLAV napsal(a):
To unsubscribe from this group and stop receiving emails from it, send an email to sigslav+u...@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages