how to load agrovoc in vocbench

53 views
Skip to first unread message

Gaurav Patel

unread,
Sep 20, 2021, 1:01:22 AM9/20/21
to vocbench-user
Hell sir,
I am going to load(agrovoc_2021-06-01_core.rdf ) using preload from local but its given error 500.
i had also try preload from dataset Catalog using LOC Cloud Connector serarch (AGROVOC) bt given Server Error (status: 500)
i am using semanticturkey-8.0.1 version. 

Regads
Gaurav
agrovoc.png

Armando Stellato

unread,
Sep 20, 2021, 5:27:36 AM9/20/21
to Gaurav Patel, vocbench-user

Dear Gaurav,

 

while we will look into the why you got a generic 500 (did you use ST’s API or you did it through VB’s interface? In case, you didn’t get any alert window to the user telling explaining why you couldn’t load your dataset?) I can confirm that getting an error is normal.

 

The “preload” is meant only for facilitating the insertion of project metadata (baseuri, semantic/lexical model etc..), and it loads the data in memory, in a temporary repository. So we put a cap on the size of the dataset (I guess the default is 10Mb). It’s very useful for ontologies and for relatively small datasets, but can’t be used for large ones.

 

So, it suffices to first manually create the project, and then load the data (use the “load data” functionality)

 

For Agrovoc, you can use the following:

 

baseuri: http://aims.fao.org/aos/agrovoc/

namespace: http://aims.fao.org/aos/agrovoc/

model: SKOS

lexical model: SKOS-XL

 

Kind Regards,

 

Armando

 

P.S: about the choice of the file. It will work with any (so ok for the rdf) but, just as word of advice, NT is ingested more quickly.

 

 

--
You received this message because you are subscribed to the Google Groups "vocbench-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to
vocbench-use...@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/vocbench-user/7192d8a6-2f3b-45ec-9df9-b87556914ee3n%40googlegroups.com.

Gaurav Patel

unread,
Sep 21, 2021, 9:32:23 AM9/21/21
to vocbench-user
Dear Armando,

it given  errors
Exception in thread "fileinstall-/home/dspace/semanticturkey-9.1/deploy"
Exception: java.lang.OutOfMemoryError thrown from the UncaughtExceptionHandler in thread "fileinstall-/home/dspace/semanticturkey-9.1/deploy"
Exception in thread "fileinstall-/home/dspace/semanticturkey-9.1/etc" java.lang.OutOfMemoryError: GC overhead limit exceeded
Exception in thread "Karaf Lock Monitor Thread" java.lang.OutOfMemoryError: GC overhead limit exceeded
SLF4J: Failed toString() invocation on an object of type [org.eclipse.jetty.util.thread.QueuedThreadPool]
Reported exception:
java.lang.OutOfMemoryError: GC overhead limit exceeded
Exception in thread "qtp1106938546-209" java.lang.OutOfMemoryError: GC overhead limit exceeded
Exception in thread "RMI RenewClean-[172.18.0.4:44444]" java.lang.OutOfMemoryError: GC overhead limit exceeded


i had also try agrovoc_2021-09-01_core.nt file but its not loaded...
Regards
Gaurav

Armando Stellato

unread,
Sep 21, 2021, 4:00:33 PM9/21/21
to Gaurav Patel, vocbench-user

Dear Gaurav,

 

it’s what I expected, it simply ran out of memory. Possibly, someone configured it for you? In that case, maybe they removed the cap on the size (or put it very high) so that you could load it into memory with the preload, but it’s not a good move to do that.

Like I said:

  1. don’t use preload
  2. manually insert the values I gave you for the project metadata
  3. after the project has been created, load the data with the load data functionality

 

You can check the test drive here for loading a large thesaurus:

 

http://vocbench.uniroma2.it/doc/user/test_drive.jsf#creating_a_skos_project_for_managing_a_large_thesaurus_by_connecting_to_an_external_triple_store__exploiting_history__validation_and_inference_

 

You need GraphDB installed first (I’m not sure if you have it already)

 

Kind Regards,

 

Armando

Reply all
Reply to author
Forward
0 new messages