to Annif Users
Hi, I'm using the newest Annif version (1.4) with Python 3.12 in a venv, and I'm trying to load a vocabulary with the command `venv/bin/annif load-vocab vocab-pl --projects projects.cfg --language pl --verbosity DEBUG authorities_lem.ttl`, where the .ttl file is about 500 MB. I'm capturing Annif's output and get the following logs:
[2026-05-12 12:45:32,796.796] INFO [6160592896] vocab_load - Module: annif_actions Function: load_vocabulary -> Started loading vocabulary. vocab_id=vocab-pl, language=pl, path=input/vocabulary/processed_ttl/authorities_lem.ttl, size=496.02 MB.
[2026-05-12 12:45:32,976.976] INFO [6160592896] vocab_load - Module: execute_handler Function: follow_execute_logs -> [annif-debug] debug: Reading configuration file projects.cfg in CFG format
[2026-05-12 12:49:19,791.791] INFO [6160592896] vocab_load - Module: execute_handler Function: follow_execute_logs -> [annif-debug] updating existing subject index
[2026-05-12 12:49:19,799.799] INFO [6160592896] vocab_load - Module: execute_handler Function: follow_execute_logs -> [annif-out] Loading vocabulary from SKOS file input/vocabulary/processed_ttl/authorities_lem.ttl ...
[2026-05-12 12:49:19,800.800] INFO [6160592896] vocab_load - Module: execute_handler Function: follow_execute_logs -> [annif-debug] debug: loading subjects from data/vocabs/vocab-pl/subjects.csv
[2026-05-12 12:50:23,731.731] INFO [6160592896] vocab_load - Module: execute_handler Function: follow_execute_logs -> [annif-debug] debug: saving <annif.vocab.subject_index.SubjectIndexFile object at 0x104cd9160> to temporary file data/vocabs/vocab-pl/tmp-subjects08k4vzya.csv
[2026-05-12 12:50:27,632.632] INFO [6160592896] vocab_load - Module: execute_handler Function: follow_execute_logs -> [annif-debug] debug: renaming temporary file data/vocabs/vocab-pl/tmp-subjects08k4vzya.csv to data/vocabs/vocab-pl/subjects.csv
[2026-05-12 12:50:29,382.382] INFO [6160592896] vocab_load - Module: execute_handler Function: follow_execute_logs -> [annif-debug] saving vocabulary into SKOS file data/vocabs/vocab-pl/subjects.ttl
It looks like the process hangs while creating the dump file and never finishes; it is still active in the background even hours later. The output subjects.ttl file is actually the same size as the input file, so it seems to have been created correctly, but the whole `load-vocab` command is stuck at the dump file creation.
Can you help me figure out this issue?
Osma Suominen
May 25, 2026, 6:19:41 AM
to annif...@googlegroups.com
Hi Ewelina,
this is a bit surprising, I don't recall seeing this problem. Is this
something that happens every time you run the load-vocab command?
When you say that the process is active hours later, is it still doing
some work? What is the CPU usage? Is it possible that you have run out
of disk space?
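
A minimal sketch of how these two checks could be scripted, assuming a Unix-like system where the `ps` command is available. The helper names `free_disk_gb` and `cpu_percent` are made up for illustration and are not part of Annif; the PID of the stuck process has to be looked up separately.

```python
import shutil
import subprocess


def free_disk_gb(path="."):
    """Return the free space, in GiB, on the filesystem that contains `path`."""
    return shutil.disk_usage(path).free / 1024**3


def cpu_percent(pid):
    """Return the CPU percentage that `ps` reports for `pid`.

    Returns None if `ps` is missing, the process does not exist,
    or the output cannot be parsed.
    """
    try:
        result = subprocess.run(
            ["ps", "-o", "%cpu=", "-p", str(pid)],
            capture_output=True, text=True, check=False,
        )
    except FileNotFoundError:
        return None  # no `ps` binary on this system
    text = result.stdout.strip().replace(",", ".")  # tolerate comma decimals
    try:
        return float(text)
    except ValueError:
        return None  # empty or unexpected output
```

For example, `free_disk_gb("data/vocabs")` would show how much room is left where the vocabulary files are written, and calling `cpu_percent(<pid of the annif process>)` a few times in a row gives a rough idea of whether the process is still working or just idle.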
Can you check whether the same happens with a smaller vocabulary file?
500 MB is quite large.
Best,
Osma
On 12/05/2026 16:29, Ewelina C wrote:
> Hi, I'm using the newest Annif version 1.4 and python 3.12 on venv and
> I'm trying to load vocabulary using cmd: /`venv/bin/annif load-vocab