Problem: lsclex on VIRTUAL corpus


H Pirker

Jan 29, 2024, 10:33:05 AM
to NoSketch Engine
Dear NoSketchengineers, 

I am using "api_version": "open-5.66.3", "manatee_version": "2.36.7-open-2.223.6" on CentOS 7, and I am stumbling over a problem with lsclex on VIRTUAL corpora.

For ALL token attributes in the corpus, lsclex -f reports a frequency of "-2"!

For example, the NON-virtual version:
lsclex -f amc4_demo posTT

0 ADJA 31667
1 NN 150470
2 $( 47527
3 CARD 48998
...

vs. VIRTUAL version:
lsclex -f amc4_demovirt posTT 
0 ADJA -2
1 NN -2
2 $( -2
3 CARD -2
4 $. -2

There's no problem for STRUCTURE attributes.
And a fun fact: I experimented with several virtual corpora, and the reported "frequency" (here -2) seems to reflect the number of corpora that make up the VIRTUAL corpus.
I.e. if the VIRTUAL corpus is composed of 1, 2, or 3 corpora, lsclex reports -1, -2, and -3 respectively.
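
For anyone who wants to check their own corpora for the same symptom, here is a minimal Python sketch. It assumes that `lsclex -f <corpus> <attribute>` prints one `<id> <value> <frequency>` line per lexicon entry, as in the output quoted above. The helper names (`parse_lsclex`, `suspicious_rows`, `matches_component_count`, `run_lsclex`) are made up for illustration and are not part of manatee.

```python
import subprocess


def parse_lsclex(output):
    """Parse `lsclex -f` output, assumed to be '<id> <value> <frequency>' per line."""
    rows = []
    for line in output.splitlines():
        parts = line.split()
        if len(parts) < 3:
            continue  # skip blank lines and anything without three fields
        try:
            # First field is the id, last field the frequency, the rest the value.
            rows.append((int(parts[0]), " ".join(parts[1:-1]), int(parts[-1])))
        except ValueError:
            continue  # skip lines that do not match the assumed layout
    return rows


def suspicious_rows(rows):
    """Return the entries whose reported frequency is negative."""
    return [row for row in rows if row[2] < 0]


def matches_component_count(rows, n_components):
    """True if every frequency equals minus the number of component corpora."""
    return bool(rows) and all(freq == -n_components for _, _, freq in rows)


def run_lsclex(corpus, attribute):
    """Call lsclex the same way as on the command line (requires manatee tools)."""
    result = subprocess.run(
        ["lsclex", "-f", corpus, attribute],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


if __name__ == "__main__":
    # Sample taken from the virtual-corpus output quoted in this post.
    sample = "0 ADJA -2\n1 NN -2\n2 $( -2\n3 CARD -2\n4 $. -2\n"
    rows = parse_lsclex(sample)
    print(len(suspicious_rows(rows)), "entries with a negative frequency")
    print("matches a 2-component virtual corpus:", matches_component_count(rows, 2))
```

On a machine with the manatee tools installed, `parse_lsclex(run_lsclex("amc4_demovirt", "posTT"))` would feed the real output into the same checks.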

Any hints on how to tackle this? 

The only thing that catches my eye in the compile log:
for each token attribute there is a message saying that compiling arf is skipped:

Compiling arf for attribute posx
frq already compiled, skipping.

confused

Hannes
 

H Pirker

Jan 30, 2024, 1:00:40 PM
to NoSketch Engine
An additional observation, for all newly compiled VIRTUAL corpora:

Apart from the problem with lsclex on the command line, the wordlist and poswordlist functions in the NoSke GUI do not work either (they return zero hits).
I guess these are just two consequences of the same underlying problem?

cheers 
Hannes

Miloš Jakubíček

Feb 2, 2024, 8:34:48 AM
to H Pirker, NoSketch Engine
Dear Hannes,

I've just tried to recompile one of our virtual corpora and all seems to be working fine. Can you please send us the compilation log? You can send it to sup...@sketchengine.eu if you prefer not to disclose it publicly.

Best
Milos


Milos Jakubicek

CEO, Lexical Computing
Brno, CZ | Brighton, UK



H Pirker

Feb 23, 2024, 6:35:55 AM
to NoSketch Engine
Just for the record: the problem is solved with the new release of NoSke (manatee 2.225.8 / bonito 5.71.9 / gdex 4.13.2 / crystal 2.165.2).


cheers
Hannes