Concordances showing words without spaces when structure is turned off.

21 views
Skip to first unread message

Valdis Saulespurens

unread,
Mar 13, 2025, 4:51:49 AMMar 13
to NoSketch Engine
Is there some setting to adjust in newer versions in NoSketch to show space between texts enclosed by different structures when showing Concordances?

The issue appears one turns off showing of structures (documents, paragraphs, sentences, etc) for Concordances in View Options.

Then the text is shown continuously without spaces at the boundary.

With structures turned on:

<s><p>un idejiski vērtīgākais darbs.</p></s><s><p>Sprīdītis ir kalpa zēns, ko pamāte</p></s>

The issue when structures are turned off:

un idejiski vērtīgākais darbs.Sprīdītis ir kalpa zēns, ko pamāte

We can see here that there is no space between "darbs." and "Sprīdītis"

In the example we have period to help. 
Most examples will not have punctuation and the text flows together and makes for bad user experience.

Often times users do not want to see the structures.

As a workaround I could adjust registry to show space instead of structure tag

STRUCTURE p {
DISPLAYTAG 0
DISPLAYBEGIN " "
}

However, that is far from ideal, as there might be times when researchers/users want to see structures after all.

What else could I try?

Sincerely,
   Valdis Saulespurens
researcher and developer National Library of Latvia

Vlasta Ohlídalová

unread,
Mar 17, 2025, 6:19:26 AMMar 17
to NoSketch Engine, valdis.sa...@gmail.com
Hello, 

This is not the expected behavior; as long as these are separate tokens (i.e. on separate lines in the vertical), there should be a space between them, no matter if there is a structure or not (except, obviously, for the glue structure). 

Are you using the latest NoSKE version? Is it publicly available somewhere, so we could take a look? If not, could you please share the whole registry file with us? 

Best regards,
Vlasta Ohlídalová

Dne čtvrtek 13. března 2025 v 9:51:49 UTC+1 uživatel valdis.sa...@gmail.com napsal:

Valdis Saulespurens

unread,
Mar 18, 2025, 5:21:00 AMMar 18
to NoSketch Engine, Vlasta Ohlídalová, Valdis Saulespurens
Thank You for responding!

   We are currently using the latest version of Dockerized version of No-Sketch from November of 2024.:

I see it is using the following NoSketch component versions:
  • bonito-open-5.71.15.tar.gz
  • crystal-open-2.178.2.tar.gz
  • gdex-4.13.2.tar.gz
  • manatee-open-2.225.8.tar.gz
I am attaching the latest registry file where I "sacrificed" sentence tags to achieve the needed space between tokens.

Ideally, I would like to remove this 
DISPLAYTAG 0
DISPLAYBEGIN " "
from sentence structure.

We noticed this missing space issue appears in all other corpora as well. 

The old depreciated NoSketch version from a few years ago on another server of ours does not have this missing space issue and shows proper space between tokens no matter what structures are selected in View Options.

Sincerely,
   Valdis Saulespurens
researcher and developer National Library of Latvia
 


lat_sen_rom
Reply all
Reply to author
Forward
0 new messages