Dear all,
We would like to perform traffic analysis of our CLARIN.SI noSketch
concordancers, i.e. https://www.clarin.si/ske/ and its variant with user
logins, https://www.clarin.si/skelog/. What we want to find out is how
much, and in what way, specific corpora are being used, so that we can
better determine which corpora and features are worth developing further.
We see two main options for doing this (we may be missing some):
- cache analysis - previous searches could give some (limited) insights,
but the cache was not really meant for this, i.e. the format seems
difficult to analyse
- web traffic analysis by analysing Apache web logs - there might
already be tools / solutions out there
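For the second option, a rough sketch of what we have in mind is below. It is only an illustration under assumptions: that the access logs are in the Apache common or combined format, that the corpus is identified by a `corpname` query parameter, and that the last path segment of the request names the feature being used (e.g. concordance, wordlist). The function name and the sample request path are made up for the example, and parameters sent in POST bodies would not show up in the logs at all.

```python
import re
from collections import Counter
from urllib.parse import urlsplit, parse_qs

# Matches the request field of an Apache common/combined log line,
# e.g. "GET /path?query HTTP/1.1", and captures the request target.
REQUEST_RE = re.compile(r'"(?:GET|POST) (\S+) HTTP/[\d.]+"')


def count_corpus_hits(log_lines):
    """Count requests per (corpus, feature) pair.

    Assumption: the corpus is given in a `corpname` query parameter and
    the last path segment names the feature. Lines without a `corpname`
    parameter (static files, etc.) are skipped.
    """
    counts = Counter()
    for line in log_lines:
        match = REQUEST_RE.search(line)
        if not match:
            continue
        url = urlsplit(match.group(1))
        corpora = parse_qs(url.query).get("corpname")
        if not corpora:
            continue
        feature = url.path.rstrip("/").rsplit("/", 1)[-1]
        counts[(corpora[0], feature)] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical log line, invented for illustration only.
    sample = [
        '192.0.2.1 - - [01/Jan/2024:10:00:00 +0100] '
        '"GET /ske/bonito/run.cgi/concordance?corpname=corpus_a&q=x HTTP/1.1" 200 512',
    ]
    for (corpus, feature), n in count_corpus_hits(sample).most_common():
        print(corpus, feature, n)
```

In practice one would read the lines from the real log file (e.g. `open(path, encoding="utf-8", errors="replace")`) and probably filter out bots and our own monitoring requests first.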
Has anybody tried something similar in the context of noSketch
Engine, so that we don't have to start from scratch?
Thanks,
Nikola and Tomaž