Hello Sven!
(re-posting to annif-users, as I accidentally replied only to Sven)
Great to hear that you've found Annif useful and achieved good results!
The problem you have is quite common - algorithms sometimes consistently
suggest inappropriate subjects. It was reported a while ago in this
issue:
https://github.com/NatLibFi/Annif/issues/735
This has since been addressed in a PR, which added support for a
configuration feature where specific subjects can be excluded:
https://github.com/NatLibFi/Annif/pull/840
This is not yet included in any Annif release, but the code has been
merged to the main branch on GitHub. It will be released as part of 1.4.
That feature requires listing the excluded concepts individually by URI
in the Annif configuration file. That can be cumbersome if there are
many such concepts, so we are currently working on support for exclusion
rules to make it possible to exclude many concepts in one go based on
different criteria:
https://github.com/NatLibFi/Annif/issues/844
There is already a draft PR with the initial changes required for this
(https://github.com/NatLibFi/Annif/pull/846), but the work has been
stalled for a while. I can't promise when it will be ready.
Regarding use of Annif as a Python library - you are of course free to
do so if you find it useful, but it's not really a use case that we are
planning for or supporting. For instance, Python class and method APIs
within Annif quite often change even in minor releases, which could
cause breakage. There are no callbacks of the kind you mention, but I
think that the exclude functionality described above could be used to
accomplish the same end result.
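
Until the exclude feature is available in a release, the same end result
can also be approximated by filtering suggestions after they come back
from Annif. Below is a minimal sketch in plain Python. It does not use any
Annif API: the function name filter_suggestions, the example.org URIs and
the dict layout (uri, label, score) are all assumptions made for
illustration, so adapt them to whatever your own code actually receives.

```python
from typing import Iterable


def filter_suggestions(suggestions: Iterable[dict], excluded_uris: Iterable[str]) -> list[dict]:
    """Drop every suggestion whose URI is in the exclusion list.

    Hypothetical helper, not part of Annif. Each suggestion is assumed
    to be a dict with at least a "uri" key.
    """
    excluded = set(excluded_uris)  # set gives fast membership checks
    return [s for s in suggestions if s["uri"] not in excluded]


# Made-up example data; the URIs are placeholders, not real concepts.
suggestions = [
    {"uri": "http://example.org/subject/1", "label": "cats", "score": 0.9},
    {"uri": "http://example.org/subject/2", "label": "miscellaneous", "score": 0.8},
    {"uri": "http://example.org/subject/3", "label": "dogs", "score": 0.4},
]
excluded_uris = ["http://example.org/subject/2"]

kept = filter_suggestions(suggestions, excluded_uris)
for s in kept:
    print(s["uri"], s["label"], s["score"])
```

Note that filtering afterwards can leave you with fewer suggestions than
the limit you requested, so you may want to ask for a few extra results
and truncate after filtering.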
Best,
Osma
--
Osma Suominen
D.Sc. (Tech), Information Systems Specialist
National Library of Finland
P.O. Box 15 (Unioninkatu 36)
00014 HELSINGIN YLIOPISTO
Tel. +358 50 3199529
osma.s...@helsinki.fi
http://www.nationallibrary.fi