Hi all,
an article about using Annif for DDC classification has been recently
published in the Journal of Documentation:
Golub, K., Suominen, O., Mohammed, A. T., Aagaard, H., & Osterman, O.
(2024). Automated Dewey Decimal Classification of Swedish library
metadata using Annif software. Journal of Documentation.
https://www.emerald.com/insight/content/doi/10.1108/JD-01-2022-0026/full/html
This is an open access article so available to everyone! Yay!
It took a long time to publish this one. The original manuscript was
submitted two years ago. The Annif version used (0.53) is quite dated,
though the algorithms used in the experiment have largely remained the
same so the results should still be relevant.
The study looked separately at DDC classification using only the top
level with three digits, and using a larger set of tens of thousands of
classes. However, as DDC class numbers can be built and combined in
numerous ways, they are really hard to classify using the kind of
algorithms used in Annif.
-Osma
--
Osma Suominen
D.Sc. (Tech), Information Systems Specialist
National Library of Finland
P.O. Box 15 (Unioninkatu 36)
00014 HELSINGIN YLIOPISTO
Tel.
+358 50 3199529
osma.s...@helsinki.fi
http://www.nationallibrary.fi