Article about Annif and DDC classification


Osma Suominen

Jun 7, 2024, 10:46:09 AM
to Annif Users
Hi all,

an article about using Annif for DDC classification has recently been
published in the Journal of Documentation:

Golub, K., Suominen, O., Mohammed, A. T., Aagaard, H., & Osterman, O.
(2024). Automated Dewey Decimal Classification of Swedish library
metadata using Annif software. Journal of Documentation.

https://www.emerald.com/insight/content/doi/10.1108/JD-01-2022-0026/full/html

This is an open access article, so it is available to everyone! Yay!

It took a long time to publish this one. The original manuscript was
submitted two years ago. The Annif version used (0.53) is quite dated,
though the algorithms used in the experiment have largely remained the
same so the results should still be relevant.

The study looked separately at DDC classification using only the top
level with three digits, and using a larger set of tens of thousands of
classes. However, as DDC class numbers can be built and combined in
numerous ways, they are really hard to classify using the kind of
algorithms used in Annif.
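
To illustrate the difference between the two setups, here is a minimal Python sketch of how a full DDC number relates to its three-digit class. It assumes the three-digit class is simply the part before the decimal point; the function name three_digit_class is made up for this example and is not part of Annif or taken from the article.

```python
def three_digit_class(ddc_number):
    """Return the three-digit class of a DDC number given as a string.

    Assumption: the three-digit class is the integer part of the number,
    i.e. everything before the decimal point.
    """
    integer_part = ddc_number.strip().split(".")[0]
    # Pad defensively in case leading zeros were lost (e.g. "4" for "004").
    return integer_part.zfill(3)[:3]


# The built number mentioned in the article falls under class 929.
print(three_digit_class("929.209485"))
```

Many different built numbers collapse into one three-digit class, which is why that label set is so much smaller than the full one.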

-Osma

--
Osma Suominen
D.Sc. (Tech), Information Systems Specialist
National Library of Finland
P.O. Box 15 (Unioninkatu 36)
00014 HELSINGIN YLIOPISTO
Tel. +358 50 3199529
osma.s...@helsinki.fi
http://www.nationallibrary.fi

Péter Király

Jun 11, 2024, 5:55:17 PM
to Osma Suominen, Annif Users
Hi Osma,

nice paper, thanks for sharing!

I have some questions:
- do you know if there is an open source parser for DDC numbers? On
page 3 you give an example: 929.209485, which is a composition of 3
numbers: 929.2 (main DDC), 09 (auxiliary Table 1) and 485 (auxiliary
Table 2). A parser would split 929.209485 into its components, such as
{"main": "929.2", "auxiliary1": "09", "auxiliary2": "485"}. Do you
plan to continue research in a way that includes these notations from
auxiliary tables?
- you mentioned that the Swedish version of DDC comes from Pansoft. Do
you happen to know if it is accessible to others for research
purposes? Do they maintain only the Swedish version, or other language
versions as well?
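
To make the first question concrete, here is a rough Python sketch of the kind of splitting I have in mind, with distinct keys for the two auxiliary tables. It is only a toy: the three lookup sets are hypothetical stand-ins holding just the values from the example, and a real parser would need the full DDC schedules and number-building rules, which are far more involved than a prefix match.

```python
# Toy lookup tables: hypothetical stand-ins for the real DDC schedules.
KNOWN_MAIN = {"929.2"}   # base numbers from the main schedules
TABLE1 = {"09"}          # Table 1 notations
TABLE2 = {"485"}         # Table 2 notations


def split_ddc(number):
    """Split a built DDC number into main, Table 1 and Table 2 parts.

    Returns None when the toy tables cannot account for every digit.
    """
    digits = number.replace(".", "")
    # Try the longest known main number first.
    for main in sorted(KNOWN_MAIN, key=len, reverse=True):
        main_digits = main.replace(".", "")
        if not digits.startswith(main_digits):
            continue
        rest = digits[len(main_digits):]
        if not rest:
            return {"main": main}
        for t1 in TABLE1:
            if not rest.startswith(t1):
                continue
            tail = rest[len(t1):]
            if not tail:
                return {"main": main, "auxiliary1": t1}
            if tail in TABLE2:
                return {"main": main, "auxiliary1": t1, "auxiliary2": tail}
    return None


print(split_ddc("929.209485"))
```

The hard part is of course the tables, not the matching, which is why I am asking whether something like this already exists.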

Thanks,
Péter

--
Péter Király
software developer
GWDG, Göttingen - Europeana - eXtensible Catalog - The Code4Lib Journal
http://linkedin.com/in/peterkiraly