Automatic generation of metadata: case study using the technique of automatic statistical indexing with the ANNIF tool

16 views
Skip to first unread message

Jean Carlos Borges Brito

unread,
Oct 31, 2022, 6:15:14 PM10/31/22
to Annif Users
Dear Osma and Annif users.

Next November 8th,  I will present a case study using Annif. Below is the description:

Abstract: This research presents a case study with the ANNIF tool, performing the automatic generation of metadata through the technique of automatic statistical indexing and machine learning, using a rule-based algorithm to extract metadata values ​​from information resources. The objective of this work is to develop a framework for using the tool. A corpus of knowledge was created with 52 articles from the Brazilian Information Science Base (BRAPCI), using the Brazilian Thesaurus in Information Science (TBCI) as a controlled vocabulary. After the model training process, a preliminary test of automatic statistical indexing was carried out on a Complete Thesis stored in the Institutional Repository of the University of Brasília (RiUnB), generating the recommendation of subjects/descriptors. The terms assigned by the ANNIF were compared with the keywords of the RiUnB thesis, obtaining good similarity. It is concluded that the use of ANNIF, using the technique of automatic statistical indexing, contributed to the automation of the task, achieving satisfactory performance.


Thank you for sharing in this group the knowledge and experiences with Annif.

Reply all
Reply to author
Forward
0 new messages