Improved discoverability of digital objects in institutional repositories using controlled vocabularies.

Date
2021-09-27
Authors
Chipangila, Bertha
Liswaniso, Eric
Mawila, Andrew
Mwanza, Philomena
Nawila, Daisy
M’sendo, Robert
Nyirenda, Mayumbo
Phiri, Lighton
Publisher
IEEE
Abstract
Higher Education Institutions (HEIs) utilise Institutional Repositories (IRs) to electronically store and make available scholarly research output produced by faculty staff and students. With the continued increase in scholarly research output, accurate and comprehensive association of subject headings with digital objects during ingestion into IRs is crucial for effective discoverability of the objects and for facilitating the discovery of related content. This paper outlines a case study conducted at an HEI, The University of Zambia, to demonstrate the effectiveness of integrating controlled subject vocabularies during the ingestion of digital objects into IRs. A situational analysis was conducted to understand how subject headings are associated with digital objects and to analyse subject headings associated with already ingested digital objects. In addition, an exploratory study was conducted to determine domain-specific subject headings to be integrated with the IR. Furthermore, a usability study was conducted to comparatively determine the usefulness of using controlled vocabularies during the ingestion of digital objects into IRs. Finally, multi-label classification experiments were carried out in which digital objects were assigned more than one class. The results of the study revealed that the majority of digital objects are currently associated with two or fewer subject headings (71.2%), with a significant number of subject headings (92.1%) being associated with a single publication. The comparative study suggests that IRs integrated with controlled vocabularies are perceived to be more usable (SUS Score = 68.9) when compared with IRs without controlled vocabularies (SUS Score = 66.2). The effectiveness of the multi-label arXiv subjects classifier demonstrates the viability of integrating automated techniques for subject classification.
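The SUS scores quoted in the abstract are on a 0 to 100 scale. As a minimal sketch of how such a score is conventionally derived, assuming the study used the standard ten-item System Usability Scale questionnaire with 1 to 5 responses, the calculation looks like the following. The function names `sus_score` and `mean_sus` are hypothetical, and the study's own response data is not reproduced here.

```python
def sus_score(responses):
    """Return the SUS score (0-100) for one respondent.

    `responses` holds the ten questionnaire answers in order, each 1-5.
    Assumes standard SUS scoring: odd-numbered items are positively
    worded, even-numbered items are negatively worded.
    """
    if len(responses) != 10:
        raise ValueError("SUS needs exactly 10 item responses")
    if any(r not in (1, 2, 3, 4, 5) for r in responses):
        raise ValueError("each response must be an integer from 1 to 5")

    total = 0
    for item_number, answer in enumerate(responses, start=1):
        if item_number % 2 == 1:
            total += answer - 1   # odd item: higher agreement is better
        else:
            total += 5 - answer   # even item: lower agreement is better
    return total * 2.5            # rescale the 0-40 sum to 0-100


def mean_sus(all_responses):
    """Average the per-respondent SUS scores for one system under test."""
    scores = [sus_score(r) for r in all_responses]
    return sum(scores) / len(scores)
```

Under this scoring, a respondent who answers 3 to every item scores 50.0, and a reported study-level figure is typically the mean across respondents, as `mean_sus` illustrates.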
Description
Article
Keywords
Controlled Vocabularies, Digital Libraries, Document Classification, Institutional Repositories, Machine Learning
Citation
Chipangila, B., Liswaniso, E., Mawila, A., Mwanza, P., Nawila, D., M’sendo, R., Nyirenda, M. & Phiri, L. (2021). Improved Discoverability of Digital Objects in Institutional Repositories Using Controlled Vocabularies. In 2021 ACM/IEEE Joint Conference on Digital Libraries (JCDL 2021) (pp. 100-109). IEEE. DOI: 10.1109/JCDL52503.2021.00022.