Automatic classification of digital objects for improved metadata quality of electronic theses and dissertations in institutional repositories.

dc.contributor.authorPhiri, Lighton
dc.date.accessioned2021-03-09T13:59:03Z
dc.date.available2021-03-09T13:59:03Z
dc.date.issued2020
dc.description.abstractHigher education institutions typically employ Institutional Repositories (IRs) in order to curate and make available Electronic Theses and Dissertations (ETDs). While most of these IRs are implemented with self-archiving functionalities, self-archiving practices are still a challenge. This arguably leads to inconsistencies in the tagging of digital objects with descriptive metadata, potentially compromising searching and browsing of scholarly research output in IRs. This paper proposes an approach to automatically classify ETDs in IRs, using supervised machine learning techniques, by extracting features from the minimum possible input expected from document authors: the ETD manuscript. The experiment results demonstrate the feasibility of automatically classifying IR ETDs and, additionally, ensuring that repository digital objects are appropriately structured. Automatic classification of repository objects has the obvious benefit of improving the searching and browsing of content in IRs and further presents opportunities for the implementation of third-party tools and extensions that could potentially result in effective self-archiving strategies.en
dc.identifier.citationPhiri, L. (2020). Automatic Classification of Digital Objects for Improved Metadata Quality of Electronic Theses and Dissertations in Institutional Repositories. International Journal of Metadata, Semantics and Ontologies, 14(3), 234–248. https://doi.org/10.1504/IJMSO.2020.112804en
dc.identifier.urihttps://dx.doi.org/10.1504/IJMSO.2020.112804
dc.identifier.urihttp://dspace.unza.zm/handle/123456789/6969
dc.language.isoenen
dc.publisherInternational Journal of Metadata, Semantics and Ontologies (IJMSO)en
dc.subjectDigital Librariesen
dc.subjectDublin Coreen
dc.subjectOAI-PMHen
dc.subjectDocument Classificationen
dc.subjectAutomatic Classificationen
dc.subjectDigital Objectsen
dc.subjectMetadata Qualityen
dc.subjectElectronic Theses and Dissertationsen
dc.subjectInstitutional Repositoriesen
dc.subjectSelf-Archivingen
dc.titleAutomatic classification of digital objects for improved metadata quality of electronic theses and dissertations in institutional repositories.en
dc.typeArticleen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
papers-phiri20-ijmso-ir_reclassification.pdf
Size:
584.52 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.72 KB
Format:
Item-specific license agreed upon to submission
Description: