Title of the Paper-

Freedom Versus Standard in Article Keyword Generation: An Empirical Study

Published in Journal of Library Metadata 2024, VOL. 24, NO. 4, p. 291–305

Abstract-

This study compares the overlap among author-furnished keywords, Library of Congress Subject Headings and a consolidated controlled vocabulary for articles published in Annals of Library & Information Studies, published by NISCAIR from 2018 to 2023, India. The study focuses on the effectiveness of controlled vocabulary for keyword generation. The work is intended to contribute to the literature on author keywords versus keyword standardization in measuring information retrieval efficacy in real-time. The aggregated controlled vocabulary constructed for this study provided a positive result, showing the usefulness and requirements of such a robust controlled vocabulary. The study reflects the extended views of thesauri for a standard author keyword suggester or recommender system.

Title of the Paper-

From chaos to clarity: how controlled vocabularies shape Indian social science repositories

Published in The Electronic Library, 2025, Vol. 43 No. 2,  pp. 154-173

Abstract-

Purpose – The Confederation of Open Access Repositories (COAR) prescribes three types of controlled vocabularies for open access repositories (OARs): access rights, resource type and version type. Interestingly, COAR does not suggest a subject-specific vocabulary for organising content, whereas the subject parameter is one of the most preferred search categories used by information seekers. The purpose of the study is to investigate the use of controlled vocabularies in subject arrangement in OARs.

Design/methodology/approach – The study comprises eight stages. The first is to identify the total number of repositories enlisted in OpenDOAR under the social science domain in India. Next, the samples are selected, followed by data harvesting using MarcEdit OAI-PMH harvester plug-ins. The next step is to process the data and then consolidated controlled vocabulary is constructed by merging six existing thesauri. Then, a similarity matching algorithm is used to determine the usage of controlled vocabularies for subject arrangement. The last step is to evaluate the efficacy of controlled vocabulary in information retrieval.

Findings – The results revealed that subject arrangement differs largely in each repository. The study also showed that the use of controlled vocabularies in OARs for subject arrangement still needs to be standardised to enhance interoperability.

Originality/value – This research reflected that those controlled terms address issues, such as ambiguity, inconsistency and synonym variation, typically found with uncontrolled keywords through standardising subject metadata. This standardisation gives users a more reliable and user-friendly search experience, ultimately improving the discoverability and usability of open access content.

Title of the Paper-

Automated Subject Identification using the Universal Decimal Classification: The ANN Approach

Published in Journal of Information and Knowledge, Vol 60(2), April 2023, p.69-76

Abstract-

Universal Decimal Classification (UDC) is a popular controlled vocabulary that is used to represent subjects of documents. Text categorization determines a text’s category, as evident from the notation-text label format of the Universal Decimal Classification. With the help of machine learning techniques and the Universal Decimal Classification (UDC), the present work aims to develop an end-user (library professional) based recommender system for automatically classifying documents using the UDC scheme. The proposed work is conceived for determining and constructing a complex class number using the syntax of Universal Decimal Classification (UDC). A corpus of documents classified with the UDC scheme is used as a training dataset. The classification of the documents is done with human mediation having proficiency in classificatory approaches. The BERT model and the KNIME software are used for the study. This study uses the classified dataset to fine-tune the pre-trained BERT model to construct the semi-automatic classification model. The results show that the model is constructed with high accuracy and Area Under Curve (AUC) value, although the prediction represented a low accuracy rate. This study reflected that if the model is explicitly trained by annotating each concept and if the full licensed version of UDC class numbers becomes available, there is a greater potency of developing an automated, freely faceted classification scheme for practical use.

Title of the Paper-

Web-based Prosodic Perspectives of Open Access Library and Information Science (LIS) Repositories in South Asia and East Asia

Published in International Journal of Information Science and Management, 2022, Vol. 20, No. 3, p. 277-300

Abstract-

The primary objective of this study is to assess the quality and performance of South and East Asian Open Access Library and Information Science Repositories. The study is segmented into four parts; the first is devoted to quality assessment, and the second is the repositories’ performance using the web analysis tool Nibbler and Alexa. In the third segment, the Revised Web Impact Factors (RWIF) were calculated, and the final part represents the Ranking of the repositories in terms of visibility, transparency, and excellence. The results indicate that the Chinese Institutional Repository of the Chinese Academy of Geographic Sciences and Natural Resources Research, CAS, and the Peking University Institutional Repository ranked the first and the second, respectively, with Japanese repositories ranking the last. “Taiwan’s Chaoyang University of Technology Institutional Repository” has the most comprehensive collection of resources with varying levels of availability of resources in terms of quality assurance indicators. The authors of this paper are in the dire belief that this study may aid administrators in determining the repositories’ strengths and weaknesses to enhance their quality and performance.

Title of the Paper-

Scientometric Introspect of Digital Citizenship in Scopus Database From 1999 to 2022

Published in International Journal of Information Science and Management, 2024, Vol. 22, No. 2,p. 123-137

Abstract-

 Literary warrants on ‘Digital Citizenship’ published since its inception are still countable. The search executed to identify the bibliometric literature on ‘Digital Citizenship’ retrieved a meager outcome. Therefore, the study was pivoted with the data retrieved from the SCOPUS database. By utilizing normalized data and citation analysis to evaluate the influence across the various groupings, it is possible to see an almost linear rise in 2021 on the topic. Statistical and visual modeling software tools (R-Biblioshiny, Bibliometrix) were used in the study. It examines what this means for how bibliometric methodologies are used and disseminated in various situations. The study by the authors addressed bibliometric analysis and categorization of articles as per Ribble and Choi. Moreover, the paper discussed three field Plots (Sources-Keywords-Authors) and the most relevant authors. Furthermore, the study tried to testify to the fitness of Bradford’s Law, trending topics on the subject, co-occurrence network by keywords, topic growth over the year, and a thematic map of topics from 1999 to 2022. The results were quite impressive. out of a total of 377 articles on digital citizenship, a scattering of subjects with numerous overlapped concepts like digital literacy, digital competence, higher education, technology, digital divide, cyber-bullying, information literacy, digital citizenship education, privacy, secondary education, adolescents, digital education, empowerment, primary education, university students have been identified.

Scroll to Top