Clustering of Scientific Publications Based on Field of Expertise Using Latent Dirichlet Allocation and Normalized PSO-K-means

Fina Charisma Hayatina(1) , Sony Hartono Wijaya(2) , Medria Kusuma Dewi Hardhienata(3)
(1) a:1:{s:5:"id_ID";s:49:"Departemen Ilmu Komputer Institut Pertanian Bogor";},
(2) ,
(3)

Abstract

Validating a lecturer's expertise claims often involves scrutinizing their scholarly publications. However, this process can be quite demanding, requiring significant knowledge and time due to the need to assess numerous documents. To address this challenge, this study endeavors to create a model that can categorize documents based on their areas of expertise. The study employs the K-means clustering algorithm to group documents according to the lecturers' fields of expertise. In order to enhance the efficiency of this process, Latent Dirichlet Allocation is utilized to reduce data dimensions. Additionally, Particle Swarm Optimization is used to determine the optimal initial cluster centers for the K-means algorithm. The research yielded promising results, successfully categorizing scholarly publications with a silhouette coefficient of 0.42. Furthermore, by using PSO to identify the optimal cluster centers, the silhouette coefficient was improved by 5.56%. The model's performance was evaluated by comparing the resulting clusters with the provided claims, showing a 75% matching rate and a 25% non-matching rate.

Full text article

Generated from XML file

Authors

Fina Charisma Hayatina
finacharisma@apps.ipb.ac.id (Primary Contact)
Sony Hartono Wijaya
Medria Kusuma Dewi Hardhienata
Clustering of Scientific Publications Based on Field of Expertise Using Latent Dirichlet Allocation and Normalized PSO-K-means. (2023). Jurnal Ilmu Komputer Dan Agri-Informatika, 10(2), 121-132. https://doi.org/10.29244/jika.10.2.121-132

Article Details

How to Cite

Clustering of Scientific Publications Based on Field of Expertise Using Latent Dirichlet Allocation and Normalized PSO-K-means. (2023). Jurnal Ilmu Komputer Dan Agri-Informatika, 10(2), 121-132. https://doi.org/10.29244/jika.10.2.121-132

Most read articles by the same author(s)

<< < 1 2