Clustering dan Visualisasi Data ASN dalam Penunjang Analisis Kecukupan Data di Perangkat Daerah Pemerintah Provinsi Jawa Timur

Zhulfi Bajra Wikjatmiko; Endang Setyati; Yosi Kristian

doi:10.30736/jti.v10i2.1440

Authors

Zhulfi Bajra Wikjatmiko, BKD Provinsi Jawa Timur
Endang Setyati, Institut Sains dan Teknologi Terpadu Surabaya (ISTTS)
Yosi Kristian, Institut Sains dan Teknologi Terpadu Surabaya (ISTTS)

Keywords:

Clustering and Data Visualization, East Java Provincial Government ASN, data updating, personnel information system, Elbow Method

Abstract

This study implements K-Means, Gaussian Mixture Model (GMM), and Hierarchical Clustering to analyze the data adequacy of 18,962 civil servants (ASN) of the East Java Provincial Government, drawn from 232 work units across 38 administrative regions. The dataset covers 8 types of regional agencies (perangkat daerah), with Satpol PP (7,231 ASN) and UPT (6,961 ASN) as the largest contributors. The clustering variables are 12 binary attributes of personnel-document completeness: half-body photo, full-body photo, birth certificate, KTP (national ID card), NPWP (taxpayer ID card), PNS oath of office, BKN memo (nota BKN), SPMT, virtual ASN card, NPWP number, BPJS number, and KK (family card) number. Preprocessing consisted of Min-Max normalization, removal of 287 duplicates, and binary encoding. The Elbow method yielded an optimal number of clusters of k=7 for K-Means (distortion score 119.496), k=4 for GMM (BIC 119.839), and k=3 for Hierarchical Clustering. Evaluation with the Silhouette Score, Calinski-Harabasz Index, and Davies-Bouldin Index showed K-Means performing best (0.332, 3412.783, and 1.224, respectively). K-Means placed 35% of ASN in the High category (>80%), 45% in Medium (70-79%), and 20% in Low (<70%). GMM produced a distribution of 40% High, 55% Medium, and 5% Low, plus 14 outliers. Hierarchical Clustering produced 52% High, 47% Medium, and 1% Low. The findings show that work units in Surabaya had the highest completeness (54.27%) compared with other regencies (<5%). PNS had 90% better completeness than PPPK/CPNS. The Virtual ASN Card and KK Number were the documents with the lowest completeness (<40%). Visualization through an interactive dashboard, heatmaps, PCA scatter plots, and dendrograms facilitates identifying priorities for data remediation. The model can be adapted to support the digital transformation of bureaucracy in other government agencies.
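
The workflow summarized in the abstract (Min-Max scaling of binary document flags, an elbow-style scan over k, and a comparison of three algorithms on three internal validity indices) can be sketched roughly as follows. This is a minimal illustration, not the authors' code: the use of scikit-learn, the synthetic 600 x 12 data, the three hypothetical completion probabilities, the diagonal GMM covariance, Ward linkage, and the fixed k=3 in the comparison are all assumptions made for the example, so the printed scores will not match the reported ones.

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering, KMeans
from sklearn.metrics import (
    calinski_harabasz_score,
    davies_bouldin_score,
    silhouette_score,
)
from sklearn.mixture import GaussianMixture
from sklearn.preprocessing import MinMaxScaler

rng = np.random.default_rng(0)

# ASSUMPTION: synthetic stand-in for the real records.
# 600 employees x 12 binary flags (1 = document present, 0 = missing),
# drawn from three hypothetical groups with different completion rates.
probs = np.array([0.95, 0.75, 0.40])
groups = rng.integers(0, 3, size=600)
X_raw = (rng.random((600, 12)) < probs[groups, None]).astype(float)

# Min-Max normalization; on 0/1 flags this leaves the values unchanged,
# but it is kept to mirror the preprocessing named in the abstract.
X = MinMaxScaler().fit_transform(X_raw)

# Elbow-style scan: K-Means inertia (distortion) and GMM BIC for each k.
inertia = {}
bic = {}
for k in range(2, 9):
    inertia[k] = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X).inertia_
    gmm = GaussianMixture(n_components=k, covariance_type="diag", random_state=0)
    bic[k] = gmm.fit(X).bic(X)


def evaluate(labels):
    """Return (silhouette, Calinski-Harabasz, Davies-Bouldin) for a labeling."""
    return (
        silhouette_score(X, labels),
        calinski_harabasz_score(X, labels),
        davies_bouldin_score(X, labels),
    )


# ASSUMPTION: one fixed k for all three algorithms, for illustration only.
k = 3
labels = {
    "K-Means": KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X),
    "GMM": GaussianMixture(
        n_components=k, covariance_type="diag", random_state=0
    ).fit_predict(X),
    "Hierarchical": AgglomerativeClustering(
        n_clusters=k, linkage="ward"
    ).fit_predict(X),
}
scores = {name: evaluate(lab) for name, lab in labels.items()}

for k_val in inertia:
    print(f"k={k_val}: inertia={inertia[k_val]:.1f}, BIC={bic[k_val]:.1f}")
for name, (sil, ch, db) in scores.items():
    print(f"{name}: silhouette={sil:.3f}, CH={ch:.1f}, DB={db:.3f}")
```

When reading the output, a higher Silhouette Score and Calinski-Harabasz Index and a lower Davies-Bouldin Index indicate better-separated clusters, which is the sense in which the abstract ranks K-Means first.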

Published

2025-09-09

How to Cite

Wikjatmiko, Z. B., Setyati, E., & Kristian, Y. (2025). Clustering dan Visualisasi Data ASN dalam Penunjang Analisis Kecukupan Data di Perangkat Daerah Pemerintah Provinsi Jawa Timur. Joutica, 10(2), 183–193. https://doi.org/10.30736/jti.v10i2.1440

Issue

Vol. 10 No. 2 (2025): SEPTEMBER

Section

editorial

License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Joutica: Journal of Informatic Unisla is licensed under an Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) license. You are free to:

Share: copy and redistribute the material in any medium or format.
Adapt: remix, transform, and build upon the material for any purpose, even commercially. This license is acceptable for Free Cultural Works.

The licensor cannot revoke these freedoms as long as you follow the license terms. Under the following terms:

Attribution: You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
ShareAlike: If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original.
No additional restrictions: You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
