ICD Coding Automation Model of Retinal Detachment Case Using Support Vector Machine and Random Forest
Downloads
Health Information Management (HIM) professionals are responsible for maintaining the consistency of ICD-based clinical codes for the health reimbursement and health analytics through the review of medical documentation. The complexity of coding rules and clinical pathways increases the risk of miscoding, but the implementation of Electronic Medical Record (EMR) opens opportunities for the development of automation of ICD coding. This study aims to build an ICD code automation model for retinal detachment cases from eye referral hospital using artificial intelligence through clinical text classification with Natural Language Processing (NLP) and Machine Learning (ML) algorithms. The dataset includes disease resumes, physical examinations, diagnoses, medical procedures, surgical records, and therapies from 300 inpatients. Text preprocessing uses the NLTK library through sentence splitting, abbreviation expansion, case folding, stop word removal, and tokenization functions. Data preparation involves splitting data (80:20 ratio), feature extraction with TF-IDF Vectorizer, and 5-fold cross validation. Classification modeling uses Support Vector Machine (SVM) and Random Forest (RF). Evaluation of the SVM model showed an accuracy of 0.82 (precision 0.84; recall 0.82; F1-Score 0.82), while the RF model achieved an accuracy of 0.87 (precision 0.88; recall 0.87; F1-Score 0.87). Based on confusion metrics, the correct predictions for classes H33.0, H33.2, and H33.4 on SVM are 79, 87, and 80, while RF reaches 83, 88, and 91. The development of this automation requires HIM professional’s role in ensuring the quality of EMR data and accuracy of ICD code as well as intensive model training to handle the complexity of clinical data.
Ahsan, H. (2019). Karakteristik Laser Retinopexy pada Pasiendengan Tear Retina di Divisi Vitreoretina RS Cipto Mangunkusomo Periode Januari – Desember 2018. Health Anf Medical Journal, 1(2), 47–52. https://doi.org/https://doi.org/10.33854/heme.v1i2.237
Anjani, Sylvia ; Tomy Abiyasa, M. (2023). Disrupsi Digital dan Masa Depan Rekam Medis. Selat Media Partners.
Anthony, L., Maimuna, C., Ordóñez, P., & Sebastian, J. (2020). Leveraging Data Science for Global Health. Springer Nature.
Ball, T. G. B. B. C. et al. (2014). Computer-Assisted Coding Toolkit. AHIMA Press.
Chuabsombat, K., & Padungweang, P. (2025). Automated ICD-9 and ICD-10 Coding with Machine Learning : A Real-World Study Using Electronic Medical Record Text from Udon Thani Cancer. 2025 11th International Conference on Computing and Artificial Intelligence (ICCAI), 794–801. https://doi.org/10.1109/ICCAI66501.2025.00123
Cyganek, B., Graña, M., & Krawczyk, B. (2016). A Survey of Big Data Issues in Electronic Health Record Analysis. Applied Artificial Inteliigence International Journal. https://doi.org/10.1080/08839514.2016.1193714
Dharma, A. G., Djatikusumo, A., Adriono, G. A., Yudantha, A. R., Hutapea, M. M., & Victor, A. A. (2020). Vitrektomi dengan Anestesi Lokal pada Ablasio Retina Rhegmatogen di Rumah Sakit Cipto Mangunkusumo. Opthalmologica Indonesiana, 46(2), 131–136.
Dong, H., Falis, M., Whiteley, W., Alex, B., Matterson, J., & ... (2022). Automated clinical coding: what, why, and where we are? In NPJ digital …. nature.com.
HIMSS. (2017). Demystifying Big Data and Machine Learning for Healthcare. Taylor & Francis Group.
Kaur, Rajvir; Anupama Ginige, Jeewani; Obst, O. (2023). AI-Based ICD Coding and Classification Approaches Using Discharge Summaries: A Systematic Literature Review. Expert Systems with Applications, 213. https://doi.org/https://doi.org/10.1016/j.eswa.2022.118997
Kaur, R. (2018). A Comparative Analysis of Selected Set of Natural Language Processing (NLP) And Machine Learning (ML) Algorithms For Clinical Coding Using Clinical Classification Standars. Pubmed.
Kedia, A., & Rasu, M. (2020). Hands-On - Python Natural Language Processing. In Packt Publishing.
Kementerian Kesehatan. (2020). Keputusan Menteri Kesehatan Republik Indonesia Nomor Hk 01.07/Menkes/312/2020 tentang Standar Profesi Perekam Medis dan Informasi Kesehatan.
Kementerian Kesehatan. (2022). Peraturan Menteri Kesehatan Republik Indonesia Nomor 24 Tahun 2022 tentang Rekam Medis.
Ness, S., Subramanian, M. L., & Chen, X. (2022). Diagnosis and Management Degenerative of Retinoschisis and Related Complications.pdf. Survey of Opthalmology, 67(4), 892–927. https://doi.org/https://doi.org/10.1016/j.survophthal.2021.12.004
Persatuan Dokter Spesialis Mata Indonesia (PERDAMI). (2018). Pedoman Nasional Pelayanan Kedoteran Ablasio Retina Regmatogen.
Rachman, F. H. (2020). Buku Ajar Komputasi Bahasa Alami (1st ed.). Media Nusa Creative (MNC Publisihing).
RI, K. E. P. dan P. K. N. K. K. (2021). Pedoman dan Standar Etik Penelitian dan Pengembangan Kesehatan Nasional. Lembaga Penerbit Badan Penelitian dan Pengembangan Kesehatan.
Stanfill, M. H., & Marc, D. T. (2019). Health information management: implications of artificial intelligence on healthcare data and information management. In Yearbook of medical informatics. thieme-connect.com. https://doi.org/10.1055/s-0039-1677913
Surur, F. M., Mamo, A. A., Gebresilassie, B. G., Mekonen, K. A., Golda, A., Behera, R. K., & Kumar, K. (2025). Unlocking The Power of Machine Learning in Big Data: a Scoping Survey. Data Science and Management. https://doi.org/10.1016/j.dsm.2025.02.004
Zaky, H., Salem, A., Alzubaidi, M., Shah, H. A., Alam, T., Shah, Z., & Househ, M. (2023). Using AI for Detection, Prediction and Classification of Retinal Detachment. Studies in Health Technology and Informatics, 305, 636–639. https://doi.org/10.3233/SHTI230578
Copyright (c) 2026 Dyah Kurniawati, Mieke Nurmalasari, Hosizah Markam, Dewi Krismawati

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.









