Automated Diagnosis Assistant with Random Forest Medical Image and Algorithm Feature Extraction
DOI:
https://doi.org/10.59934/jaiea.v5i2.1815Keywords:
Feature Extraction;, Random Forest;, Automatic Diagnosis Assistant;, Medical Imaging, Classification of Diseases.Abstract
Medical image-based disease diagnosis is a complex process and requires a high level of expertise. This study aims to develop an Automatic Diagnosis Assistant using a combination of image feature extraction techniques and Random Forest (RF) classification algorithms. Medical images are processed to extract meaningful textural features, such as using the Gray Level Co-occurrence Matrix (GLCM), which is then used to train the RF model. To address the problem of data imbalance that is common in medical datasets, the SMOTE technique is applied. The performance of the model is evaluated and optimized using Randomized Search to find the best hyperparameters. The results showed that the optimized RF model was able to achieve high accuracy, with significant improvements in the Recall and F1-Score metrics compared to the baseline model. This automated diagnostic assistant is expected to be an effective tool for medical personnel in speeding up and improving diagnostic accuracy, especially in cases with high image volumes.
Downloads
References
M. J. Pomeroy, “HHS Public Access,” vol. 39, no. 6, pp. 2013–2024, 2024, doi: 10.1109/TMI.2019.2963177.3D-GLCM.
G. Litjens et al., “Deep learning as a tool for increased accuracy and efficiency of histopathological diagnosis,” Nat. Publ. Gr., no. April, pp. 1–11, 2016, doi: 10.1038/srep26286.
A. K. Aggarwal, “Learning Texture Features from GLCM for Classification of Brain Tumor MRI Images using Random Forest Classifier,” vol. 18, pp. 60–63, 2022, doi: 10.37394/232014.2022.18.8.
B. Nicholas, J. Hotlando, F. Utaminingrum, and Y. A. Sari, “Sistem Deteksi Kualitas Susu Menggunakan Metode Gray Level Co- occurrence Matrix dan Random Forest,” vol. 9, no. 8, pp. 1–5, 2025.
“TexturalFeaturesHaralickShanmugamDinstein.pdf.”
M. Rofiq, T. H. Saragih, and D. T. Nugrahadi, “Implementasi Ekstraksi Fitur GLCM dengan Klasifikasi Algoritma C5 . 0 Pada Data Computerized Tomography Scan Covid-19,” pp. 353–362, 2021.
V. P. Singh, A. Srivastava, D. Kulshreshtha, A. Chaudhary, and R. Srivastava, “Mammogram Classification Using Selected GLCM Features and Random Forest Classifier,” vol. 14, no. 6, pp. 82–87, 2016.
S. B. Kotsiantis, “Supervised Machine Learning : A Review of Classification Techniques,” vol. 31, pp. 249–268, 2007.
L. E. O. Breiman, “Random Forests,” pp. 5–32, 2001.
J. Bergstra and Y. Bengio, “Random Search for Hyper-Parameter Optimization,” vol. 13, pp. 281–305, 2012.
M. Sokolova and G. Lapalme, “A systematic analysis of performance measures for classification tasks,” Inf. Process. Manag., vol. 45, no. 4, pp. 427–437, 2009, doi: 10.1016/j.ipm.2009.03.002.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Journal of Artificial Intelligence and Engineering Applications (JAIEA)

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.







