Improving Blood Pressure Abnormality Prediction with Logistic Regression and Random Forest: A Sequence Machine Learning Approach

Authors

  • Florentina Yuni Arini Universitas Negeri Semarang
  • Rahmat Hidayat Universitas Negeri Semarang
  • Arzaki Zunior Putra Universitas Negeri Semarang
  • Muhammad Nur Furqon Universitas Negeri Semarang
  • Muhammad Zuniar Hilmi Universitas Negeri Semarang

DOI:

https://doi.org/10.54259/pakmas.v6i1.4497

Keywords:

Abnormal Blood Pressure, Random Forest, Logistic Regression, Sequence Model, Medical Classification, Hypertension

Abstract

Early detection of blood pressure abnormalities plays a critical role in preventing and managing cardiovascular diseases, which remain the leading cause of death globally. This study proposes a sequence machine learning approach that combines Random Forest (RF) and Logistic Regression (LR) to enhance the accuracy of abnormal blood pressure prediction. The dataset, obtained from Kaggle, includes various clinical and lifestyle-related features. Data preprocessing involved handling missing values, label encoding, and normalization of numerical features. Evaluation of individual models showed that Random Forest achieved an accuracy of 0.83, while Logistic Regression reached 0.75. The sequence model, which incorporates Random Forest-generated prediction probabilities as an additional feature in Logistic Regression, improved the prediction performance with an accuracy of 0.84. Feature importance analysis identified hemoglobin level, chronic kidney disease, and genetic pedigree coefficient as the most influential predictors in classifying abnormal blood pressure. These findings highlight the effectiveness of the sequence approach in addressing the complexity of medical data and improving the precision of clinical decision support systems for hypertension diagnosis and management. Recommendations include developing advanced ensemble models, collecting longitudinal data, and conducting external validation to enhance model generalizability across diverse clinical populations.
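
The sequence step described in the abstract, in which Random Forest prediction probabilities are appended as an extra feature for Logistic Regression, can be sketched roughly as follows. This is a minimal illustration using scikit-learn, not the authors' code: the synthetic data from `make_classification` stands in for the Kaggle dataset, the hyperparameters are arbitrary, and the use of out-of-fold probabilities on the training split is an assumption made here to limit leakage (the abstract does not say how the probabilities were generated).

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import cross_val_predict, train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data (assumption: NOT the dataset used in the study).
X, y = make_classification(
    n_samples=1000, n_features=12, n_informative=6, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Normalize numerical features; the scaler is fitted on the training split only.
scaler = StandardScaler().fit(X_train)
X_train_s = scaler.transform(X_train)
X_test_s = scaler.transform(X_test)

# Stage 1: Random Forest. Out-of-fold probabilities are used for the training
# rows (an assumption of this sketch) so that Logistic Regression does not see
# probabilities the forest produced for rows it was trained on.
rf = RandomForestClassifier(n_estimators=200, random_state=0)
rf_train_proba = cross_val_predict(
    rf, X_train_s, y_train, cv=5, method="predict_proba"
)[:, 1]
rf.fit(X_train_s, y_train)
rf_test_proba = rf.predict_proba(X_test_s)[:, 1]

# Stage 2: Logistic Regression on the original features plus the RF probability.
X_train_seq = np.column_stack([X_train_s, rf_train_proba])
X_test_seq = np.column_stack([X_test_s, rf_test_proba])
lr = LogisticRegression(max_iter=1000).fit(X_train_seq, y_train)

rf_accuracy = accuracy_score(y_test, rf.predict(X_test_s))
seq_accuracy = accuracy_score(y_test, lr.predict(X_test_seq))

# Feature indices ordered from most to least important according to the forest.
importance_order = np.argsort(rf.feature_importances_)[::-1]

print(f"Random Forest accuracy: {rf_accuracy:.2f}")
print(f"Sequence model accuracy: {seq_accuracy:.2f}")
print("Features ranked by RF importance:", importance_order.tolist())
```

The accuracies printed by this sketch come from synthetic data and are not comparable to the 0.83, 0.75, and 0.84 reported above. On real data, the same structure would apply after missing-value handling and label encoding.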

Published

2026-05-15

How to Cite

Arini, F. Y., Hidayat, R., Putra, A. Z., Furqon, M. N., & Hilmi, M. Z. (2026). Peningkatan Prediksi Kelainan Tekanan Darah dengan Logistic Regression dan Random Forest: Pendekatan Sequence Machine Learning. PaKMas: Jurnal Pengabdian Kepada Masyarakat, 6(1), 96–104. https://doi.org/10.54259/pakmas.v6i1.4497
