Using Data Mining Techniques to Predict Chronic Kidney Disease: A Review Study

Mohammad Sattari, Maryam Mohammadi

Abstract


One of the growing global health problems is chronic kidney disease (CKD). Early diagnosis, control, and management of chronic kidney disease are very important. This study considers articles published in English between 2016 and 2021 that use classification methods to predict kidney disease. Data mining models play a vital role in predicting disease. Through our study, data mining techniques of support vector machine, Naive Bayes, and k‑nearest neighbor had the highest frequency. After that, random forest, neural network, and decision tree were the most common data mining techniques. Among the risk factors associated with chronic kidney disease, respectively, risk factors of albumin, age, red blood cells, pus cells, and serum creatinine had the highest frequency in these studies. The highest number of best yields was allocated to random forest technique. Reviewing larger databases in the field of kidney disease can help to better analyze the disease and ensure the risk factors extracted.

Keywords


Classification; data mining; diagnosis; kidney diseases; machine learning

Full Text:

PDF

References


Couser WG, Remuzzi G, Mendis S, Tonelli M. The contribution

of chronic kidney disease to the global burden of major

noncommunicable diseases. Kidney Int 2011;80:1258‑70.

Rady E‑HA, Anwar AS. Prediction of kidney disease stages

using data mining algorithms. Informatics Med Unlocked

;15:100178.

Shih CC, Lu CJ, Chen GD, Chang CC. Risk prediction for early

chronic kidney disease: Results from an adult health examination

program of 19,270 individuals. Int J Environ Res Public Health

;17:1‑11.

Ho CY, Pai TW, Peng YC, Lee CH, Chen YC, Chen YT, et al.

Ultrasonography image analysis for detection and classification

of chronic kidney disease. In2012 Sixth International Conference

on Complex, Intelligent, and Software Intensive Systems 2012 p.

-9. IEEE.

Arif‑Ul‑Islam, Ripon SH. Rule Induction and Prediction of

Chronic Kidney Disease Using Boosting Classifiers, Ant‑Miner

and J48 Decision Tree. In: 2nd International Conference on

Electrical, Computer and Communication Engineering, ECCE

; 2019.

Sisodia DS, Verma A. Prediction performance of individual

and ensemble learners for chronic kidney disease. In2017

international conference on inventive computing and informatics

(ICICI) 2017; p. 1027-31. IEEE.

Al‑Hyari AY, Al‑Taee AM, Al‑Taee MA. Diagnosis and

classification of chronic renal failure utilising intelligent data

mining classifiers. Int J Inf Technol Web Eng 2014;9:1‑12.

Sinha P, Sinha P. Comparative study of chronic kidney disease

prediction using KNN and SVM. International Journal of

Engineering Research and Technology 2015;4:608-12.

Hippisley‑Cox J, Coupland C. Predicting the risk of chronic

kidney disease in men and women in england and wales:

Prospective derivation and external validation of the QKidney

scores. BMC Fam Pract 2010;11:49.

Tangri N, Stevens LA, Griffith J, Tighiouart H, Djurdjev O,

Naimark D, et al. A predictive model for progression of chronic

kidney disease to kidney failure. JAMA 2011;305:1553‑9.

Embrechts MJ. Neural networks for data mining. In: Intelligent

Engineering Systems Through Artificial Neural Networks. 7.

ASME; 1997:741–6.

Han J, Kamber M, Pei J. Data Mining: Concepts and Techniques.

Data Min Concepts Tech 2012.

Amato F, López‑Rodríguez A, Peña‑Méndez E, Vaňhara P,

Hampl A, Havel J. Artificial neural networks in medical

diagnosis. J Appl Biomed 2013;11:47‑58.

Yamashita R, Nishio M, Do RKG, Togashi K. Convolutional

neural networks: An overview and application in radiology.

Insights Imaging 2018;9:611‑29.

Kolukisa B, Yavuz L, Soran A, Bakir‑Gungor B, Tuncer D,

Onen A, et al. Coronary artery disease diagnosis using optimized

adaptive ensemble machine learning algorithm. Int J Biosci

Biochem Bioinforma 2020;10:58‑65.

Tseng C‑J, Lu C‑J, Chang C‑C, Chen G‑D. Application of

machine learning to predict the recurrence‑proneness for cervical

cancer. Neural Comput Appl 2014;24:1311‑6.

Qi Y. Random forest for bioinformatics. In: Ensemble machine

learning. Springer US; 2012:307‑23.

Song YY, Lu Y. Decision tree methods: Applications for

classification and prediction. Shanghai Arch Psychiatry

;27:130‑5.

Safavian SR, Landgrebe D. A survey of decision tree classifier

methodology. IEEE Trans Syst Man Cybern 1991;21:660‑74.

Navaneeth B, Suchetha M. A dynamic pooling based

convolutional neural network approach to detect chronic kidney

disease. Biomed Signal Process Control 2020;62:102068.

Ekanayake IU, Herath D. Chronic kidney disease prediction using

machine learning methods. In: MERCon 2020 ‑ 6th International

Multidisciplinary Moratuwa Engineering Research Conference,

Proceedings. 2020:260–5.

Alaiad A, Najadat H, Mohsen B, Balhaf K. Classification and

Association Rule Mining Technique for Predicting Chronic

Kidney Disease. Journal of Information & Knowledge

Management (JIKM), World Scientific Publishing Co. Pte. Ltd.,

;19:1-17.

Aldhyani THH, Alshebami AS, Alzahrani MY. Soft clustering

for enhancing the diagnosis of chronic diseases over machine

learning algorithms. J Healthc Eng 2020;2020:4984967.

Jongbo OA, Adetunmbi AO, Ogunrinde RB, Badeji‑Ajisafe B.

Development of an ensemble approach to chronic kidney disease

diagnosis. Sci African 2020;8:e00456.

Pinto A, Ferreira D, Neto C, Abelha A, Machado J. Data mining

to predict early stage chronic kidney disease. In: Procedia

Computer Science 177;2020:562‑7.

Nurzahputra A, Muslim MA, Prasetiyo B. Optimization of C4.5

algorithm using meta learning in diagnosing of chronic kidney

diseases. In: Journal of Physics: Conference Series. 1321; 2019.

Devika R, Avilala SV, Subramaniyaswamy V. Comparative

study of classifier for chronic kidney disease prediction using

naive bayes, KNN and random forest. In: Proceedings of the

rd International Conference on Computing Methodologies and

Communication, ICCMC 2019.; 2019:679‑84.

Akben SB. Early stage chronic kidney disease diagnosis by

applying data mining methods to urinalysis, blood analysis and

disease history. IRBM 2018;39:353‑8.

Balakrishna T, Narendra B, Reddy MH, Jayasri D. Diagnosis

of chronic kidney disease using random forest classification

technique. HELIX 2017;7:873‑7.

Li JH, Luo JF, Jiang Y, Ma YJ, Ji YQ, Zhu GL, et al.

Red blood cell lifespan shortening in patients with

early‑stage chronic kidney disease. Kidney Blood Press Res

;44:1158‑65.

Raman M, Green D, Middleton RJ, Kalra PA. Comparing

the impact of older age on outcome in chronic kidney disease

of different etiologies: A prospective cohort study. J Nephrol

;31:931‑9.