Advanced Heart Disease Prediction Using Fuzzy-Rough Sets and Enhanced Missing Data Imputation Techniques
D. CenittaDepartment of Computer Science and Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, Karnataka, India cenitta.d@manipal.edu,0000-0003-3715-6941
A.R. ShravyaAssistant Professor, Department of Computer Science and Engineering, B.M.S. College of Engineering, Bangalore, Karnataka, India shravya.cse@bmsce.ac.in0009-0003-9469-679X
Rajeshwari MadliAssistant Professor, Department of Computer Science and Engineering, B.M.S. College of Engineering, Bangalore, Karnataka, India madlirajeshwari.cse@bmsce.ac.in,0009-0001-9809-533X
S. SunayanaAssistant Professor, Department of Computer Science and Engineering, B.M.S. College of Engineering, Bangalore, Karnataka, India sunayanas.cse@bmsce.ac.in0009-0005-5858-0786
D. Sonika SharmaAssistant Professor, Department of Computer Science and Engineering, B.M.S. College of Engineering, Bangalore, Karnataka, India sonikasharma.cse@bmsce.ac.in0009-0003-3368-7048
T. SowmyaAssistant Professor, Department of Computer Science & Engineering, B.M.S. College of Engineering, Bangalore, Karnataka, India sowmyat.cse@bmsce.ac.in0009-0005-3971-7791
Vijaya ArjunanDepartment of Computer Science and Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Manipal, Karnataka, India 0000-0002-1402-65730000-0002-1402-6573
Keywords: Heart Disease Prediction, Fuzzy-Rough Sets, Missing Data Imputation, Machine Learning, Random Forest, Predictive Analytics
Abstract
For early diagnosis and efficient clinical decision-making, heart disease prediction accuracy is essential. However, the reliability of predictive models is severely hampered by the prevalence of missing data in medical datasets. The fuzzy-rough set theory-based improved missing data imputation system presented in this work is intended to address the ambiguity and incompleteness of medical data. Our approach builds upon the Cardiovascular Disease Multiple Imputation Technique (CVDMIT) by combining fuzzy-rough sets with sophisticated classifiers such as Random Forest and new ensemble learning methods. Several benchmark datasets, including the UCI Heart Disease dataset, were used to validate the method, which showed a 95% accuracy rate higher than more conventional techniques like expectation maximization and fuzzy C-means. Our proposed method achieves better performance through Extensive experiments which enhance sensitivity while improving precision and recall metrics. Our research establishes fundamental principles for integrating fuzzy-rough set-based imputation into clinical workflows thus enabling better scalability of heart disease prediction models.