Publication Type : Conference Paper
Publisher : Springer Nature Singapore
Source : Lecture Notes in Networks and Systems
Url : https://doi.org/10.1007/978-981-96-2694-6_8
Campus : Bengaluru
School : School of Computing
Year : 2025
Abstract : Cancer is one of the deadly diseases characterized by abnormal cell division, destructive to mankind. It has been one of the leading cause of deaths for decades worldwide. In this context, this study is designed to embrace the proficiency of ML algorithms in detecting the risk of cancer occurrence based on medical and lifestyle habits amongst individuals. Specifically, the current study is designed to satisfy three objectives namely, (1) identifying cancer-causing data attributes, (2) employing ML models to decipher risk of cancer, (3) improving performance of ML models to predict cancer instances. In this perspective, cancer dataset is collected from Kaggle repository and subjected to attribute selection approaches followed by classification algorithms. Experimentations revealed that eight prominent features induced risk of cancer, as identified by Boruta algorithm. Subsequently, nine ML models were employed on data to detect risk of cancer. The hybrid ensemble model, XRandGrad achieved enhanced cancer predictive performance with an accuracy score of 99.19%. Finally, the model was validated to ascertain its performance.
Cite this Research Publication : Kalyan Nagaraj, H. S. Prashanth, Amulyashree Sridhar, A Novel Machine Learning Ensemble Algorithm to Predict Occurrence of Cancer, Lecture Notes in Networks and Systems, Springer Nature Singapore, 2025, https://doi.org/10.1007/978-981-96-2694-6_8