Publication:
Performance analysis of machine learning based optimized feature selection approaches for breast cancer diagnosis

Loading...
Thumbnail Image

Date

2022

Journal Title

International Journal of Information Technology (Singapore)

Journal ISSN

Volume Title

Publisher

Springer Science and Business Media B.V.

Research Projects

Organizational Units

Journal Issue

Abstract

Healthcare systems around the world are facing huge challenges in responding to trends of the rise of chronic diseases. The objective of our research study is the adaptation of Data Science and its approaches for prediction of various diseases in early stages. In this study we review latest proposed approaches with few limitations and their possible solutions for future work. This study also shows importance of finding significant features that improves results proposed by existing methodologies. This work aimed to build classification models such as Na�ve Bayes, Logistic Regression, k-Nearest neighbor, Support vector machine, Decision tree, Random Forest, Artificial neural network, Adaboost, XGBoost and Gradient boosting. The experimental study chooses group of features by means of three feature selection approaches such as Correlation-based selection, Information Gain based selection and Sequential feature selection. Various Machine learning classifiers are applied on these feature subsets and based on their performance best feature subset is selected. Finally, ensemble based Max Voting Classifier is proposed on top of three best performing models. The proposed model produces an enhanced performance label with accuracy score of 99.41%. � 2021, Bharati Vidyapeeth's Institute of Computer Applications and Management.

Description

Keywords

Breast cancer, Data science, Ensemble Learning, Feature selection techniques, Machine learning

Citation

Collections