Performance Evaluation and Comparison of Heart Disease Prediction Using Machine Learning Methods with Elastic Net Feature Selection

Sanjib Ghosh; Muhammad Alamgir Islam

doi:10.12691/ajams-11-2-1

Article Versions

Export Article

Cite this article

Normal Style
MLA Style
APA Style
Chicago Style

Research Article

Open Access Peer-reviewed

Performance Evaluation and Comparison of Heart Disease Prediction Using Machine Learning Methods with Elastic Net Feature Selection

Sanjib Ghosh, Muhammad Alamgir Islam

American Journal of Applied Mathematics and Statistics. 2023, 11(2), 35-49. DOI: 10.12691/ajams-11-2-1

Received February 20, 2023; Revised March 25, 2023; Accepted April 05, 2023

Abstract

Abstract Heart disease is a fatal human disease that rapidly increases globally in developed and underdeveloped countries and causes death. This disease's timely and accurate diagnosis is essential for avoiding patient harm and preserving their lives. This study compared the classifier’s performance in three stages: complete attributes, class balance, and after-feature selection. For class balancing using SMOTE (Synthetic Minority Oversampling Technique) and Elastic Net feature selection algorithm has been used to select suitable features from the available dataset. In this study, justification of performance, the authors have used Logistic Regression (LR), K-nearest neighbor (KNN), Support vector machine (SVM), Random Forest (RF), Adaboost (AB), Artificial neural network (ANN), and Multilayer perceptron (MLP). It has been found that the performance increased ANN and LR after class balance and was unchanged in SVM and MLP. The classification accuracies of the top two classification algorithms, i.e., RF and Adaboost, on full features were 99% and 94%, respectively. After applying feature selection algorithms, the classification accuracy of RF slightly decreases from 99% to 92%. The accuracy of Adaboost decreases from 94% to 83%. However, the performance of classifiers increased after class balance and feature selection, such as KNN, SVM, and MLP. After class balancing and feature selection, we observed that the SVM classifier performs best.

Keywords: heart disease SMOTE elastic net LR KNN SVM DF Adaboost ANN MLP

1. Introduction

Heart disease is considered one of the most hazardous and life-snatching chronic diseases worldwide. Heart diseases are currently the number one cause of death worldwide, and the world health organization 2020 estimated this to be around 17.9 million deaths yearly ¹. This amount represents approximately 31% of global deaths. According to the latest WHO data published in 2020, coronary heart disease in Bangladesh reached 108,528, or 15.16% of total death. The heart is a vital organ in the human body. The heart provides all of the blood pumpings to every part of the body, and blood, oxygen, and other nutrients are provided to the body. If the heart fails to function correctly, other body organs may suffer. As a result, caring for the heart and other organs becomes difficult. Furthermore, due to our chaotic lives and poor eating habits, the risk of heart disease is increasing among the population. Diabetes, smoking or excessive drinking, high cholesterol, high blood pressure, obesity, and other risk factors can all raise the risks of developing heart disease.

The World Health Organization (WHO) predicts that the number of people dying from cardiovascular disease will reach 30 million by 2040 ². To diagnose cardiovascular problems, doctors often use electrocardiograms (ECGs), echocardiography (heart ultrasounds), cardiac magnetic resonance imaging (MRI), stress testing (exercise stress test, stress ECG, nuclear cardiac stress test), and angiography. However, angiography has certain drawbacks, including a high cost, various side effects, and a high level of technical knowledge ³.

Due to their inefficiency, traditional techniques usually result in erroneous diagnoses and take longer. Because of human error, traditional procedures frequently result in inexact diagnoses and take more ti. Furthermore, it is an expensive and computationally difficult method of illness diagnosis that takes time to examine ⁴.

To overcome these problems, researchers endeavored to develop different non-invasive pioneering healthcare systems based on predictive machine learning techniques, namely: Support Vector Machine (SVM), K-Nearest Neighbor (KNN), Naïve Bayes (NB), and Decision Tree (DT), etc. ⁵. As a result, the death rate of heart disease patients has dropped ⁶.

Improved monitoring of cardiac patients may help in lowering death rates. People usually consult a cardiac expert at the end of their illness ⁷.

The main aim of developing this decision support system is to create a system that literate people can utilize without a doctor to identify heart disease at an early stage. Even this type of system may help the doctor in making a decision. The beauty of this approach is that it will be based on clinical data and will not require a heart specialist doctor. Several accurate heart disease decision support systems have been discovered in the literature with varying degrees of accuracy. However, most researchers have yet to collectively evaluate missing values, outlier identification, balanced data, and feature selection strategy. The authors of the suggested hybrid decision support system for heart disease prediction have addressed the issue of missing values and data balance, and feature selection.

So, in this research, we attempted to investigate all of the risks and variables that affect the heart and can lead to cardiac illness. We have also employed a variety of prediction algorithms to predict cardiovascular disease. We showed the relevant work in section 2. Section 3 explained the methodologies and techniques we used to predict heart disease. Section 4 contains the results and discussions. Section 5 contains the conclusion and future work.

2. Related Work

Using machine learning methods, researchers presented various decision support systems to forecast cardiac disease. This section discusses several support systems offered by various researchers for forecasting cardiac disease.

A five-classifier ensemble technique was suggested by Bashir et al. ⁸ to predict heart disease. It includes decision tree induction using information gain, naive Bayes, memory-based learning, support vector machine, and decision tree induction using Gini Index. Because the datasets utilized by the authors include only important attributes, feature selection was not conducted. Missing values and outliers were removed by data preparation.

Olaniyi and Oyedtun ⁹ have used support vector machines and multilayer perceptron neural network algorithms for developing heart disease diagnosis systems.

For predicting cardiac disease, Thomas J. et al. ¹⁰ employed artificial neural networks (ANN), k-nearest neighbor (KNN), decision trees (DT), and Naive Bayes (NB). To predict heart disease, they examined several features and risk levels, such as higher than 50%, less than 50%, and zero. KNN is used to train and classify the data, and the ID3 algorithm is used to forecast and test it.

Verma et al. ¹¹ proposed a hybrid technique for predicting cardiac disease combining four classification multi-layer perceptron (MLP), Fuzzy unordered rule induction algorithm (FURIA), Multinomial logistic regression model (MLR), and C4.5 (decision tree algorithm). The correlation-based feature subset selection (CFS) method was used with Particle Swarm Optimization to pick features (PSO). The CFS method chooses attributes based on their correlation. Feature selection was used to minimize the number of characteristics from 25 to 5. After conducting feature selection with CFS and PSO, the k-mean clustering technique was used to eliminate improperly allocated data points from the data. Verma and Srivastava ¹² proposed utilizing a neural network model to diagnose coronary artery disease.

Pahwa K. Kumar R. ¹³ presented a hybrid strategy (SVM-RFE) for feature selection, decreasing unnecessary data, and removing duplication. Furthermore, random forest and Naive Bayes predict heart disease after feature selection. A correlation-based feature selection (CFS) was used for subset assessment ¹⁴. A hybrid strategy of merging the Best–First–Search, and CFS subset assessment model has been adapted to lower dimensionality. A model for heart disease prediction is presented employing random forest algorithms or a random forest modification, which did quite well compared to the classic random forest approach.

In their suggested system, Jabber et al. ¹⁵ employed the chi-square approach for feature selection and the random forest method for heart disease prediction. The authors' technique was more accurate than the decision tree.

Latha and Jeeva ¹⁶ proposed a model for heart disease detection that uses majority voting to combine the findings of random forest, multilayer perceptron, Naive Bayes, and Bayes network. Terada et al. ¹⁷ employed cardiac disease prediction algorithms such as ANN, AdaBoost, and Decision tree.

Tama et al. ¹⁸ suggested an ensemble model for predicting cardiac disease. The ensemble model was built using gradient boosting, random forest, and extreme gradient boosting.

3. Materials and Methods

3.1. Methodology

We examined the approaches employed for the experiment in this study in this system. We have mentioned how we examined the significant risk variables for the experiment and what methodologies we used to predict heart disease. In this study, we presented a hybrid system comprising three steps: data collection, preparation, and model selection. Missing data are imputed, feature selection is performed, feature scaling is conducted, and class balance is performed during preprocessing. The MICE (multivariate imputation by chained equations) technique imputes missing variables. Afterward, feature selection is carried out using the Elastic Net embedded method.

Using the standard scalar, ensure each feature has a mean of zero and a standard deviation of one. SMOTE is used for class balancing. It generates synthetic samples of minor classes, yielding an equal number of samples from each class ¹⁹.

Classification is done before and after scaling, class balance, and feature selection using LR, SVM, RF, KNN, NB, ANN, and DT. Finally, the classifier predicts whether or not a person has a heart illness. Figure 1 depicts the suggested hybrid system for heart disease prediction.

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
NEXT
View next figure
Figure 1. Proposed hybrid decision support system

Table 1. Features of Hungarian heart disease dataset
Tables index
View option
Full Size Next Table

3.2. Dataset

In this study, a heart disease dataset was processed to design our expected model. The dataset was gathered from Kaggle ²⁰. There are 14 attributes. Table 1 depicts the details of all features. The dataset contains 1025 patient records, including 713 males and 312 females of different ages, where 48.68% of patients are normal and 51.32% have heart disease. Among the patients, who have heart disease, 300 patients are male, and 226 patients are female.

3.3. Data Preprocessing

We have used Python version 3.8.3 for exploratory data analysis (EDA) and visualization. Data preprocessing is mandatory for any data mining or machine learning approach since the performance of a machine learning methodology depends on how well the dataset is prepared. There are six missing values in this dataset. The MICE method was used to impute these missing data. Imputation is performed repeatedly by this method. It is presumptive that the data is randomly absent. This approach employs a regression model to forecast the value of the data set's missing properties ²¹. The steps of this algorithm are shown in Figure 1.

To detect outlier and extreme values at the phase of preprocessing using Inter quartile Range (IQR). The outlier is a data point that goes outside the expected range of the data and can be assumed for the purposes of the analysis to be due to recording errors or other irrelevant phenomena ²². For data mining or machine learning methods ²³, it is important to remove such outliers to get a better analytical or statistical result. For outlier detection, data is partitioned into three quartiles, Q₃, Q_2, and Q₁. Here Q₁ and Q₃ are the boundaries of the data. We calculated the value of IQR by IQR = Q₃ – Q₁. Then the lower boundary B_l and upper boundary B_u were calculated using the following equations ²⁴:

Here, a result lower than B_l and greater than B_u is considered an outlier. The synthetic minority oversampling technique (SMOTE) was also applied to balance the imbalanced data.

3.4. Feature Selection

Data representation and intelligent diagnosis both depend on feature selection. One of the most popular feature selection methods is the elastic net. However, the features chosen rely on the training data, and the regularized regression weights assigned to them are unrelated to their significance if used for feature ranking, which reduces the model's interpretability and extensibility.

According to Hastie et al. ²⁵, the elastic net penalty aims to combine the gains of shrinkage and sparsity, which are both strengths of ridge regression and LASSO. By minimizing the elastic net estimator

Due to the ridge regularization, the elastic net estimator can handle correlations between the predictors better than LASSO, and due to the L1 regularization, sparsity is obtained. However, the bias issue present for LASSO is still present for the elastic net.

However, the elastic net results in uneven feature selection (FS). The elastic net chooses new features and re-estimates their weights as the training set's data samples change.

3.5. Classification Algorithms

This section discusses classification algorithms that are used to make predictions.

3.5.1. Logistic Regression

Supervised machine learning techniques like logistic regression may be applied to both classification and regression issues. Probability is used in logistic regression to forecast how categorical data will be categorized. To forecast the result, input values can be blended linearly using a sigmoid or logistic function and coefficient values. The sigmoid function is used to estimate maximum likelihood using the most likely data, and a probability between 0 and 1 is given to indicate whether or not an event will occur. When the decision threshold is applied, a categorization issue arises. There are several varieties of it, including Binary (0 or 1), Multinomial (three or more classes without any ordering), and Ordinal (three or more classifications with ordering). It is an easy model to use and can provide accurate predictions. The following logistic regression equation determines the likelihood that input X should be classified in class 1:

Here is bias and is the weight that is multiplied by input X ²⁶.

3.5.2. K-Nearest Neighbour

The supervised machine learning method K-nearest Neighbour (KNN) may be applied to classification and regression issues. In KNN, if a data point's target value is absent, the k closest data points are found in the training set, and the average value of those points is supplied instead. The mode of k labels is assigned or returned in classification, while the mean of k labels is tuned in regression. It is a basic method that is used for categorization when prior knowledge of the data is lacking. The nearest data points can be determined using distance metrics like the Manhattan distance or the Euclidean distance. Even with enormous amounts of noisy data, it can yield superior findings and predictions.

3.5.3. Support Vector Machine (SVM)

To perform classification, a hyper-plane is created, with samples from one class lying on one side and samples from another class lying on the other. The hyper-plane is optimized to achieve the most excellent possible separation between two classes. The data points from classes that are closest to the hyper-plane are considered to support vectors ²⁷.

A hyper-plane can be created using the equation shown below:

Two more hyper-planes H₁ and H₂ are created in parallel to the constructed hyper-planes as given in the following equations:

Hyper-planes should satisfy the constraints given by the following equations for each input vector I_j:

and

Hyper-plane separating two classes in SVM is shown in Figure 2.

3.5.4. Random Forest (RF)

Both classification and regression issues may be solved with Random Forest. It takes advantage of supervised machine learning. A random forest is an ensemble of decision trees that uses many trees for training and prediction ¹⁷. Every sample from the dataset is taken for repeated sampling, and for each sample, a decision tree is created. The majority voting method is used to predict outcomes based on combining all decision trees ¹⁸. The functionality of RF is shown in Figure 3.

RF can be turned for increased accuracy by optimizing parameters such as the number of estimators, minimum size of node and a number of features used for to split the node, etc. In this research, authors had done hyper-parameter tuning of RF using GridSearchCV() method.

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 2. Hyperplane separating two classes in SVM

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 3. Random Forest algorithm

3.5.5. AdaBoost (AB)

To create a more reliable classifier, the AdaBoost or Adaptive Boosting methods combines a number of weak classifiers. Based on 1000 samples, this method generates the predicted accuracy. As illustrated in (1), occurrences from the training dataset are weighted.

(1)

Where x_i is the i^th training instance, and N is the frequency of training occurrences. Each input variable receives an output from the decision tree. Equation 2 is then used to get the misclassification rate.

(2)

Boosting simply means combining several simple trainers to achieve a more accurate prediction. AdaBoost fixes the weights which may vary for both samples and classifiers. The final classification formula is shown in equation (3).

(3)

Equation (3), where k is the total number of weak classifiers h_k(p), is a linear combination of all weak classifiers (simple learners). The working of AdaBoost algorithm is shown in Figure 4.

3.5.6. Artificial Neural Network (ANN)

Machine learning in neural networks includes artificial neural networks. The functioning of ANNs is compared to that of the human brain. It is built to mimic a human neuron cell in that it learns from data, classifies, and anticipates output in the same manner as a cell receives input and responds ²⁸. The statistical architecture used to find sophisticated problem-solving is non-linear. As illustrated in Figure 5, an ANN structure comprises a data input layer, one or more hidden layers, and an output layer with many nodes that resemble neurons in the human brain.

Nodes in an ANN function as the input of the input layer, taking information from the output world and feeding it to the hidden layer, like how neurons interact. After some data processing, the hidden layer finds the pattern. The concealed layer may have one or more layers. Multi-Layer Perceptron (MLP), which we shall explore in the next point, is an ANN with several hidden layers and backpropagation. Once everything has been processed, it sends the categorized data to the output layer. An activation function is used to transform an input function into an output function; there are many distinct activation functions, including logistic, sigmoid, tanh, and linear. Recently, ANNs have gained more popularity and are utilized in various industries, including medicine, image identification, speech recognition, and facial recognition. However, employing the proper activation function and ANN parameters may provide significantly superior predictions.

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 4. AdaBoost algorithm

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 5. The basic structure of ANN

3.5.7. Multi-Layer Perceptron (MLP)

When an ANN uses many hidden layers with backpropagation instead of a single hidden layer, this is known as a multi-layer perceptron (more than three layers, including the input and output layer). A feed-forward network is MLP (a cycle is not formed between connections). It operates by receiving input from other perceptron giving each node a weight and then sending that information to the hidden layer. The output layer is fed from concealed, as well. The anticipated value (derived during input processing or backpropagation) and actual output are used to calculate the error value ²⁹. Backpropagation between the hidden layer and output layer is done to minimize. The network does feed-forward after back-propagation.

Classifier validation method: Validation of the prediction model is an essential step in machine learning processes. In this study, the K-Fold cross-validation method is applied to validate the results of the above-mentioned classification models.

K-fold cross-validation (CV): The whole dataset is split into k equal parts in K-Fold CV. The (k-1) parts are utilized for training, and the rest is used for testing at each iteration. This process continues for k-iteration. Various researchers have used different values of k for CV. Here k=10 is used for experimental work because it produces good results. In tenfold CV, 90% of data is utilized for training the model, and the remaining 10% is used for testing the model at each iteration.

3.6. Performance Measure Indices

The accuracy, sensitivity, specificity, f1-score, recall, Mathew Correlation-coefficient (MCC), and AUC-score assessment matrices have all been applied to evaluate the effectiveness of the classification algorithms implemented in this work. These metrics are all computed using the confusion matrix shown in Table 2.

Table 2. Confusion matrix
Tables index
View option
Full Size Previous Table Next Table

The model predicts that the patient does not have cardiac disease, suggesting that the patient is healthy. True Negative (TN) in the confusion matrix shows that the patient does not have cardiac disease.

The model correctly classified a person with heart disease if True Positive (TP) confirms that the patient has heart disease and the model predicts the same result.

False Positive (FP) results show that the patient does not have cardiac disease despite the model's prediction that they do; in other words, the model misclassified the patient as healthy. Another name for this is a type-1 error.

False Negative (FN) results indicate that the patient has heart disease despite the model's prediction that they do not; in other words, the model misclassifies patients with heart disease. This is also called a type-2 error.

Accuracy: The following formula may be used to compute the accuracy of the classification model, which indicates the model's overall performance:

Specificity: A measure of specificity is the proportion of newly identified healthy persons to the total number of healthy people. It indicates that the prediction is negative and the person is healthy. The following is the specificity calculation formula

Sensitivity: Sensitivity is a ratio of the number of people with heart disease who are currently categorized as having the condition to the total number of people. It indicates that the model's prediction was accurate and that the person has heart disease. The following formula can be used to determine sensitivity:

Precision: Precision is the difference between the actual and positive scores predicted by the classification model. To calculate precision, use the formula below.

F1-score: f1-measure is the weighted measure of both recall and precision. Its value ranges between 0 and 1. If its value is one, it means the classification algorithm's excellent performance; if its value is 0, it means the bad performance of the classification algorithm.

MCC: It is a correlation coefficient between the actual and predicted results. MCC gives resulting values between − 1 and + 1. Where − 1 represents the completely wrong prediction of the classifier. 0 means that the classifier generates random prediction, and + 1 represents the ideal prediction of the classification models. The formula for calculating MCC values is given below:

The area under the curve (AUC) and classification algorithm performance are closely associated, meaning that the greater the AUC value, the better the classification algorithm performance. The ROC of a classifier is described by the area under the curve (AUC).

4. Result and Discussions

Patients' cardiac conditions were diagnosed using various classification methods, such as Logistic regression, K-nearest neighbor, Support vector machine, Random forest, Adaboost, Artificial neural network, and Multilayer perceptron. The UCI Cleveland dataset was implemented for the studies. Several medical factors from the dataset were used to diagnose heart disease. With class 1 denoting the presence of a disease and class 0 denoting the absence of a disease, these factors were utilized to accomplish classification. Seven instances in the dataset had missing values. These values were imputed using MICE algorithm. Application of this algorithm resulted in a complete dataset with no instance having missing value. System performance was measured on the scale of accuracy, sensitivity, specificity, precision, f1-measure, sensitivity, AUC-score, and MCC. Results are validated using k-fold cross-validation method with k=10. In this method partitioning of the dataset is done into k groups. Performance of the model is evaluated using k-1 groups for the training of the model and one group for testing of the model. These steps of evaluating the model are repeated k times each time taking different training and testing groups.

4.1. Performance of Classifiers with All Features

At first, the experiments were performed on all dataset features without applying any pre-processing or feature selection techniques. The performance of classifiers on the feature set is shown in Table 3. Classifier provided the highest performance on the full feature set, whereas SVM provided the lowest performance.

4.2. Performance Improvement Using Class Balancing

After applying the unbalanced classes, the performance of the classifiers has been further improved by maintaining the balance of the classes. SMOTE algorithm was applied for class balancing.

Table 3. Performance of classifiers on full feature set
Tables index
View option
Full Size Previous Table Next Table

Table 4. Performance improvement using class balancing
Tables index
View option
Full Size Previous Table Next Table

It resulted in 526 instance of both class 0 and class 1. Impact of class balancing on the performance of classifiers is shown in Table 4. An increase, decrease and unchanged of accuracy, sensitivity, specificity, AUC, precision, f1-measure and MCC of classifiers with and without class balancing shown in Figures 6, 7, 8, 9, 10, 11, and 12 respectively.

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 6. Accuracy of classifier using Class balancing

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 7. Sensitivity of classifier using Class balancing

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 8. Specificity of classifiers after class balancing

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 9. AUC score of classifiers after class balancing

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 10. Precision of classifiers after class balancing

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 11. f1-measure of classifiers after class balancing

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 12. MCC of classifiers after class balancing

Table 5. Performance of all classifiers with and without balancing
Tables index
View option
Full Size Previous Table Next Table

According to Table 5, the performance score of the ANN and LR models increased after class balancing. On the other hand, SVM and MLP model scores remain almost same, while the model performance score reduced of RF.

4.3. Performance Improvement Using Feature Selection

The Elastic Net feature selection approach is used to identify the optimum feature set that decreases computing cost and improves classifier performance. The best results are still noted on the subset of (n=7) to pick the ideal feature space that reduces computing cost and enhances the classifiers' performance. Table 6 shows the projected outcomes of the best-chosen feature space using a 10-fold CV, and the optimal value of lambda used 0.03 because this value of root mean sum of squares (RMSE) is low, as shown in Table 7. The Classifiers’ performance on optimal feature space of Elastic Net feature selection algorithm at reported in Table 8.

Table 6. Selected features by Elastic Net algorithm and their scores
Tables index
View option
Full Size Previous Table Next Table

Table 7. Optimum value of lambda for elastic net regression
Tables index
View option
Full Size Previous Table Next Table

Table 8. Performance using Elastic Net feature selection algorithm
Tables index
View option
Full Size Previous Table

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 13. Accuracy scores of classifiers using feature selection

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 14. Sensitivity of classifiers using feature selection

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 15. Specificity of classifiers using feature selection

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 16. Increase in AUC score of classifiers using feature selection

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 17. Increase in precision of classifiers using feature selection

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
NEXT
View next figure
Figure 18. Increase in f1-measure of classifiers using feature selection

PNG
Larger image(png format)
View option
Figures index
NEW
Larger figure in new window
PREV
View previous figure
Figure 19. Increase in the MCC score of classifiers using feature selection

Feature selection improved the performance of all classifiers except RF and Adaboost. LR, KNN, ANN, and MLP accuracy increased by only 1%, 16%, 10%, and 6%. A maximum increase of 20% was observed in the accuracy of SVM, and a maximum decrease of 11% was observed in Adaboost. The best results were achieved in SVM. Sensitivity, Specificity, AUC, precision, f1-measure, and MCC of SVM increased by only 29%, 20%, 22%, 22%, and 41%. After feature selection, performance scores decrease in Adaboost classifier. An improvement performance of all classifiers with feature selection is shown in Figures 13, 14, 15, 16, 17, 18 and 19.

Results indicates that Random Forest provided the highest performance in combination with MICE, Elastic Net and SMOTE. Dimensionality reduction using Feature selection the performance of RF slightly decreasing compared to without feature selection and class balancing. KNN, SVM and ANN performance increased gradually after feature selection.

5. Conclusion

Heart disease is one of the most devastating and fatal chronic disease that rapidly increase in both economically developed and undeveloped countries and cause death. This damage can be reduced considerably if the patient is diagnosed in the early stages and proper treatment is provided to his/ her. The major cause of loss of life in heart disease is a delay in its detection. To assess the performance of classification algorithms various performance evaluation metrics were used such as accuracy, sensitivity, specificity, AUC, precision, f1-measure and MCC. In this paper we showed that the comparison the performance scores of LR, KNN, SVM, RF, AB and MLP classifiers with and without feature selection and class balancing. We observed that the performance increased ANN and LR after class balance and remain constant in SVM and MLP. The classification accuracies of the top two classification algorithms, i.e., RF and Adaboost, on full features were 99% and 94%, respectively. After applying feature selection algorithms, the classification accuracy of RF with Elastic net feature selection algorithm slightly decreases from 99% to 92%. The accuracy of Adaboost decreases from 94% to 83%. However, the performance of classifiers was increased after class balance and feature selection, such as KNN, SVM, and MLP. After class balancing and feature selection, the SVM classifier provides the best performance. In the future, the researchers plan to investigate with other feature selection approaches, such as Ant Colony optimization and particle swarm optimization, in order to further enhance the performance system. The authors also intend to use deep learning methods to create a system for diagnosing heart disease.

References

[1]	https://www.who.int/health-topics/cardiovascular-diseases#tab=tab_1. [Accessed 02 June 2021].
	In article

[2]	S. I. Ayon, M. M. Islam, and M. R. Hossain, “Coronary artery heart disease prediction: a comparative study of computational intelligence techniques,” IETE Journal of Research, 2020.
	In article	View Article

[3]	Patil, S. B. & Kumaraswamy, Y. Intelligent and effective heart attack prediction system using data mining and artificial neural network. Eur. J. Sci. Res. 31, 642-656 (2009).
	In article

[4]	Vanisree, K. & Singaraju, J. Decision support system for congenital heart disease diagnosis based on signs and symptoms using neural networks. Int. J. Comput. Appl. 19, 6-12 (2015).
	In article

[5]	Edmonds. In Proceedings of AISB Symposium on Socially Inspired Computing 1-12 (Hatfield, 2005).
	In article

[6]	Methaila, A., Kansal, P., Arya, H. & Kumar, P. Early heart disease prediction using data mining techniques. Computer. Sci.Inf. Technol. J. (2014).
	In article	View Article

[7]	Ponikowski P, Anker SD, AlHabib KF, Cowie MR, Force TL, Hu S, Jaarsma T, Krum H, Rastogi V, Rohde LE, Samal UC, Shimokawa H, Siswanto BB, Sliwa K, Filippatos G (2014) Heart failure: preventing disease and death worldwide. ESC Heart Fail 1(1): 4-25.
	In article	View Article PubMed

[8]	Bashir S, Qamar U, Khan FH, javed MY (2014) MV5: a clinical decision support framework for heart disease prediction using majority vote based classifier ensemble. Arab J Sci Eng 39(11): 7771-7783.
	In article	View Article

[9]	Olaniyi EO, Oyedotun OK, Adnan K (2015) Heart diseases diagnosis using neural networks arbitration. Int J Intell Syst Appl 7(12): 75-82.
	In article	View Article

[10]	Thomas J, Princy RT. Human heart disease prediction system using data mining techniques. 2016 International Conference on Circuit, Power, and Computing Technologies (ICCPCT). 2016.
	In article	View Article

[11]	Verma L, Srivastava S, Negi PC (2016) A hybrid data mining model to predict coronary artery disease cases using non-invasive clinical data. J Med Syst 40(7):17.
	In article	View Article PubMed

[12]	Verma L, Srivastava S (2016) A data mining model for coronary artery disease detection using noninvasive clinical parameters. Indian J Sci Technol 9(48): 1.
	In article	View Article

[13]	Pahwa K, Kumar R. “Prediction of heart disease using hybrid technique for selecting features,” 2017 4th IEEE Uttar Pradesh Section International Conference on Electrical, Computer and Electronics (UPCON), Mathura. 2017. p. 500-504.
	In article	View Article

[14]	Xu S, Zhang Z, Wang D, Hu J, Duan X, Zhu T. “Cardiovascular risk prediction method based on CFS subset evaluation and random forest classifcation framework,” 2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA), Beijing. 2017. p. 228-232.
	In article	View Article

[15]	Jabbar MA, Deekshatulu BL, Chandra P (2016) Prediction of heart disease using random forest and feature subset selection. In: Innovations in bio-inspired computing and applications. Springer, Cham, pp 187-196.
	In article	View Article

[16]	Latha CBC, Jeeva SC (2019) Improving the accuracy of prediction of heart disease risk based on ensemble classification techniques. Inf Med Unlocked 16:100203.
	In article	View Article

[17]	Terrada O, Hamida S, Cherradi B, Raihani A, Bouattane O (2020) Supervised machine learning based medical diagnosis support system for prediction of patients with heart disease. Adv Sci Technol Eng Syst J 5(5): 269-277.
	In article	View Article

[18]	Tama BA, Im S, Lee S (2020) Improving an intelligent detection system for coronary heart disease using a two-tier classifier ensemble. Biomed Res Int 2020: 1-10.
	In article	View Article PubMed

[19]	Saez JA, Krawczyk B, Woźniak M (2016) Analyzing the oversampling of different classes and types of examples in multi-class imbalanced datasets. Pattern Recogn 57: 164-178.
	In article	View Article

[20]	https://www.kaggle.com/johnsmith88/heart-disease-dataset [Accessed 02 June 2021].
	In article

[21]	Azur MJ, Stuart EA, Frangakis C, Leaf PJ. (2011). Multiple imputation by chained equations: what is it and how does it work. Int J Methods Psychiatr Res 20(1): 40-49.
	In article	View Article PubMed

[22]	M.R. Rahman, T. Islam, T. Zaman, M. Shahjaman, M.R. Karim, F. Huq, J.M. Quinn, R.D. Holsinger, E. Gov, M.A. Moni, Identification of molecular signatures and pathways to identify novel therapeutic targets in alzheimer’s disease: insights from a systems biomedicine perspective, Genomics 112 (2) (2019) 1290-1299.
	In article	View Article PubMed

[23]	Four Techniques for Outlier Detection, https://www.kdnuggets.com/2018/12/four-techniques-outlier-detection.html.
	In article

[24]	Md Satu, Syeda Atik, Mohammad Moni, A Novel Hybrid Machine Learning Model to Predict Diabetes Mellitus, 2019.
	In article

[25]	Zou H, Hastie T. Regularization and variable selection via the elastic net. J R Statist Soc: Ser B (Statist Methodol). 2005; 67: 301-320.
	In article	View Article

[26]	H. Zou, T. Hastie, Regularization and variable selection via the elastic net, Journal of the Royal Statistical Society: Series B (Statistical Methodology) 67 (2) (2005) 301-320.
	In article	View Article

[27]	Rani P, Kumar R, Jain A, Lamba R (2020) Taxonomy of machine learning algorithms and its applications. J Comput Theror Nanosci 17(6): 2509-2514.
	In article	View Article

[28]	Priya S. Comparative Study of Data Mining Classifcation Algorithms in Heart Disease Prediction.
	In article

[29]	Dangare C, Apte S. A Data Mining Approach for Prediction of Heart Disease Using Neural Networks. 2012.
	In article

This work is licensed under a Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Cite this article:

Normal Style

Sanjib Ghosh, Muhammad Alamgir Islam. Performance Evaluation and Comparison of Heart Disease Prediction Using Machine Learning Methods with Elastic Net Feature Selection. American Journal of Applied Mathematics and Statistics. Vol. 11, No. 2, 2023, pp 35-49. http://pubs.sciepub.com/ajams/11/2/1

MLA Style

Ghosh, Sanjib, and Muhammad Alamgir Islam. "Performance Evaluation and Comparison of Heart Disease Prediction Using Machine Learning Methods with Elastic Net Feature Selection." American Journal of Applied Mathematics and Statistics 11.2 (2023): 35-49.

APA Style

Ghosh, S. , & Islam, M. A. (2023). Performance Evaluation and Comparison of Heart Disease Prediction Using Machine Learning Methods with Elastic Net Feature Selection. American Journal of Applied Mathematics and Statistics, 11(2), 35-49.

Chicago Style

Like this article()

Figure 1. Proposed hybrid decision support system
View in article
Full Size Figure

Figure 2. Hyperplane separating two classes in SVM
View in article
Full Size Figure

Figure 3. Random Forest algorithm
View in article
Full Size Figure

Figure 4. AdaBoost algorithm
View in article
Full Size Figure

Figure 5. The basic structure of ANN
View in article
Full Size Figure

Figure 6. Accuracy of classifier using Class balancing
View in article
Full Size Figure

Figure 7. Sensitivity of classifier using Class balancing
View in article
Full Size Figure

Figure 8. Specificity of classifiers after class balancing
View in article
Full Size Figure

Figure 9. AUC score of classifiers after class balancing
View in article
Full Size Figure

Figure 10. Precision of classifiers after class balancing
View in article
Full Size Figure

Figure 11. f1-measure of classifiers after class balancing
View in article
Full Size Figure

Figure 12. MCC of classifiers after class balancing
View in article
Full Size Figure

Figure 13. Accuracy scores of classifiers using feature selection
View in article
Full Size Figure

Figure 14. Sensitivity of classifiers using feature selection
View in article
Full Size Figure

Figure 15. Specificity of classifiers using feature selection
View in article
Full Size Figure

Figure 16. Increase in AUC score of classifiers using feature selection
View in article
Full Size Figure

Figure 17. Increase in precision of classifiers using feature selection
View in article
Full Size Figure

Figure 18. Increase in f1-measure of classifiers using feature selection
View in article
Full Size Figure

Figure 19. Increase in the MCC score of classifiers using feature selection
View in article
Full Size Figure

Table 1. Features of Hungarian heart disease dataset
View in article
Full Size

Table 2. Confusion matrix
View in article
Full Size

Table 3. Performance of classifiers on full feature set
View in article
Full Size

Table 4. Performance improvement using class balancing
View in article
Full Size

Table 5. Performance of all classifiers with and without balancing
View in article
Full Size

Table 6. Selected features by Elastic Net algorithm and their scores
View in article
Full Size

Table 7. Optimum value of lambda for elastic net regression
View in article
Full Size

Table 8. Performance using Elastic Net feature selection algorithm
View in article
Full Size

[1]	https://www.who.int/health-topics/cardiovascular-diseases#tab=tab_1. [Accessed 02 June 2021].
	In article

[2]	S. I. Ayon, M. M. Islam, and M. R. Hossain, “Coronary artery heart disease prediction: a comparative study of computational intelligence techniques,” IETE Journal of Research, 2020.
	In article	View Article

[3]	Patil, S. B. & Kumaraswamy, Y. Intelligent and effective heart attack prediction system using data mining and artificial neural network. Eur. J. Sci. Res. 31, 642-656 (2009).
	In article

[4]	Vanisree, K. & Singaraju, J. Decision support system for congenital heart disease diagnosis based on signs and symptoms using neural networks. Int. J. Comput. Appl. 19, 6-12 (2015).
	In article

[5]	Edmonds. In Proceedings of AISB Symposium on Socially Inspired Computing 1-12 (Hatfield, 2005).
	In article

[6]	Methaila, A., Kansal, P., Arya, H. & Kumar, P. Early heart disease prediction using data mining techniques. Computer. Sci.Inf. Technol. J. (2014).
	In article	View Article

[7]	Ponikowski P, Anker SD, AlHabib KF, Cowie MR, Force TL, Hu S, Jaarsma T, Krum H, Rastogi V, Rohde LE, Samal UC, Shimokawa H, Siswanto BB, Sliwa K, Filippatos G (2014) Heart failure: preventing disease and death worldwide. ESC Heart Fail 1(1): 4-25.
	In article	View Article PubMed

[8]	Bashir S, Qamar U, Khan FH, javed MY (2014) MV5: a clinical decision support framework for heart disease prediction using majority vote based classifier ensemble. Arab J Sci Eng 39(11): 7771-7783.
	In article	View Article

[9]	Olaniyi EO, Oyedotun OK, Adnan K (2015) Heart diseases diagnosis using neural networks arbitration. Int J Intell Syst Appl 7(12): 75-82.
	In article	View Article

[10]	Thomas J, Princy RT. Human heart disease prediction system using data mining techniques. 2016 International Conference on Circuit, Power, and Computing Technologies (ICCPCT). 2016.
	In article	View Article

[11]	Verma L, Srivastava S, Negi PC (2016) A hybrid data mining model to predict coronary artery disease cases using non-invasive clinical data. J Med Syst 40(7):17.
	In article	View Article PubMed

[12]	Verma L, Srivastava S (2016) A data mining model for coronary artery disease detection using noninvasive clinical parameters. Indian J Sci Technol 9(48): 1.
	In article	View Article

[13]	Pahwa K, Kumar R. “Prediction of heart disease using hybrid technique for selecting features,” 2017 4th IEEE Uttar Pradesh Section International Conference on Electrical, Computer and Electronics (UPCON), Mathura. 2017. p. 500-504.
	In article	View Article

[14]	Xu S, Zhang Z, Wang D, Hu J, Duan X, Zhu T. “Cardiovascular risk prediction method based on CFS subset evaluation and random forest classifcation framework,” 2017 IEEE 2nd International Conference on Big Data Analysis (ICBDA), Beijing. 2017. p. 228-232.
	In article	View Article

[15]	Jabbar MA, Deekshatulu BL, Chandra P (2016) Prediction of heart disease using random forest and feature subset selection. In: Innovations in bio-inspired computing and applications. Springer, Cham, pp 187-196.
	In article	View Article

[16]	Latha CBC, Jeeva SC (2019) Improving the accuracy of prediction of heart disease risk based on ensemble classification techniques. Inf Med Unlocked 16:100203.
	In article	View Article

[17]	Terrada O, Hamida S, Cherradi B, Raihani A, Bouattane O (2020) Supervised machine learning based medical diagnosis support system for prediction of patients with heart disease. Adv Sci Technol Eng Syst J 5(5): 269-277.
	In article	View Article

[18]	Tama BA, Im S, Lee S (2020) Improving an intelligent detection system for coronary heart disease using a two-tier classifier ensemble. Biomed Res Int 2020: 1-10.
	In article	View Article PubMed

[19]	Saez JA, Krawczyk B, Woźniak M (2016) Analyzing the oversampling of different classes and types of examples in multi-class imbalanced datasets. Pattern Recogn 57: 164-178.
	In article	View Article

[20]	https://www.kaggle.com/johnsmith88/heart-disease-dataset [Accessed 02 June 2021].
	In article

[21]	Azur MJ, Stuart EA, Frangakis C, Leaf PJ. (2011). Multiple imputation by chained equations: what is it and how does it work. Int J Methods Psychiatr Res 20(1): 40-49.
	In article	View Article PubMed

[22]	M.R. Rahman, T. Islam, T. Zaman, M. Shahjaman, M.R. Karim, F. Huq, J.M. Quinn, R.D. Holsinger, E. Gov, M.A. Moni, Identification of molecular signatures and pathways to identify novel therapeutic targets in alzheimer’s disease: insights from a systems biomedicine perspective, Genomics 112 (2) (2019) 1290-1299.
	In article	View Article PubMed

[23]	Four Techniques for Outlier Detection, https://www.kdnuggets.com/2018/12/four-techniques-outlier-detection.html.
	In article

[24]	Md Satu, Syeda Atik, Mohammad Moni, A Novel Hybrid Machine Learning Model to Predict Diabetes Mellitus, 2019.
	In article

[25]	Zou H, Hastie T. Regularization and variable selection via the elastic net. J R Statist Soc: Ser B (Statist Methodol). 2005; 67: 301-320.
	In article	View Article

[26]	H. Zou, T. Hastie, Regularization and variable selection via the elastic net, Journal of the Royal Statistical Society: Series B (Statistical Methodology) 67 (2) (2005) 301-320.
	In article	View Article

[27]	Rani P, Kumar R, Jain A, Lamba R (2020) Taxonomy of machine learning algorithms and its applications. J Comput Theror Nanosci 17(6): 2509-2514.
	In article	View Article

[28]	Priya S. Comparative Study of Data Mining Classifcation Algorithms in Heart Disease Prediction.
	In article

[29]	Dangare C, Apte S. A Data Mining Approach for Prediction of Heart Disease Using Neural Networks. 2012.
	In article

Performance Evaluation and Comparison of Heart Disease Prediction Using Machine Learning Methods with Elastic Net Feature Selection

Abstract

1. Introduction

2. Related Work

3. Materials and Methods

Table 1. Features of Hungarian heart disease dataset

Table 2. Confusion matrix

4. Result and Discussions

Table 3. Performance of classifiers on full feature set

Table 4. Performance improvement using class balancing

Table 5. Performance of all classifiers with and without balancing

Table 6. Selected features by Elastic Net algorithm and their scores

Table 7. Optimum value of lambda for elastic net regression

Table 8. Performance using Elastic Net feature selection algorithm

5. Conclusion

References

Cite this article:

Normal Style

MLA Style

APA Style

Chicago Style