Feature Selection using FFS and PCA in Biomedical Data Classification with AdaBoost-SVM

Authors

  • Rahime Ceylan
  • Mucahid Barstugan

DOI:

https://doi.org/10.18201/ijisae.2018637928

Keywords:

AdaBoost, Biomedical Data Classification, Classification Performance, Feature Selection, Hybrid Structure, Machine Learning

Abstract

: Recently, there has been an increasing trend to propose computer aided diagnosis systems for biomedical pattern recognition. A computer aided diagnosis method, which aims higher classification accuracy, is developed to classify the biomedical dataset. This new method includes two types of machine learning algorithms: feature selection and classification. In this method, firstly, features were extracted from biomedical dataset, then the extracted features were classified by hybrid AdaBoost-Support Vector Machines (SVM) classifier structure. For feature selection, Forward Feature Selection (FFS) and Principal Component Analysis (PCA) algorithms were used. Following it, advantages and disadvantages of these algorithms were evaluated. The proposed two different hybrid structures and other studies in literature were compared with our findings. Wisconsin Breast Cancer (WBC), Pima Diabetes (PD), Heart (Statlog) biomedical datasets and Electrocardiogram (ECG) signals were taken from UCI database and these datasets were used to test the proposed hybrid structure. The obtained results show that the proposed hybrid structure has high classification accuracy for biomedical data classification.

Downloads

Download data is not yet available.

References

B. Yuan and X. Ma, "Sampling+ reweighting: boosting the performance of AdaBoost on imbalanced datasets," Neural Networks (IJCNN), IEEE International Joint Conference, 2012, pp.1-6.

PP. Dhakate, K. Rajeswari and D. Abin, "An ensemble approach for cancerious dataset analysis using feature selection," Communication Technologies (GCCT), IEEE Global Conference, 2015, pp. 479-482.

Y. Gao and F. Gao, "Edited AdaBoost by weighted kNN," Neurocomputing, vol. 73, no. 16, Oct. 2010, pp. 3079-3088.

X.F. Chen, H.J. Xing and X.Z. Wang, "A modified AdaBoost method for one-class SVM and its application to novelty detection," Systems, Man, and Cybernetics (SMC), IEEE International Conference, 2011, pp. 3506-3511.

A. Lahiri and PK. Biswas, "A scalable model for knowledge sharing based supervised learning using AdaBoost," Advances in Pattern Recognition (ICAPR), IEEE Eighth International Conference, 2015, pp. 1-6.

H. Zhang and J. Lu, "Creating ensembles of classifiers via fuzzy clustering and deflection," Fuzzy sets and Systems, vol. 161, no. 13, 2010, pp. 1790-1802.

B. Chen and H.X. Zhang, "An approach of multiple classifiers ensemble based on feature selection," Fuzzy Systems and Knowledge Discovery, IEEE FSKD'08 Fifth International Conference, 2008, pp. 390-394.

J. Ghavidel, S. Yazdani and M. Analoui, "A new ensemble classifier creation method by creating new training set for each base classifier," Information and Knowledge Technology (IKT), IEEE 5th Conference, 2013, pp. 290-294.

SL. Ham and N. Kwak, "Boosted-pca for binary classification problems" Circuits and Systems (ISCAS), IEEE International Symposium, 2012, pp. 1219-1222.

X. Shu and P. Wang, "An improved Adaboost algorithm based on uncertain functions," Industrial Informatics-Computing Technology, Intelligent Technology, Industrial Information Integration (ICIICII), IEEE International Conference, 2015, pp 136-139.

K. Deng, “OMEGA: On-Line Memory-Based General Purpose System Classifier,” PhD thesis, Carnegia Mellon University, 1998.

L. Smith, “A tutorial on Principal Component Analysis”, 2002, pp. 001–027.

B. Kégl, “Introduction to AdaBoost”, 2009, pp. 011-014.

J. Sochman and J. Malas, “AdaBoost with totally corrective updates for fast face detection”, Automatic Face and Gesture Recognition, 2004, pp. 445-450.

S.R. Kulkarni and G. Harman, “Statistical learning theory: a tutorial,” in Wiley Interdisciplinary Review: Computational Statistics, vol. 3, no. 6, 2011, pp. 543-556.

S. Kutscher, “Algorithms for ECG Feature Extraction: an Overview”, 2013, pp. 001-008.

A. Beygelzimer, J. Langford and B. Zadrozny, “Weighted one-against-all”, American Association for Artificial Intelligence, 2005, pp. 720-725.

Downloads

Published

29.03.2018

How to Cite

Ceylan, R., & Barstugan, M. (2018). Feature Selection using FFS and PCA in Biomedical Data Classification with AdaBoost-SVM. International Journal of Intelligent Systems and Applications in Engineering, 6(1), 33–39. https://doi.org/10.18201/ijisae.2018637928

Issue

Section

Research Article