Medical Dataset Classification Using Ensemble Feature Selection and Back Propagation Neural Network Algorithm

Authors

  • T. Christopher, N. Kumar

Keywords:

Medical dataset classification, EFS, Entropy Elephant Herding Optimization (EEHO), Adaptive Firefly Optimization Algorithm (AFOA) and Entropy Butterfly Optimization Algorithm (EBFO) and Back Propagation Neural Network (BPNN)

Abstract

Due to the ongoing creation of digital data, the amount of medical data has significantly expanded in recent years. the many types of medical information, including reports, text, numbers, monitoring, and laboratory results. Because of a problem with a single optimisation technique in the current system, classification accuracy is not considerably guaranteed. Another significant issue is error rates, which prevents early illness prediction from being carried out effectively. This research work uses EFS (Ensemble Feature Selection) with BPNN (Back Propagation Neural Networks) to handle the afore mentioned issues. The input data is pre-processed using KMC (K-Means Clustering) algorithm, mainly for handling missing values and subsequently, EFS method is used to choose the features since it produces the best fitness values using an objective function. To solve the FS problem, EFS relies on integrating many FS rather than just one FS. Combining the results of multiple single FS approaches, such as EEHO (Entropy Elephant Herding Optimisation) and AFOA (Adaptive Firefly Optimisation Algorithm), is one alternative for the EFS method. And EBFO (Entropy Butterfly Optimization Algorithm) acquire improved outcomes rather than utilizing a single FS methodology. Finally, the medical dataset classification is performed using BPNN algorithm. With the help of the BPNN algorithm, a multilayer FFNN (feed forward neural networks) is trained. The class labels in tuples are predicted using weights that are learnt iteratively. The experimental findingsof the proposed EFS-BPNN algorithm demonstrates better values for accuracy, sensitivity, specificity, and execution time when compared with existing methods.

Downloads

Download data is not yet available.

References

Pendyala, Vishnu S., and Silvia Figueira. "Automated medical diagnosis from clinical data." 2017 IEEE Third International Conference on Big Data Computing Service and Applications (BigDataService). IEEE, 2017.

El-Sappagh, Shaker H., and Samir El-Masri. "A distributed clinical decision support system architecture." Journal of King Saud University-Computer and Information Sciences 26.1 (2014): 69-78.

Nahato, Kindie Biredagn, Khanna Nehemiah Harichandran, and Kannan Arputharaj. "Knowledge mining from clinical datasets using rough sets and BP neural network." Computational and mathematical methods in medicine 2015 (2015).

Park, H.W., Li, D., Piao, Y. and Ryu, K.H., 2017, A hybrid feature selection method to classification and its application in hypertension diagnosis. In International Conference on Information Technology in Bio-and Medical Informatics (pp. 11-19). Springer, Cham

Seijo-Pardo B., I. Porto-Díaz, V. Bolón-Canedo, A. Alonso-Betanzos, Ensemble feature selection: homogeneous and heterogeneous approaches, Knowl. Based Syst. 118 (2017) 124–139

Arora, J., Agrawal, U., Tiwari, P., Gupta, D. and Khanna, A., 2020, Ensemble Feature Selection Method based on recently developed Nature-inspired algorithms. In International Conference on Innovative Computing and Communications (pp. 457-470). Springer, Singapore

Amato, Filippo, et al. "Artificial neural networks in medical diagnosis." Journal of applied biomedicine 11.2 (2013): 47-58

Chandra Sekhar, Ch, et al. "Effectiveness of BP Algorithm in Healthcare Data Classification." Green Technology for Smart City and Society. Springer, Singapore, 2021. 289-298.

Leema, N., H. Khanna Nehemiah, and Arputharaj Kannan. "Neural network classifier optimization using differential evolution with global information and back propagation algorithm for clinical datasets." Applied Soft Computing 49 (2016): 834-844.

Peng, Lingxi, et al. "An immune-inspired semi-supervised algorithm for breast cancer diagnosis." Computer methods and programs in biomedicine 134 (2016): 259-265.

Sakri, Sapiah Binti, Nuraini Binti Abdul Rashid, and Zuhaira Muhammad Zain. "Particle swarm optimization feature selection for breast cancer recurrence prediction." IEEE Access 6 (2018): 29637-29647

Elgin Christo, V. R., et al. "Correlation-based ensemble feature selection using bioinspired algorithms and classification using BP neural network." Computational and mathematical methods in medicine 2019 (2019)

Devikanniga, D., and R. Joshua Samuel Raj. "Classification of osteoporosis by artificial neural network based on monarch butterfly optimisation algorithm." Healthcare technology letters 5.2 (2018): 70-75.

Nayak, Monalisa, et al. "Elephant herding optimization technique based neural network for cancer prediction." Informatics in Medicine Unlocked 21 (2020): 100445.

Mohamad, Ismail Bin, and Dauda Usman. "Standardization and its effects on K-means clustering algorithm." Research Journal of Applied Sciences, Engineering and Technology6.17 (2013): 3299-3303

Ng, W.W., Tuo, Y., Zhang, J. and Kwong, S., 2020. Training error and sensitivity-based ensemble feature selection. International Journal of Machine Learning and Cybernetics, 11(10), pp.2313-2326

Arora, S. and Singh, S., 2019. Butterfly optimization algorithm: a novel approach for global optimization. Soft Computing, 23(3), pp.715-734.

Tubishat, M., Alswaitti, M., Mirjalili, S., Al-Garadi, M.A. and Rana, T.A., 2020. Dynamic butterfly optimization algorithm for feature selection. IEEE Access, 8, pp.194303-194314

Liu, Jingsen, et al. "A dynamic adaptive firefly algorithm with globally orientation." Mathematics and Computers in Simulation 174 (2020): 76-101.

Liu, Changnian, et al. "Adaptive firefly optimization algorithm based on stochastic inertia weight." 2013 Sixth International Symposium on Computational Intelligence and Design. Vol. 1. IEEE, 2013

Wang, G.G., Deb, S. and Coelho, L.D.S., 2015, Elephant herding optimization. In 2015 3rd International Symposium on Computational and Business Intelligence (ISCBI) , pp. 1-5.

Li, J., Lei, H., Alavi, A.H. and Wang, G.G., 2020. Elephant herding optimization: variants, hybrids, and applications. Mathematics, 8(9), pp.1-25

Shang, S., Shi, M., Shang, W. and Hong, Z., 2016. Improved feature weight algorithm and its application to text classification. Mathematical Problems in Engineering, vol.2016,no. 7819626, pp.1-12

Chandra, Suresh, and R. Khemchandani. "Twin support vector machines for pattern classification." IEEE Trans. Pattern Anal. Mach. Intell 29.5 (2007): 905-910

de Lima, Márcio Dias, Juliana de Oliveira Roque e Lima, and Rommel M. Barbosa. "Medical data set classification using a new feature selection algorithm combined with twin-bounded support vector machine." Medical & Biological Engineering & Computing 58.3 (2020): 519-528

Menaga, D., Ambati, L.S. & Bojja, G.R. Optimal trained long short-term memory for opinion mining: a hybrid semantic knowledgebase approach. Int J Intell Robot Appl 7, 119–133 (2023). https://doi.org/10.1007/s41315-022-00248-w

D. Menaga and A. Lakshminarayanan, "A Method for Predicting Movie Box-Office using Machine Learning," 2023 4th International Conference on Electronics and Sustainable Communication Systems (ICESC), Coimbatore, India, 2023, pp. 1228-1232, doi: 10.1109/ICESC57686.2023.10192928.

D. Kamalakkannan, D. Menaga, S. Shobana, K. V. Daya Sagar, R. Rajagopal & Mohit Tiwari (2023) A Detection of Intrusions Based on Deep Learning, Cybernetics and Systems, DOI: 10.1080/01969722.2023.2175134

Manivannan R, Venkateshwaran G, Sivakumar S, Kumar MH, Jacob MS. Privacy-Preserving Image Storage on Cloud Using An Unified Cryptographic Authentication Scheme. Salud, Ciencia y Tecnología - Serie de Conferencias 2024; 3:609 . https://doi.org/10.56294/sctconf2024609.

Menaga, D., Saravanan, S. GA-PPARM: constraint-based objective function and genetic algorithm for privacy preserved association rule mining. Evol. Intel. 15, 1487–1498 (2022). https://doi.org/10.1007/s12065-021-00576-z

Journal of Autonomous Intelligence (2024) Volume 7 Issue 1

doi: 10.32629/jai.v7i1.734,Deep learning-based cancer disease classification using Gene Expression Data, J. Dafni Rose*,K. Vijayakumar, D. Menaga

Karn, A. L., Bagale, G. S., Kondamudi, B. R., Srivastava, D. K., Gupta, R. K., & Sengan, S. (2022). Measuring the determining factors of financial development of commercial banks in selected SAARC countries. Journal of Database Management (JDM), 33(1), 1-21.

Raj, K. B., Seth, J. K., Gulati, K., Choubey, S., & Patni, I. (2022, July). Automated Cyberstalking Classification using Social Media. In 2022 International Conference on Innovative Computing, Intelligent Communication and Smart Electrical Systems (ICSES) (pp. 1-6). IEEE.

Chaubey, P. K., Arora, T. K., Raj, K. B., Asha, G. R., Mishra, G., Guptav, S. C., ... & Alhassan, M. (2022). Sentiment Analysis of Image with Text Caption using Deep Learning Techniques. Computational Intelligence and Neuroscience, 2022.

Rajkumar, N., Sriram, V. P., Badrinarayanan, M. K., Raj, K. B., & Patel, R. (2022). A novel framework for the automated healthcare disaster based on intellectual machine learning. World Journal of Engineering, 20(5), 801-807.

Bhavana Raj, K. (2022). Industry 4.0: Smart Manufacturing in Industries‐The Future. Machine Learning and Data Science: Fundamentals and Applications, 67-74.

Reddy, S., & Raj, K. B. (2021). Security and Well-being in Tech-Savvy Urban Communities. In Interdisciplinary Research in Technology and Management (pp. 306-309). CRC Press.

Sanil, H. S., Singh, D., Raj, K. B., Choubey, S., Bhasin, N. K. K., Yadav, R., & Gulati, K. (2021). Role of machine learning in changing social and business eco-system–a qualitative study to explore the factors contributing to competitive advantage during COVID pandemic. World Journal of Engineering, 19(2), 238-243.

Yadav, S., Sudman, M. S. I., Dubey, P. K., Srinivas, R. V., Srisainath, R., & Devi, V. C. (2023, October). Development of an GA-RBF based Model for Penetration of Electric Vehicles and its Projections. In 2023 International Conference on Self Sustainable Artificial Intelligence Systems (ICSSAS) (pp. 1-6). IEEE.

Jana, S., Ghosh, A., & Guha, B. (2021). IPL 2019: Evaluating the performance of teams by DEA & SEM. Malaya Journal of Matematik Vol. S, 1, 41-45.

Jana, S., Ghosh, A., & Guha, B. (2021). IPL 2019: Evaluating the performance of teams by DEA & SEM. Malaya Journal of Matematik Vol. S, 1, 41-45.

Ghosh, A., Dey, M., Guha, B., Jana, S., & Sarkar, A. (2021). Performance evaluation & rankings of players in IPL 2019 by DEA & SEM. Malaya Journal of Matematik, Vol. S, 1, 46-56.

Nandi, B., Jana, S., & Das, K. P. (2023). Machine learning-based approaches for financial market prediction: A comprehensive review. Journal of AppliedMath, 1(2).

Baid, Y., Ghosh, A., Jana, S., & Giri, A. Evaluation of the conspicuous EPL matches for sponsorships using data envelopment analysis.

Jana, S., Sharma, E. S., Khan, A., Maji, A. K., & Pal, R. K. (2022, December). Generating a Suitable Hash Function Using Sudoku for Blockchain Network. In International Conference on Frontiers in Computing and Systems (pp. 161-171). Singapore: Springer Nature Singapore.

Ghosh, A., Dey, M., Guha, B., Jana, S., & Sarkar, A. (2021). Performance evaluation & rankings of players in IPL 2019 by DEA & SEM. Malaya Journal of Matematik, Vol. S, 1, 46-56.

Subrata, J., Ghosh, A., Ghorui, N., & Serdeira Azevedo, P. Ranking of Financial Apps Using Fuzzy Ahp and Fuzzy Marcos: An Application of Multi-Criteria Decision-Making (Mcdm) Techniques.

Swain, S., Gupta, R. K., Ratnayake, K., Priyanka, P. D., Singh, R., Jana, S., ... & Giri, L. (2018). Confocal imaging and k-means clustering of GABAB and mGluR mediated modulation of Ca2+ spiking in hippocampal neurons. ACS chemical neuroscience, 9(12), 3094-3107.

Swain, S., Gupta, R. K., Ratnayake, K., Priyanka, P. D., Singh, R., Jana, S., ... & Giri, L. (2018). Confocal imaging and k-means clustering of GABAB and mGluR mediated modulation of Ca2+ spiking in hippocampal neurons. ACS chemical neuroscience, 9(12), 3094-3107.

Tătăranu, E., Diaconescu, S., Ivănescu, C. G., Sârbu, I., & Stamatin, M. (2016). Clinical, immunological and pathological profile of infants suffering from cow's milk protein allergy. Romanian journal of morphology and embryology = Revue roumaine de morphologie et embryologie, 57(3), 1031–1035.

Ciongradi, C. I., Sârbu, I., Iliescu Halițchi, C. O., Benchia, D., & Sârbu, K. (2021). Fertility of Cryptorchid Testis-An Unsolved Mistery. Genes, 12(12), 1894. https://doi.org/10.3390/genes12121894

Ciongradi, C. I., Filip, F., Sârbu, I., Iliescu Halițchi, C. O., Munteanu, V., & Candussi, I. L. (2022). The Impact of Water and Other Fluids on Pediatric Nephrolithiasis. Nutrients, 14(19), 4161. https://doi.org/10.3390/nu14194161

Ciongradi, C. I., Benchia, D., Stupu, C. A., Iliescu Halițchi, C. O., & Sârbu, I. (2022). Quality of Life in Pediatric Patients with Continent Urinary Diversion-A Single Center Experience. International journal of environmental research and public health, 19(15), 9628. https://doi.org/10.3390/ijerph19159628

Popa, Ș., Apostol, D., Bîcă, O., Benchia, D., Sârbu, I., & Ciongradi, C. I. (2021). Prenatally Diagnosed Infantile Myofibroma of Sartorius Muscle-A Differential for Soft Tissue Masses in Early Infancy. Diagnostics (Basel, Switzerland), 11(12), 2389. https://doi.org/10.3390/diagnostics11122389

Mohammed, A. H. (2021). Fish Schooling And Sorensen Trust Based Wireless Sensor Network Optimization. International Journal, 9, 6.

Mohammed, A. H. DDoS Malicious Node Detection by Jaccard and Page Rank Algorithm in Cloud Environment.

Mohammed, A. H. (2021). Invasive Weed Optimization Based Ransom-Ware Detection in Cloud Environment.

Purohit, S. (2023). California Geographical Society, 96162, California, United States. Journal of Environmental Science and Public Health, 7, 176-184.

Purohit, S. Role of Industrialization and Urbanization in Regional Sustainable Development–Reflections from Tier-II Cities in India.

Purohit, M. S. (2012). Resource management in the desert ecosystem of Nagaur district_ An ecological study of land _agriculture_ water and human resources (Doctoral dissertation, Maharaja Ganga Singh University).

Faisal, L., Rama, V. S. B., Roy, S., & Nath, S. (2022). Modelling of Electric Vehicle Using Modified SEPIC Converter Configuration to Enhance DC–DC Converter Performance Using MATLAB. In Smart Energy and Advancement in Power Technologies: Select Proceedings of ICSEAPT 2021, Volume 2 (pp. 643-653). Singapore: Springer Nature Singapore.

Faisal, L., Rama, V. S. B., Yang, J. M., Wajid, A., & Ghorui, S. K. (2022, May). Performance and Simulation Analysis of IPMSyncRM (Internal Permanent Magnet Synchronous Reluctance Motor) for Advanced Electric Vehicle Design. In 2022 3rd International Conference for Emerging Technology (INCET) (pp. 1-6). IEEE.

Faisal, L., Rama, V. S. B., Yang, J. M., Wajid, A., & Ghorui, S. K. (2022, May). Performance and Simulation Analysis of IPMSyncRM (Internal Permanent Magnet Synchronous Reluctance Motor) for Advanced Electric Vehicle Design. In 2022 3rd International Conference for Emerging Technology (INCET) (pp. 1-6). IEEE.

Mohd, R., & Faisal, L. (2022). Smart Agricultural Practices using Machine Learning techniques For Rainfall Prediction: A case Study of Valkenburg station, Netherlands. Mathematical Statistician and Engineering Applications, 71(4), 8451-8462.

Downloads

Published

09.07.2024

How to Cite

T. Christopher. (2024). Medical Dataset Classification Using Ensemble Feature Selection and Back Propagation Neural Network Algorithm. International Journal of Intelligent Systems and Applications in Engineering, 12(22s), 1403 –. Retrieved from https://www.ijisae.org/index.php/IJISAE/article/view/6658

Issue

Section

Research Article