Deep Learning and Machine Learning Approach to Breast Cancer Classification with Random Search Hyperparameter Tuning
Keywords:
Breast cancer, Convolution neural network, Deep learning, Deep neural network, Machine learning, Random search hyperparameter tuningAbstract
Breast cancer is one of the main causes of death and a threat to women, artificial intelligence has grown in importance in the field of health over time. With the aid of random search hyperparameter tuning, this study proposed a machine learning and deep learning approach to breast cancer classification. Two datasets related to breast cancer were used in this study, and findings proved that optimizing random search hyperparameters enhances the models performance, in addition it is seen that machine learning classifiers are not as effective in classifying breast cancer as compared to deep learning. Among the deep learning approaches Convolutional neural networks showed the highest accuracy over deep neural networks on both datasets. It was further observed that random search hyperparameter tuning performed better on the Breakhis_400x dataset than on the breast histopathology dataset which could be attributed to the notion that hyperparameter tuning using random search performs better on small data than large data.
Downloads
References
R.L. Siegel, K.D. Miller, N.S v. and A. Jemal, "Cancer Statistics," CA: A Cancer Journal for Clinicians, vol. 73, no. 1, pp. 17-48, 2023. https://doi.org/10.3322/caac.21763
World health organization. “Breast cancer”. 2023. Available: https://www.who.int/news-room/fact-sheets/detail/breast-cancer
World Health organization. “Cancer”. 2022. Available: https://www.who.int/news-room/fact-sheets/detail/cancer
National Cancer Institute. “Common Cancer Types”. (n.d). Available at https://www.cancer.gov/types/common-cancers
G.W. Prager, S Braga, B. Bystricky, C. Qvortrup, C. Criscitiello, E. Esin, G.S. Sonke, G.A. Martínez, J.S. Frenel, M. Karamouzis , M. Strijbos , O. Yazici , P. Bossi , S. Banerjee, T. Troiani, A. Eniu, F. Ciardiello, J. Tabernero, C.C. Zielinski, P.G. Casali, F. Cardoso, J.Y. Douillard, S. Jezdic , K. McGregor, G. Bricalli, M.Vyas, A. Ilbawi. “Global cancer control: responding to the growing burden, rising costs and inequalities in access”. ESMO Open. 2018 Feb 2;3(2):e000285. doi: 10.1136/esmoopen-2017-000285. PMID: 29464109; PMCID: PMC5812392.
Y. Xu, X. Liu, X. Cao, C. Huang, E. Liu, S. Qian,X. Liu, Y. Wu, F. Dong, C. Qiu, K.Hua, W.Su, J. Wu, H. Xu , Y. Han , C. Fu, Z. Yin, M. Liu, R. Roepman., S. Dietmann, F. Kengara, Z. Zhang , L. Zhang, T. Zhao, J. Dai, J. Yang, L. Lan, M. Luo, Z. Liu, T. An, B. Zhang, X. He, S. Cong, X. Liu, W. Zhang, J.P. Lewis J. P. Tiedje, Q. Wang, Z. An, F. Wang, L. Zhang, T. Huang, C. Lu, Z. Cai, F. Wang and J. Zhang. “Artificial intelligence: A powerful paradigm for scientific research”. The Innovation. 2021, Vol 2, issue 4. https://doi.org/10.1016/j.xinn.2021.100179
L. Yang. “Image classification of MNIST dataset by using machine learning techniques”. UC Merced. 2021. https://escholarship.org/uc/item/81b0z2hh
Assegie A.T. (2021). An optimized K-Nearest Neighbor based breast cancer detection. Journal of Robotics and Control (JRC) Vol 2, Issue 3, pp 115-118. https://doi: 10.18196/jrc.2363
H.H. Farag, L.A.A. Said, M.R.M. Rizk and M.B.E. Ahmed. “Hyperparameters Optimization for ResNet and Xception in the Purpose of Diagnosing COVID-19”.IOS Press. 2021, vol. 41, no. 2, pp. 3555-3571. https://doi:10.3233/JIFS-2109255
A.K. Nugroho and H. Suhartanto. “Hyper-Parameter Tuning based on Random Search for DenseNet Optimization”. 2020 7th International Conference on Information Technology, Computer, and Electrical Engineering (ICITACEE). 2020, pp. 96-99.
M. Karthik, K. Thangavel and K. Sasirekha. “Novel Deep CNN Model based Breast Cancer Classification.” 2023 7th International Conference on Computing Methodologies and Communication (ICCMC). 2023, pp. 524-529.
K. Sreekala and J. Sahoo. "Hyper Parameter Optimization of Convolutional Neural Networks for Breast Cancer Classification," 2021 International Conference on Advances in Computing and Communications (ICACC), Kochi, Kakkanad, India. 2021, pp. 1-6, https://doi: 10.1109/ICACC-202152719.2021.9708331.
G. Jayandhi, L.J.S. Jasmine and M.S. Jones. X. “Mammogram image classification system using deep learning for breast cancer diagnosis”. AIP Conf. Proc. 2519, 030066 2022. https://doi.org/10.1063/5.0109640
C. Maklin. “Synthetic Minority Over-sampling Technique (SMOTE)”. 2022. Available at https://medium.com/@corymaklin/synthetic-minority-over-sampling-technique-smote-7d419696b88c
A. Özdemir, K. Polat and A. Alhudhaif. “Classification of imbalanced hyperspectral images using SMOTE-based deep learning methods”. Expert Systems with Applications. 2021, Vol 178, https://doi.org/10.1016/j.eswa.2021.114986
I.H. Sarker. “Machine Learning: Algorithms, Real-World Applications and Research Directions”. SN comput. Sci. 2021 Vol 2, article 160. https://doi.org/10.1007/s42979-021-00592-x
V. Jain. “Introduction to KNN Algorithms". 2022. Available at https://www.analyticsvidhya.com/blog/2022/01/introduction-to-knn-algorithms/
S.F. Khorshid and A.M.Abdulazeez. “Breast cancer diagnosis based on k-nearest neighbors: a Review”. Palarch’s Journal of Archaeology of Egypt/Egyptology 18(4), 2021 pp.1927-1951. ISSN 1567-214x
C. Martins. “Gaussian Naive Bayes Explained with Scikit-Learn”. 2023. Available at https://builtin.com/artificial-intelligence/gaussian-naive-bayes
K. Jain. “How to Improve Naive Bayes? Section 3: Tuning the Model in Python”. 2021. Available at https://medium.com/analytics-vidhya/how-to-improve-naive-bayes-9fa698e14cba
V. Viswanatha, A.C. Ramachandra, B. Avinash and S. Shashank. “Breast cancer classification using logistic regression”. High Technology Letters. 2023, Vol 29, Issue 8. https://gjstx-e.cn/
G.G. Pekhimenko (n.d). Penalized Logistic Regression for Classification.
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel., P. Prettenhofer, r. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. “Scikit-learn: Machine Learning in Python”. Journal of Machine Learning Research. 2021, Vol 12 pp. 2825—2830. Available: https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html
I.H. Sarker. “Deep Learning: A Comprehensive Overview on Techniques, Taxonomy, Applications and Research Directions”. SN COMPUT. SCI. 2, article 420,2021. https://doi.org/10.1007/s42979-021-00815-1
L. Alzubaidi, J. Zhang, A.J. Humaidi, A. Al-Dujaili, Y. Duan, O. Al-Shamma, J. Santamaría, A. Mohammed, F.M. Al-Amidie and L. Farhan. (2021). “Review of deep learning: concepts, CNN architectures, challenges, applications, future directions”. J Big Data 8, article 53, 2021. https://doi.org/10.1186/s40537-021-00444-8
Simplilearn. “What is epoch in Machine Learning?”.2023 Available: https://www.simplilearn.com/tutorials/machine-learning-tutorial/what-is-epoch-in-machine-learning#:~:text=Batch%20size%20is%20a%20hyperparameter,more%20samples%20and%20making%20predictions.
K. Shen. “Effect of batch size on training dynamics”.2018. Available: https://medium.com/mini-distill/effect-of-batch-size-on-training-dynamics-21c14f7a716e
J Brownlee. “Difference Between a Batch and an Epoch in a Neural Network”.2022. Available: https://machinelearningmastery.com/difference-between-a-batch-and-an-epoch/
J. Lim, S. Jeong, S. Lim, H. Cho, J.Y. Shim, S. Hong S.C. Kwon, H. Lee, I. Moon, J. Kim, Y. Amashita. and M. Kano. “Development of Dye Exhaustion Behavior Prediction Model using Deep Neural Network”. Computer Aided Chemical Engineering. 2022, Vol 49, pp. 1825-1830. https://doi.org/10.1016/B978-0-323-85159-6.50304-3
W. Jia, M. Sun, J. Lian and S. Hou. “Feature dimensionality reduction: a review”. Complex Intell. Syst. 8, 2022, pp 2663–2693. https://doi.org/10.1007/s40747-021-00637-x
F. Nazari and W. Yan. “Convolutional versus Dense Neural Networks: Comparing the Two Neural Networks’ Performance in Predicting Building Operational Energy Use Based on the Building Shape”. n.d. Available: https://arxiv.org/ftp/arxiv/papers/2108/2108.12929.pdf
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.