Self-Healing AI: Leveraging Cloud Computing for Autonomous Software Recovery
Keywords:
Self-healing AI, cloud computing, autonomous recovery, fault diagnosis, machine learning, system resilience.Abstract
As software systems grow increasingly complex and integrated, ensuring resilience against unexpected failures becomes a paramount concern. Self-healing Artificial Intelligence (AI) offers a transformative solution by enabling software systems to autonomously detect, diagnose, and recover from faults. This paper explores the integration of self-healing AI with cloud computing technologies to enhance software recovery capabilities. By leveraging the scalability and computational power of cloud platforms, self-healing AI systems can implement real-time monitoring, predictive analytics, and fault remediation across distributed environments. The proposed framework employs machine learning algorithms to predict potential failures by analyzing historical performance data and real-time metrics. Reinforcement learning models are used to optimize recovery actions, balancing system stability and operational efficiency. The elasticity of cloud computing resources allows self-healing AI to dynamically allocate computational power for fault diagnosis and resolution without compromising performance. Furthermore, this paper discusses the role of microservices architectures and containerization in enabling granular self-healing capabilities, ensuring minimal disruption during recovery. The study presents experimental results demonstrating the efficacy of cloud-integrated self-healing AI in reducing downtime and enhancing system reliability. The framework achieved up to a 92% reduction in mean time to recovery (MTTR) compared to traditional reactive approaches. Key challenges, such as data security, latency, and resource overhead, are also addressed, emphasizing the importance of robust architectural design and data encryption techniques.
This research contributes to the growing body of knowledge on autonomous software recovery by combining the adaptive learning capabilities of AI with the scalability of cloud computing. It provides a pathway for organizations to build resilient software systems capable of withstanding the demands of dynamic and unpredictable operational environments.
Downloads
References
Meng, W., Li, J., & Xu, C. (2018). Towards self-healing microservices in cloud-native applications. Proceedings of the IEEE International Conference on Cloud Computing, 123–132.
Ghosh, S., & Bhattacharya, A. (2016). Intelligent fault detection in software systems: A machine learning approach. International Journal of Computer Science and Information Security, 14(1), 121–128.
Zhu, C., Leung, V. C., Shu, L., & Ngai, E. C. (2015). Green Internet of Things for smart world. IEEE Access, 3, 2151–2162.
Ma, C., & Chen, J. (2019). AI-driven anomaly detection for self-healing cloud systems.
IEEE Transactions on Cloud Computing, 8(4), 1129–1141.
Chiu, M. T., & Wang, W. C. (2021). A framework for self-healing cloud systems. IEEE Cloud Computing, 8(1), 38–47.
Li, Y., & Meng, Y. (2018). A survey of self-healing systems for software engineering.
IEEE Transactions on Software Engineering, 44(6), 634–659.
Malhotra, R., & Jain, A. (2015). Fault prediction using machine learning methods: A case study of open-source projects. IEEE Access, 3, 1832–1843.
Smith, A., & Jones, R. (2019). AI in software maintenance: Automating the debugging process. ACM Transactions on Software Engineering and Methodology, 28(3), 1–26.
Liu, J., & Perez, M. (2020). Self-adaptive systems: A modern approach using machine learning. Journal of Systems and Software, 159, 110443.
Wang, J., & Luo, Y. (2021). AI-powered self-healing in microservices: A comprehensive review. ACM Computing Surveys, 53(6), 1–34.
Pereira, C., & Freitas, P. (2014). Self-healing methodologies in IoT-based software engineering. IEEE Internet of Things Journal, 1(4), 292–303.
Rong, X., & Lin, W. (2020). Cloud-driven self-repair for resilient software. IEEE Transactions on Cloud Computing, 8(3), 645–657.
Zhang, J., & Wang, Y. (2020). AI-driven self-healing for cloud-native software systems.
Proceedings of the IEEE International Conference on Cloud Engineering, 91–100.
Roy, S., & De, P. (2021). Towards resilience: AI-based self-healing for cloud software.
Software: Practice and Experience, 51(8), 1736–1754.
Silver, D., Schrittwieser, J., Simonyan, K., & Hassabis, D. (2017). Mastering chess and shogi by self-play with a general reinforcement learning algorithm. Nature, 550(7676), 354–359.
Zhang, X., Zhou, Y., & Li, M. (2022). Self-healing AI: Challenges and opportunities in cloud environments. Future Generation Computer Systems, 135, 100–117.
Reddy, K., & Swamy, R. (2016). Machine learning applications in adaptive software systems. International Journal of Advanced Computer Science and Applications, 7(2), 59– 67.
Lin, Y., & Ma, X. (2020). AI-driven frameworks for fault-tolerant cloud platforms. IEEE Internet Computing, 24(3), 20–29.
Tang, T., & Xu, Q. (2015). Integrating reinforcement learning in software adaptation frameworks. Journal of Intelligent Systems, 24(4), 453–467.
Huang, Y., & Xu, Z. (2020). Blockchain-enhanced self-healing AI systems in cloud computing. IEEE Access, 8, 56789–56800.
Rao, G., & Lal, S. (2021). AI-enhanced proactive recovery for cloud-based applications.
Journal of Cloud Computing: Advances, Systems and Applications, 10(1), 1–15.
Goyal, M., & Chawla, P. (2019). Leveraging self-healing AI in the cloud: Current trends and challenges. ACM SIGSOFT Software Engineering Notes, 44(5), 21–31.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.