AI-Driven Autonomous IT Operations: A Human-in-the-Loop AIOps 2.0 Framework

Authors

  • Samrat Mukherjee

Keywords:

AIOps, Human-in-the-Loop, Autonomous IT Operations, Intelligent Incident Management, Predictive Analytics

Abstract

Artificial Intelligence for IT Operations (AIOps) has become a game-changer for the management of large and complex digital environments, driven by intelligent automation, predictive analytics, and decision-making. Current AIOps solutions, however, face challenges with explainability, governance, interoperability, and rely too much on fully automated remediation. This paper introduces a new framework for Human-in-the-Loop AIOps 2.0, combining anomaly detection with reinforcement learning and human validation, to ensure reliable autonomous IT operations. The proposed methodology involves utilising the Kaggle IT Incident Log Dataset, Data Preprocessing, Feature Engineering, anomaly detection using Isolation Forest (IF) algorithm and a Deep Q-Network (DQN)-based recommendation engine to create intelligent remediation actions. Human operators review AI-generated decisions for validation before execution, enhancing the trust, transparency, and governance of the operation. Through experimental assessment, the proposed framework has been proven to be the most effective with an accuracy (ACC) of 94.6%, precision (PRE) of 94.1%, recall (REC) of 94.8%, and F1 Score (F1) of 94.4% better than traditional monitoring systems and fully AI-driven monitoring systems. Additionally, the human validation process can greatly minimize false positives and mean time to resolution, and enhance downtime reduction. The findings of the study underline the need to build autonomous intelligence and integrate it with human intelligence for next-generation resilient, explainable and adaptive IT operations management.

DOI: https://doi.org/10.17762/ijisae.v12i23s.8304

Downloads

Download data is not yet available.

References

S. Tatineni, “AIOps in Cloud-native DevOps: IT Operations Management with Artificial Intelligence,” J. Artif. Intell. Cloud Comput., vol. 2, pp. 1–7, 2023, doi: 10.47363/JAICC/2023(2)154.

Tyagi, “Intelligent DevOps: Harnessing artificial intelligence to revolutionize CI/CD pipelines and optimize software delivery lifecycles,” J. Emerg. Technol. Innov. Res., vol. 8, pp. 367–385, 2021.

G. C. Kakaraparthi, “Integrating serverless architectures and Kubernetes for scalable and high-availability AI workflows,” Int. J. Intell. Syst. Appl. Eng., vol. 12, no. 4, pp. 5896–5905, 2024, doi: 10.6084/m9.figshare.30445046.

S. K. Malaraju and R. Bondalapati, “Least Outstanding Requests (LOR) Algorithm in Application Load Balancer,” Int. J. Sci. Technol., vol. 14, no. 3, p. 7, 2023, doi: 10.71097/ijsat.v14.i3.3171.

D. P. Guda and C. Appani, “Graph Neural Networks for Dynamic Malware Behaviour Analysis and Classification in Advanced Persistent Threats (APT),” Int. J. Commun. Networks Inf. Secur., vol. 14, no. 3, 2022.

S. Chatterjee, “Disaster Recovery Plan in Utility Industry for Virtual Asset Management - A Comprehensive Overview to Avoid Cyber Attacks,” Int. J. Sci. Res., vol. 13, no. 12, pp. 1163–1171, 2024, doi: 10.21275/sr241217215432.

S. Chatterjee, “A Data Governance Framework for Big Data Pipelines: Integrating Privacy, Security, and Quality in Multitenant Cloud Environments,” Tech. Int. J. Eng. Res., vol. 10, no. 5, 2023, doi: 10.56975/tijer.v10i5.158181.

L. S. Surisetty, “Modernizing Legacy Systems with AI Orchestration: From Monoliths to Autonomous Micro services,” Int. J. Adv. Res. Comput. Sci. & Technol., vol. 5, no. 6, pp. 7299–7306, 2022.

M. R. Mohammed, “Enhancing the Reliability of Cloud-Based Software Systems Using AI-Driven Fault Prediction and Auto-Remediation Techniques,” Am. Int. J. Comput. Sci. Technol., vol. 3, no. 5, pp. 1–13, 2021.

Patel, “A Review of Multi-Channel CRM Strategies Using Big Data and Cloud Integration,” Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol., vol. 8, no. 1, pp. 577–588, 2022.

Parupalli and H. Kali, “An In-Depth Review of Cost Optimization Tactics in Multi-Cloud Frameworks,” Int. J. Adv. Res. Sci. Commun. Technol., vol. 3, no. 5, pp. 1043–1052, Jun. 2023, doi: 10.48175/IJARSCT-11937Q.

Methuku, S. Kamatala, P. Naayini, and P. R. Vontela, “From Ethical Principles to Technical Safeguards: A Unified Framework for Safe and Human-Centred Artificial Intelligence,” Am. Int. J. Comput. Sci. Technol., vol. 4, no. 5, pp. 26–34, Sep. 2022, doi: 10.63282/3117-5481/AIJCST-V4I5P103.

S. Chitta, C. Ravi, V. K. R. Vangoor, and S. M. Yellepeddi, “AIOps: Integrating AI and Machine Learning into IT Operations,” Aust. J. Mach. Learn. Res. Appl., vol. 4, no. 1, pp. 288--305, Jan. 2024.

H. Allam, “From Monitoring to Understanding: AIOps for Dynamic Infrastructure,” Int. J. AI, BigData, Comput. Manag. Stud., vol. 4, no. 2, pp. 77–86, jun. 2023, doi: 10.63282/3050-9416.IJAIBDCMS-V4I2P109.

S. Polisetty, “Training Ai Models: Preparing and Managing Ai Algorithms for Aiops,” Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol., vol. 9, no. 5, 2023.

S. M. Polisetty, “Resolving Incidents and Alerts in AIOps with Predictive Analytics,” Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol., vol. 8, no. 5, pp. 375–387, 2022, doi: 10.32628/CSEIT23902182.

L. Li, “The Application of AI-Based Technology in Computer Network Operation and Maintenance,” Mob. Inf. Syst., vol. 2022, no. 1, p. 2971393, 2022, doi: 10.1155/2022/2971393.

S. Becker, F. Schmidt, A. Gulenko, A. Acker, and O. Kao, “Towards AIOps in Edge Computing Environments,” Publ. 2020 IEEE Int. Conf. Big Data (Big Data), 2021, doi: 10.1109/BigData50022.2020.9378038.

S. I. Shifat, “IT Incident Log Dataset,” Kaggle., 2020, [Online]. Available: https://www.kaggle.com/datasets/shamiulislamshifat/it-incident-log-dataset

Ampountolas, “Comparative Analysis of Machine Learning, Hybrid, and Deep Learning Forecasting Models: Evidence from European Financial Markets and Bitcoins,” Forecasting, vol. 5, no. 2, pp. 472–486, 2023, doi: 10.3390/forecast5020026.

D. Song, A. M. Chung Baek, and N. Kim, “Forecasting Stock Market Indices Using Padding-Based Fourier Transform Denoising and Time Series Deep Learning Models,” IEEE Access, vol. 9, pp. 83786–83796, 2021, doi: 10.1109/ACCESS.2021.3086537.

M. Jin et al., “An Anomaly Detection Algorithm for Microservice Architecture Based on Robust Principal Component Analysis,” in IEEE Access, IEEE, 2020, pp. 226397–226408. doi: 10.1109/ACCESS.2020.3044610.

M. Kayakuş, F. Yiğit Açikgöz, M. N. Dinca, and O. Kabas, “Sustainable Brand Reputation: Evaluation of iPhone Customer Reviews with Machine Learning and Sentiment Analysis,” Sustainability, vol. 16, no. 14, 2024, doi: 10.3390/su16146121.

Downloads

Published

06.08.2024

How to Cite

Samrat Mukherjee. (2024). AI-Driven Autonomous IT Operations: A Human-in-the-Loop AIOps 2.0 Framework . International Journal of Intelligent Systems and Applications in Engineering, 12(23s), 4317–4325. Retrieved from https://www.ijisae.org/index.php/IJISAE/article/view/8304

Issue

Section

Research Article