AI-Powered Self-Healing Cloud Infrastructures: A Paradigm For Autonomous Fault Recovery

Authors

  • Ravi Kumar Vankayalapati , Chandrashekar Pandugula , Venkata Krishna Azith Teja Ganti , Ghatoth Mishra

Abstract

This paper is about self-healing cloud infrastructures equipped with artificial intelligence to enable autonomous recovery from unforeseen runtime faults. As cloud-in-a-robot or robot clouds become a reality and promise to go beyond the cloud paradigm by enhancing edge computing platforms to deliver ultra-low-latency services to users, making them self-reliable is crucial. Although there is plenty of prior work on building robust deep learning models and deploying them in the cloud, there is a lack of comprehensive and systematic real-time fault recovery frameworks. This paper provides a detailed delineation of the different challenges and key aspects that are overlooked in prior work on building resilient cloud infrastructures. We present a system design of AI-powered self-healing cloud infrastructures that applies AI to different levels, including autonomous fault detection, reasoning-based fault diagnosis, and many techniques that use deep reinforcement learning to ensure expedited repair times.

Metrics

Metrics Loading ...

Downloads

Published

2022-12-20

How to Cite

Ravi Kumar Vankayalapati , Chandrashekar Pandugula , Venkata Krishna Azith Teja Ganti , Ghatoth Mishra. (2022). AI-Powered Self-Healing Cloud Infrastructures: A Paradigm For Autonomous Fault Recovery. Migration Letters, 19(6), 1173–1187. Retrieved from https://migrationletters.com/index.php/ml/article/view/11498

Issue

Section

Articles