Making the existing cluster automation platform more fault-tolerant, agile, hardware- and network-aware, and resource-efficient. Enabling AI capabilities in the platform to enhance the user experience and accelerate automation, diagnosis, and remediation...