Design and implement full-stack observability systemscovering metrics, logs, traces, and events for GPU-powered AI and HPC workloads. Build large-scale telemetry data pipelinesleveraging OpenTelemetry, Kafka, Prometheus, and other distributed systems to...