Develop full-fledged software tooling to deliver programmable infrastructure (infrastructure as code)
Develop tooling to drive end-to-end micro-services monitoring and management
Implement Kubernetes compliance and best practices in terms of security, audits, network policies, reporting
Develop a self-service Console to provide infrastructure visibility
Manage the availability, scalability, and performance of the platform's infrastructure
Create tools and infrastructure leveraged by the rest of the engineering teams
Convert other engineering team's application development bottlenecks as an opportunity to automate & scale the tooling of the platform's infrastructure
Create and maintain continuous integration and continuous deployment(CI/CD) environments for scaling SaaS applications to multi-region & multi-cloud patterns
Strong exposure of design patterns, SOLID Principles, architectural reviews and diagram designs
Strong exposure of converting product business requirements to technical solutions including multi cloud designs
Domain expertise in new product introduction and go to market deployments.
Expertise with monitoring, alerting, and incident management, such as Grafana, Prometheus, Alert Manager, Kibana, PagerDuty.
Operating experience in real-time data processing pipelines with data ingestion, Kafka, Flink, and Elastic Search.
Experience in the development of operational procedures, processes, and scripts