Job Overview
As a Senior Software Engineer on our Compute Platform & Strategy team, you will apply software development practices and DevOps knowledge to support the design, operation, and improvement of our compute infrastructure and platforms. You’ll join a collaborative group that values reliable delivery, continuous learning, and thoughtful problem-solving—focused on efficiency, resilience, and user needs.
We’re seeking a DevOps enthusiast with hands-on expertise in containerisation, cloud platforms, continuous-delivery pipelines, systems administration, and dashboarding solutions. In this role, you’ll partner with cross-functional colleagues to design CI/CD workflows, automate infrastructure tasks, create and refine dashboards, and help ensure our systems remain performant, secure, and cost-effective.
Responsibilities:
- Create and maintain monitoring systems and dashboards for health and performance, resource usage, and cost tracking, providing actionable insights that help improve efficiency
- Work with team members to define requirements and architect resilient CI/CD pipelines from planning through delivery
- Architect, deploy, and validate the infrastructure that powers development and testing workflows
- Manage the full lifecycle of core infrastructure in public cloud and Kubernetes—from design and deployment to maintenance and performance tuning
- Proactively identify and automate manual processes to accelerate workflows, strengthen DevOps practices, and reduce costs
Required Skills and Experience:
- Hands-on experience with one or more public clouds (AWS, GCP, Azure)
- Solid programming experience in a high-level language (Python, Go, Java, etc.) and with Infrastructure-as-Code tools (Terraform, CloudFormation)
- Experience designing CI/CD pipelines (Jenkins, GitLab CI, Azure DevOps) with multi-stage workflows, blue/green and canary releases, and automated rollbacks
- Proficiency with Docker, Kubernetes, and related cloud-native orchestration patterns
- Proven track record building dashboards and visualizations in tools such as Grafana, Datadog, and AWS QuickSight; hands-on instrumentation with tools such as Prometheus; experience managing time-series stores such as Graphite and VictoriaMetrics
- Solid understanding of networking, security, and compliance in cloud environments
- Excellent written and verbal communication skills
In return, you will be provided with the training and environment to excel in this role, as well as a friendly, high-performance workplace.