You should be comfortable working across teams to help establish standards and best practices for infrastructure and code deployments, observability, and environment access, accounting for the security and supportability of the system. You’ll thrive both in designing high-level architecture and in implementing and deploying infrastructure and the associated CI/CD pipelines. The backend systems you will help build in this role will support advanced ML-powered tools, including asset generation and LLM inference, to enable AI-assisted game creation.
What you'll be doing
- Contribute to architectural decisions and technical direction of AI and backend platforms
- Implement high-quality, maintainable infrastructure-as-code and Helm deployments, and help define best practices for other developers using Terraform Cloud, Helm, and Kubernetes
- Run spikes collaboratively with developers on technical approaches to address application infrastructure needs on Microsoft Azure and Google Cloud. You will help build out and mature the associated infrastructure and processes for promoting application code from development to production
- Support developers' investigations of issues, including performance and network latency, through infrastructure observability and monitoring
- Improve performance, reliability, observability, security and cost-efficiency of backend systems
What we're looking for
- Extensive experience delivering and supporting cloud backend services using Terraform, Kubernetes, Helm, and CI/CD pipelines (e.g. Argo, GitHub Actions)
- Proven track record in observability, including monitoring, logging, alerting, and debugging tools such as Grafana, to ensure system reliability and performance
- Strong understanding of software delivery best practices and network security, with a quality-first mentality and approach
- Strong interpersonal and communication skills, with successful experience aligning multiple stakeholders to deliver solutions
- Experience with Microsoft Azure or Google Cloud Platform offerings
You might also have
- Exposure to ML infrastructure or LLM inference deployment
- Familiarity with Unity or similar 3D engines
- Familiarity with languages like C# / .NET, Python, Golang
- Backend service development experience including API design
- Familiarity with networking, caching, or real-time data pipelines, and relational databases such as PostgreSQL
Additional information
- International relocation support is not available for this position
- Work visa/immigration sponsorship is not available for this position
This position requires the incumbent to have a sufficient knowledge of English to have professional verbal and written exchanges in this language since the performance of the duties related to this position requires frequent and regular communication with colleagues and partners located worldwide and whose common language is English.
Gross base salary: $125,300 to $187,900 CAD