Applicants should have prior experience with or be able to master the tools and techniques for building, deploying, operating, and supporting large scale hybrid cloud platforms, services, and applications, based on Kubernetes and DevOps practices. Challenges include optimizing demanding Open APIs, heterogeneous computation platforms, and operational requirements for highly available and reliable services.
Required Technical and Professional Expertise- Experience in a professional software engineering role using one or more programming language ecosystems and software development processes,
- experience in developing software with Python, Rust, C++ or Go and their ecosystems of tools and libraries,
- experience with cloud-native environments, such as AWS (S3, lambda, CloudTrail, etc.) or other cloud vendors,
- demonstrated ability to organize, prioritize, and multi-task in a fast paced, changing, and agile development environment to meet deadlines,
- effective written and verbal communication and interpersonal skills,
- demonstrated ability to organize, prioritize, and multi-task in a fast paced, changing, and agile development environment to meet deadlines,
- effective written and verbal communication and interpersonal skills,
- proficiency in English
Preferred Technical and Professional Expertise
- Experience with Kubernetes
- Experience with Ray, Skypilot, or vLLM,
- Experience with DevOps and Site Reliability Engineering (SRE) processes required to build Software as a Service (SaaS) products,
- Experience in a data science or data engineering environment,
- Experience with strongly typed languages/type systems or GraphQL