Responsibilities
- Design and implement software systems that are highly distributed, concurrent and reliable to manage the lifecycle of our Distributed Cloud.
- Implement fully tested code with Observability and Telemetry for Day 2 operations.
- Work closely with Product, Platform Services and SRE Engineers to make this system scale to the needs of the business.
- Innovate by bringing the best ideas to life and helping the business grow.
- Learn and share your ideas and practices, inside and outside the company, to uplift the entire engineering community.
Required Qualifications
- Bachelor’s degree in computer science or equivalent professional experience (5+ years).
- Strong foundational understanding of systems, the Linux kernel, the TCP/IP stack, and K8s.
- Proficiency in cloud-native development and programming languages such as Go, Rust and Python.
- Must demonstrate a deep understanding of K8s machinery and have experience deploying Models (CRDs), Controllers and/or Operators in a mission-critical environment.
- Must have skills in building large-scale distributed systems that are highly resilient, reliable and mission-critical using K8s machinery.
- Must have experience in managing the lifecycle of software systems with CI/CD, using tools such as ArgoCD or similar.
- Strong analytical, troubleshooting, and end-to-end debugging skills in a highly asynchronous, event-driven system.
- Experience working in a 24x7 uptime SaaS environment with a global customer base.
- Preferred: extensive knowledge and/or experience in the domains of networking, routing, infrastructure, underlay to VPN, DNS, and load balancer technologies.
The Job Description is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change.