Job responsibilities
- Execute software solutions, design, development, and technical troubleshooting with ability to think beyond routine or conventional approaches to build solutions or break down technical problems.
- Support the engineering teams in building fault-tolerant, scalable applications by engaging in design discussions, RFCs and code reviews.
- Drive decisions that influence the product design, application functionality, and technical operations and processes.
- Implement and regularly testing DR strategies to ensure highest level of resilience and fault tolerance of the platform.
- Automate the installation, upgrade, scaling, and management of a large and rapidly growing fleet of Kubernetes clusters. Develop custom platform control plane webhooks, CRDs and operators and more that provide a secure opinionated platform.
- Maintain and promoting high-quality written documentation of assets, processes and runbooks that are used by the team in their day-to-day operations.
- Add to the team culture of diversity, equity, inclusion, and respect.
Required qualifications, capabilities, and skills
- Possess an up-to-date understanding of design patterns relevant to hosting and networking architectures.
- Proactively champion product development, driven by a desire to build truly exceptional products, not just solve immediate challenges.
- A strong background working in either Python, Golang or Java, having used one of these programming languages to execute a significantly sized project or initiative.
- Extensive experience of working with Kubernetes and Cloud Platforms (AWS, GCP or Azure).
- Expertise in one or more of the following areas: Database Administration, Networking, Observability Tools, or automation of infrastructure.
- Ability to tackle design and functionality problems independently with little to no oversight.
- Excellent debugging and trouble shooting skills.
Preferred qualifications, capabilities, and skills
- Experience in Infrastructure Architecture designs.
- Certification in Cloud Platforms (AWS, GCP preferred)
- Certification in Kubernetes