• Lead and mentor a team of Platform Engineers, fostering a culture of continuous improvement and innovation.
• Collaborate with product and engineering teams to design and implement scalable solutions.
• Develop and maintain a reliable monitoring and alerting system to detect and mitigate issues proactively.
• Handle incidents to consistently reduce TTM and TTR.
• Participate in and lead post-mortem analyses to prevent future outages.
• Manage priorities, projects, and the overall workflow of the SRE team.
• Ensure compliance with security best practices and company policies.
• Stay ahead of industry trends and emerging technologies to improve system reliability and performance continuously.
• Exceptional problem-solving skills and the ability to work under pressure.
• Excellent communication and team-building skills.
• 12 years of experience in Software Development, Platform Engineering, DevOps, or similar roles, with at least 5 years in a lead and/or architect position.
• Experience mentoring geographically dispersed teams.
• Recommend the appropriate technological approach, team structures, and skill sets.
• Proficiency in programming languages such as Python, Go, or Java.
• Extensive experience with cloud services (AWS, GCP, Azure) and container and orchestration tools (Docker, Kubernetes).
• Experience designing and implementing CI/CD pipelines and configuration management (Jenkins, Ansible, Terraform).
• Deliver architectural initiatives that drive and improve efficiency in line with business strategy.
• Familiarity with distributed systems design patterns using tools such as Kubernetes.
• Exceptional knowledge of observability tools and of setting up architecture for proactive monitoring of the product.
• Experience in setting up SLOs and SLIs.
• Proven track record of designing and implementing scalable, high-availability systems.
If you want to help us build knowledge and solve big problems, let's talk.