Expoint - all jobs in one place

NetApp Platform Reliability Engineering Architect 
United States, North Carolina 
163348151

22.12.2024

Job Responsibilities

• Lead and mentor a team of Platform Engineers, fostering a culture of continuous improvement and innovation.
• Collaborate with product and engineering teams to design and implement scalable solutions.
• Develop and maintain a reliable monitoring and alerting system to detect and mitigate issues proactively.
• Handle incidents to consistently reduce TTM and TTR.
• Participate in and lead post-mortem analyses to prevent future outages.
• Manage priorities, projects, and the overall workflow of the SRE team.
• Ensure compliance with security best practices and company policies.
• Stay ahead of industry trends and emerging technologies to continuously improve system reliability and performance.
• Exceptional problem-solving skills and the ability to work under pressure.
• Excellent communication and team-building skills.


Job Requirements

• 12 years of experience in Software Development, Platform Engineering, DevOps, or similar roles, with at least 5 years in a lead and/or architect position.
• Experience mentoring geographically dispersed teams.
• Recommend the appropriate technological approach, team structures, and skill sets.
• Proficiency in programming languages such as Python, Go, or Java.
• Extensive experience with cloud services (AWS, GCP, Azure) and container orchestration tools (Kubernetes, Docker).
• Experience designing and implementing CI/CD pipelines and Configuration Management (Jenkins, Ansible, Terraform).
• Deliver architectural initiatives that drive and improve efficiency in line with business strategy.
• Familiarity with distributed systems design patterns using tools such as Kubernetes.
• Exceptional knowledge of observability tools and experience setting up architecture for proactive monitoring of the product.
• Experience in setting up SLOs & SLIs.
• Proven track record of designing and implementing scalable, high-availability systems.

IC - Typically requires a minimum of 12 years of related experience.
Mgr & Exec - Typically requires a minimum of 8 years of related experience.

If you want to help us build knowledge and solve big problems, let's talk.