Share
As a member of the Amazon MSK Infrastructure team, you will work on systems that maintain fleet health across 500,000 hosts spanning 37 regions. Your work will include building automation for fleet patching to keep RED hosts under 1% at any given time, developing region build automation to support MSK launches in new AWS regions, and ensuring feature parity across all regions. The scale of this fleet presents unique challenges in coordination, rollout strategies, and failure handling that require sophisticated automation and monitoring systems.Your responsibilities will include collaborating with other engineers to build reliable infrastructure for a large-scale AWS service, working with senior leaders to define infrastructure roadmaps, and ensuring MSK can scale globally while maintaining high availability standards.Utility Computing (UC)About AWS
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
These jobs might be a good fit