Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Amazon Systems Development Engineer III Annapurna Labs Infrastructure 
United States, Texas, Austin 
381541959

12.06.2024
DESCRIPTION

Key job responsibilitiesYou will need to lead across teams to develop and execute in-depth infrastructure development plans that enables the engineering development of the Machine Learning Acceleration product family. You will dive deep to solve critical infrastructure issues involving networking, high performance compute clusters, infrastructure automation of hardware/software/firmware testing, and ASIC/EDA development. You will execute and scale the next generation of cloud infrastructure based on cloud frameworks and AWS services. You will own design reviews for infrastructure development and partner with AWS service teams and vendors. You will influence within your team, your customers and AWS service teams to help drive and develop the technical implementation for overall system designs. You will identify and implement process improvements which improve your team’s agility and operations, including improvements to design, automation, development, test or operations. You will define new mechanisms that execute system health monitoring, diagnostics, repair, and automation. You will develop, document and update operational runbooks as you participate in on-call rotations.A day in the life
Each day you will work with the best engineers in the industry to develop Machine Learning Accelerators. On-site in Austin, Texas, you will be apart of the team that develops custom silicon and you will own the infrastructure that enables this innovation. Take a look inside our labs to see what you will learn at Annapurna Labs:https://youtu.be/rViVFrQg4HkAustin, TX, USA


BASIC QUALIFICATIONS

- 5+ years of programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby experience
- 3+ years of non-internship professional software development experience
- 5+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience
- 5+ years of deploying and operating in a Linux/Unix environment experience
- 3+ years of systems design, software development, operations, automation, and process improvement experience
- Experience leading the design, build and deployment of complex and performant (reliable and scalable) software solutions in production
- 3+ years of systems development in an IT or data center environment experience
- Experience with debugging complex issues with HW/SW, networking and storage systems
- Experience with operations of large scale infrastructure deployments including improving operational excellence