Share
Key job responsibilities
As a systems development engineer you will build Cloud-Scale Machine Learning Acceleration Infrastructure including architecting and delivering directly to your customers, the Annapurna engineering teams, that are building hardware/software running in our data centers which are custom designed machine learning products: AWS Inferentia2 and Trainium.
You will interact with multiple teams to develop and execute in-depth infrastructure development plans including defining the security requirements and threat models. You will build mechanisms to automate host operating systems (OS) patching and define our software service's security posture. You will define and execute both short-term and long-term strategies for vulnerability remediation, and drive remediation activities across multiple Annapurna teams. You'll work closely with the Annapurna engineers to learn their operational constraints, architectural design requirements and partner with senior security engineers to review, refine and optimize remediation plans.Your impact will be critical in helping AWS/Annapurna maintain top-tier security by balancing risk and delivery. You'll focus on identifying security trends, enhancing security visibility, and recommending improvements to our security posture.
Dive deep to address infrastructure security risks involving networking, high performance compute clusters, infrastructure automation of hardware/software/firmware testing, and ASIC/EDA development. You will help scale the next generation of cloud infrastructure based on cloud frameworks and AWS services. You will own security reviews for infrastructure development and partner with AWS service teams and vendors with a focus on security. You will influence within your team, your customers and AWS service teams to help drive and develop the technical implementation for overall system designs. You will identify and implement process improvements which improve the team’s agility and operations, including improvements to design, automation, development, test or operations. You will develop, document and update operational runbooks as you participate in on-call rotations.
A day in the life
Each day you will work with the best engineers in the industry to develop Machine Learning Accelerators. On-site in Austin, Texas, you will be apart of the team that develops custom silicon and you will secure the infrastructure that enables this innovation. Take a look inside our labs to see what you will learn at Annapurna Labs:https://youtu.be/rViVFrQg4Hk
- 2+ years of non-internship professional software development experience
- 1+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience
- Knowledge of systems engineering fundamentals (networking, storage, operating systems)
- Experience programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby
- Bachelor's degree in computer science or equivalent
- Knowledge of system security vulnerabilities and remediation techniques.
- 2+ years of experience in any combination of the following: threat modeling experience, secure coding, identity management and authentication, security vulnerabilities, attacker exploit techniques, and methods/tools used for security risk remediation.
These jobs might be a good fit