Expoint – all jobs in one place
The point where experts and best companies meet

Systems Validation Manager Annapurna Labs jobs at Amazon in United States, Austin

Discover your perfect match with Expoint. Search for job opportunities as a Systems Validation Manager Annapurna Labs in United States, Austin and join the network of leading companies in the high tech industry, like Amazon. Sign up now and find your dream job with Expoint
Company (1)
Job type
Job categories
Job title (1)
United States
State
Austin
241 jobs found
09.11.2025
A

Amazon Sr SDE MLA hardware/software co-design Annapurna United States, Texas, Austin

Limitless High-tech career opportunities - Expoint
Description:
Description

As a senior SDE in the pre-silicon team, you will be responsible for driving the pre-silicon hardware/software co-development for our machine learning chips.You will work with architecture, design and emulation teams to build new silicon functionality.You will write bare-metal software to verify the end-to-end functionality of the SoC and the functionality and performance of different subsystems in the SoC.
Work/Life Balance
Mentorship and Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.Diverse Experiences
Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

Basic Qualifications

* 5+ YoE in software development
* Knowledge of HW/SW interfaces and computer architecture
* Proficiency in programming in C/C++, scripting in Bash/Python
* Proficiency in data structures and algorithms


Preferred Qualifications

* Knowledge in low level software such as firmware and device drivers
* Knowledge in SoC architecture
* Knowledge in IO(PCIE, AXI) , Memory(HBM, DDR), CPU architecture and Interconnects.

Expand
09.11.2025
A

Amazon Sr Quality & Reliability Engineer Trainium Servers Systems M... United States, Texas, Austin

Limitless High-tech career opportunities - Expoint
Description:
Description

The Trainium Manufacturing, Quality and Reliability (MQR) Team is part of AWS Annapurna Labs focused on Machine Learning products that designs cutting AI platforms for the world’s largest Cloud Services provider. As a Senior Reliability Engineer you will engage with an experienced cross-disciplinary staff to conceive and design infrastructure technologies. You will work closely with an internal inter-disciplinary team, and outside partners to drive key aspects of product definition, execution and test in manufacturing. A successful candidate will be responsive, flexible and able to succeed within an open collaborative peer environment. You will:* Be responsible for the test validation of future technologies.
* Drive manufacturing process improvements to address reliability issues and concerns.
* Qualify manufacturing lines and mechanisms for mass production
* You will have a fundamental understanding of Reliability statistics/Reliability tests and/or solid understanding of computer systems to influence design for reliability.
* Lead identifying and validating product/component risks and work with design teams to mitigate them and define the test methodology and test coverage to assure product reliability.
* Deep-dive in technologies aligned with product roadmap.
* Provide technical leadership and mentor engineers.
* Perform Reliability prediction of failure mechanisms, products under development and products in the field.
* Working with multiple vendors and ODMs to standardize component manufacturing and reliability expectations.Key job responsibilities
* Responsible for defining reliability tests to be implemented during manufacturing
* Drive manufacturing process improvements to address reliability issues and concerns.
* Perform Reliability prediction of failure mechanisms, products under development and products in the field.
* Working with multiple vendors and ODMs to standardize component manufacturing and reliability expectations.


Basic Qualifications

- Bachelor's or Master’s degree in Reliability Engineering, Physics or related field, or equivalent experience
- 7+ years of Reliability Engineering work experience with server compute platforms or on high-tech hardware

Expand
09.11.2025
A

Amazon Software Development Manager Foundations United States, Texas, Austin

Limitless High-tech career opportunities - Expoint
Description:
Description

In this role you will be responsible for building and supporting a team which is critical in providing compute sanitization to the Neuron ML accelerators fleet. You will work closely with the hardware and software teams to ensure the right tools are available for identifying defects or faulty states of the hardware before the customer hits an issue. Neuron Compute Sanitizer Tools develops and maintains a pre-check and functional correctness checking suite and provides visibility at the fleet level to understand the trends of hardware/software sanitization.Key job responsibilities
* Build and develop a strong team of engineers that would deliver the pre-check suite.
* Work closely with the hardware and firmware design teams.
* Collect requirements from various other teams including training, inference and runtime.
* Collaborate with the runtime team to ensure timely release of the pre-check tools.
* Anticipate future needs based on the product roadmap and develop necessary tools to sanitize compute.
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Work/Life Balance
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
About AWS Utility Computing (UC):
About AWS
About AWS Neuron:

Basic Qualifications

- 8+ years of engineering experience
- 5+ years of engineering team management experience
- 10+ years of planning, designing, developing and delivering consumer software experience
- Experience partnering with product or program management teams
- Experience managing multiple concurrent programs, projects and development teams in an Agile environment


Preferred Qualifications

- Experience designing and developing large scale, high-traffic applications
- Experience with ML hardware/Software

Expand
09.11.2025
A

Amazon Checkout Engineering Systems Engineer GRAISE United States, Texas, Austin

Limitless High-tech career opportunities - Expoint
Description:
Description

The ideal candidate will own the "how" of delivering robust, scalable solutions, while contributing significantly to the "what" of our payment and POS systems. You will be the technical driving force behind implementation strategies and best practices, playing a key role in shaping the design and development of these critical systems.Key job responsibilities- Implement critical components of payment and POS systems.
- Create design specifications for complex integration between Checkout systems, internal boundary systems, and vendor systems.
- Establish patterns for automated testing strategies and monitoring solutions.
- Develop automated alerting systems based on log analysis, with the ability to identify both present and absent critical data elements in payment flows.
- Collaborate with cross-functional teams to ensure designs meet business requirements.
- Review and provide guidance on technical designs and implementations.
- Lead and drive resolution of highly complex production issues.
- Lead retrospectives and author Corrections of Error (COEs).
- Drive systematic improvements to product/system performance and availability.
- Proactively identify and execute opportunities to improve operations.
- Detect trends and define proactive solutions before problems occur.
- Break down complex problems into actionable solutions that can be worked in parallel.
- Author and review technical documentation.
- Exhibit excellent problem-solving and analytical skills.A day in the life
- Review incident reports and address critical issues that occurred during off-hours.
- Analyze system metrics to identify potential bottlenecks and areas for enhancement.
- Support critical POS functions while progressing high-priority items.- Participate in standups including support retrospectives and design reviews for new architecture solutions.- Participate or lead architecture discussions shaping the systems design and functionality.

Basic Qualifications

- 4+ years of site reliability engineering (SRE), systems engineering, systems administration, DevOps, security administration, or network administration experience
- 5+ years of Linux experience
- 5+ years of systems engineering experience
- Bachelor's degree in Systems Engineering, Computer Science, or related field or relevant work experience
- Experience in site reliability engineering (SRE), systems engineering, systems administration, DevOps, security administration, or network administration
- Experience working with Linux
- Experience in systems engineering
- Experience in any of the following: Python, Java, Perl, PHP, Ruby, Bash, Shell or equivalent


Preferred Qualifications

- Knowledge of TCP/IP and networking protocols such as HTTP and DNS
- Experience designing and developing scripts to automate operational burdens and reviewing scripting changes to ensure they meet the standards for maintainability, scalability and security
- Experience working in 24/7 production environment
- Experience with service-oriented architecture and web services

Expand
09.11.2025
A

Amazon CPLD/FPGA Firmware Engineer Annapurna Labs ML Accelerator Sy... United States, Texas, Austin

Limitless High-tech career opportunities - Expoint
Description:
Description

Technologies useful to this role include computer architecture, hardware description languages (HDLs), and embedded systems. Our team uses Verilog, C, C++, Lua, bash, Python and other similar languages. Although we use machine learning workloads to validate systems software, this team is focused on codeveloping reliable server software and hardware for customers to deploy their ML workloads at scale.Key job responsibilities- Develop CPLD and FPGA programs that implement power sequencing and manage various protocols, including PWM, I2C, and SPI
- Develop systems software, kernel drivers
- Define test and automation flows to validate firmware
- Evaluate and optimize firmware performance
- Build error detection and recovery mitigation systems at AWS scaleA day in the life
You will have the opportunity to develop server firmware in a highly cross-functional environment, working side by side with software and hardware teams to optimize customer experience. You will be responsible for building scalable designs that can be tested throughout the stages of product development including manufacturing and production. You will leverage automation, continuous integration, and fleet metrics to deploy and monitor your changes.Work/Life Balance
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

Basic Qualifications

- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language
- 3+ years of programming with at least one hardware description language (HDL) experience


Preferred Qualifications

- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
- Experience in embedded development in C/C++
- Experience in RTL development in Verilog, VHDL, or SystemC

Expand
09.11.2025
A

Amazon Retail Systems Linux Engineer GRAISE United States, Texas, Austin

Limitless High-tech career opportunities - Expoint
Description:
Description


Key job responsibilities
- Design and modify scalable software architecture of full stack systems.
- Deliver personalized experience that requires machine learning and data engineering skills
- Work independently to deliver and maintain features.
- Apply software engineering best practices to the development life cycle (incremental delivery, coding standards, code reviews, source control management, build processes, testing, operations...).
- Engage in continuous prioritization efforts to balance out fast paced business requirements with long term technical investments.A day in the life- Shipping and reviewing code
- Focusing on Operational Excellence (development processes, monitoring, code deployment, automated testing, dashboarding...).Sr. Software Development Engineers (SDEIIIs) oversees and secure the technical strategy, design and quality of this space.

Basic Qualifications

- Experience as a mentor, tech lead or leading an engineering team
- Experience leading the architecture and design (architecture, design patterns, reliability and scaling) of new and current systems
- Experience in professional, non-internship software development
- Experience programming with at least one modern language such as Java, C++, or C# including object-oriented design
- Experience in development in the last 3 years


Preferred Qualifications

- Bachelor's degree in computer science or equivalent
- Experience with full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations

Expand
09.11.2025
A

Amazon Sr Communication Systems Design Engineer United States, Texas, Austin

Limitless High-tech career opportunities - Expoint
Description:
Description

You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion
Key job responsibilities
The successful candidate will be operationally responsible for a Data Center. Some high-level responsibilities include:
- Prioritize and assign trouble tickets to data center technicians and operators
- Recruit and train data technicians to ensure appropriate staffing levels
- Ensure effective and efficient management of day to day data center operations including queue management, 7/24 shift arrangement and hardware logistics
- Fast learn or act as the subject matter expert across all aspects in data center operations
- Ensure all operational KPIs and metrics are being measured and met- Manage Large Scale Events (outages) and act as the call leader
- Manage and improve the work-flows and through-put for data centers operations
- Recommend, document, and oversee policies and procedures to meet industry best practices and to meet required SLAs
- Maintain the on-call schedule coordinating absence and vacationsDiverse Experiences
AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.Work/Life Balance

Basic Qualifications

- 4+ years of Information Technology (IT) experience, or Bachelor's degree in computer science, engineering, mathematics or equivalent
- 2+ years of experience managing people in a technical environment.
- 2+ years experience in participating in on-call rotations, and providing after-hours support in an environment that operates 24/7, Networking and Computer Hardware.


Preferred Qualifications

- Experience in technical writing in a relevant field
- Experience in project management
- In-depth knowledge of Linux systems administration, Networking and Cabling best practices
- In-depth hardware architectures knowledge and troubleshooting experience, system management tools and client/server environments

Expand
Limitless High-tech career opportunities - Expoint
Description:
Description

As a senior SDE in the pre-silicon team, you will be responsible for driving the pre-silicon hardware/software co-development for our machine learning chips.You will work with architecture, design and emulation teams to build new silicon functionality.You will write bare-metal software to verify the end-to-end functionality of the SoC and the functionality and performance of different subsystems in the SoC.
Work/Life Balance
Mentorship and Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.Diverse Experiences
Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

Basic Qualifications

* 5+ YoE in software development
* Knowledge of HW/SW interfaces and computer architecture
* Proficiency in programming in C/C++, scripting in Bash/Python
* Proficiency in data structures and algorithms


Preferred Qualifications

* Knowledge in low level software such as firmware and device drivers
* Knowledge in SoC architecture
* Knowledge in IO(PCIE, AXI) , Memory(HBM, DDR), CPU architecture and Interconnects.

Expand
Find your dream job in the high tech industry with Expoint. With our platform you can easily search for Systems Validation Manager Annapurna Labs opportunities at Amazon in United States, Austin. Whether you're seeking a new challenge or looking to work with a specific organization in a specific role, Expoint makes it easy to find your perfect job match. Connect with top companies in your desired area and advance your career in the high tech field. Sign up today and take the next step in your career journey with Expoint.