

Share
As a senior SDE in the pre-silicon team, you will be responsible for driving the pre-silicon hardware/software co-development for our machine learning chips.You will work with architecture, design and emulation teams to build new silicon functionality.You will write bare-metal software to verify the end-to-end functionality of the SoC and the functionality and performance of different subsystems in the SoC.
Work/Life Balance
Mentorship and Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.Diverse Experiences
Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
* 5+ YoE in software development
* Knowledge of HW/SW interfaces and computer architecture
* Proficiency in programming in C/C++, scripting in Bash/Python
* Proficiency in data structures and algorithms
* Knowledge in low level software such as firmware and device drivers
* Knowledge in SoC architecture
* Knowledge in IO(PCIE, AXI) , Memory(HBM, DDR), CPU architecture and Interconnects.
These jobs might be a good fit

Share
The Trainium Manufacturing, Quality and Reliability (MQR) Team is part of AWS Annapurna Labs focused on Machine Learning products that designs cutting AI platforms for the world’s largest Cloud Services provider. As a Senior Reliability Engineer you will engage with an experienced cross-disciplinary staff to conceive and design infrastructure technologies. You will work closely with an internal inter-disciplinary team, and outside partners to drive key aspects of product definition, execution and test in manufacturing. A successful candidate will be responsive, flexible and able to succeed within an open collaborative peer environment. You will:* Be responsible for the test validation of future technologies.
* Drive manufacturing process improvements to address reliability issues and concerns.
* Qualify manufacturing lines and mechanisms for mass production
* You will have a fundamental understanding of Reliability statistics/Reliability tests and/or solid understanding of computer systems to influence design for reliability.
* Lead identifying and validating product/component risks and work with design teams to mitigate them and define the test methodology and test coverage to assure product reliability.
* Deep-dive in technologies aligned with product roadmap.
* Provide technical leadership and mentor engineers.
* Perform Reliability prediction of failure mechanisms, products under development and products in the field.
* Working with multiple vendors and ODMs to standardize component manufacturing and reliability expectations.Key job responsibilities
* Responsible for defining reliability tests to be implemented during manufacturing
* Drive manufacturing process improvements to address reliability issues and concerns.
* Perform Reliability prediction of failure mechanisms, products under development and products in the field.
* Working with multiple vendors and ODMs to standardize component manufacturing and reliability expectations.
- Bachelor's or Master’s degree in Reliability Engineering, Physics or related field, or equivalent experience
- 7+ years of Reliability Engineering work experience with server compute platforms or on high-tech hardware
These jobs might be a good fit

Share
In this role you will be responsible for building and supporting a team which is critical in providing compute sanitization to the Neuron ML accelerators fleet. You will work closely with the hardware and software teams to ensure the right tools are available for identifying defects or faulty states of the hardware before the customer hits an issue. Neuron Compute Sanitizer Tools develops and maintains a pre-check and functional correctness checking suite and provides visibility at the fleet level to understand the trends of hardware/software sanitization.Key job responsibilities
* Build and develop a strong team of engineers that would deliver the pre-check suite.
* Work closely with the hardware and firmware design teams.
* Collect requirements from various other teams including training, inference and runtime.
* Collaborate with the runtime team to ensure timely release of the pre-check tools.
* Anticipate future needs based on the product roadmap and develop necessary tools to sanitize compute.
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Work/Life Balance
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
About AWS Utility Computing (UC):
About AWS
About AWS Neuron:
- 8+ years of engineering experience
- 5+ years of engineering team management experience
- 10+ years of planning, designing, developing and delivering consumer software experience
- Experience partnering with product or program management teams
- Experience managing multiple concurrent programs, projects and development teams in an Agile environment
- Experience designing and developing large scale, high-traffic applications
- Experience with ML hardware/Software
These jobs might be a good fit

Share
Key job responsibilities- Enhance detection engineering processes to improve the detection engineering lifecycle.
- Develop platform requirements used to enrich alerts, and automate remediation and response actions.
- Research and develop mechanisms across to enhanceMachine-Learning, advanced data correlation, risk-based alerting, or Generative AI.
- Provide tactical detection support during security incidents.
- Automate your way through challenges using Python or other scripting language.Work/Life BalanceTraining and Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, training, and other career-advancing resources here to help you develop into a better-rounded professional.
- Experience triaging and developing security alerts and response automation, conducting front-line analysis, and providing escalation support
- Experience scripting with Python, Perl, Bash or PowerShell
- 1+ years of non academic experience in any combination of the following: threat modeling experience, secure coding, identity management and authentication, software development, cryptography, system administration and network security experience
- Bonus: Experience using Machine-Learning, Large Language Models (LLM), or Agentic workflows
These jobs might be a good fit

Share
This position can be located in Austin, Seattle, or Arlington (DC).**Must be open to travel at least 30% including international**Key job responsibilities
- Develop solutions that make the best use of the AWS services like AWS EC2, EKS, ECS, SageMaker and other computing platform for GenAI practice.- Provide one-to-few and one-to-many training sessions to transfer knowledge to builder considering or already using AWS.- Build deep relationships with senior technical individuals within partners to enable them to be cloud advocates.- Be able to develop proof-of-concepts for solutions involving AWS services.
- Driving product integrations between partner products and AWS services
- Proving thought leadership in the form of publishing blog posts, public speaking, white papers and reference architecturesA day in the life
- Building and testing a Proof of Concept (PoC) or create a code sample.- Writing a blog post or white paper.
AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.Mentorship & Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.Work/Life Balance
- 8+ years of specific technology domain areas (e.g. software development, cloud computing, systems engineering, infrastructure, security, networking, data & analytics) experience
- 3+ years of design, implementation, or consulting in applications and infrastructures experience
- 10+ years of IT development or implementation/consulting in the software or Internet industries experience
- Recent and demonstrable hands-on experience with AI/ML workloads.
- 5+ years of infrastructure architecture, database architecture and networking experience
- Knowledge of AWS services, market segments, customer base and industry verticals
- Experience working with end user or developer communities
These jobs might be a good fit

Share
You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion
Key job responsibilities
The successful candidate will be operationally responsible for a Data Center. Some high-level responsibilities include:
- Prioritize and assign trouble tickets to data center technicians and operators
- Recruit and train data technicians to ensure appropriate staffing levels
- Ensure effective and efficient management of day to day data center operations including queue management, 7/24 shift arrangement and hardware logistics
- Fast learn or act as the subject matter expert across all aspects in data center operations
- Ensure all operational KPIs and metrics are being measured and met- Manage Large Scale Events (outages) and act as the call leader
- Manage and improve the work-flows and through-put for data centers operations
- Recommend, document, and oversee policies and procedures to meet industry best practices and to meet required SLAs
- Maintain the on-call schedule coordinating absence and vacationsDiverse Experiences
AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.Work/Life Balance
- 4+ years of Information Technology (IT) experience, or Bachelor's degree in computer science, engineering, mathematics or equivalent
- 2+ years of experience managing people in a technical environment.
- 2+ years experience in participating in on-call rotations, and providing after-hours support in an environment that operates 24/7, Networking and Computer Hardware.
- Experience in technical writing in a relevant field
- Experience in project management
- In-depth knowledge of Linux systems administration, Networking and Cabling best practices
- In-depth hardware architectures knowledge and troubleshooting experience, system management tools and client/server environments
These jobs might be a good fit

Share
We’re searching for an experienced Circuit Design & Analysis engineer with a background in custom circuit design & analysis, system level thermal & power analysis with a proven track record of handling challenges at scale. In this role, you’ll be working directly with product engineers, signal & power integrity engineers and physical design experts - defining best practices, driving correlation of pre-silicon simulation of thermal & power integrity to post silicon analysis and developing custom circuits that help raise the bar in implementing state-of-the-art machine learning hardware.Key job responsibilities
- Design and implement custom cells / IP.
- Develop & run characterization flows for custom cells / IP developed.
- Own integration & post-silicon qualification of IPs like PLL, PCIE, UCIE, HBM, sensors/monitors.
- Develop scripts to automate running analysis and collect reports.
- Develop test-plan and perform measurements in the lab to correlate with simulation data.
A day in the life
Depending on the state of the project, you may find yourself working on the following:- Evaluate IPs (like sensors, process monitors) from a 3rd party
- Develop an characterize custom IPs like ganged buffers, custom logic cells for specialized operations (like MACs)
- Work with designers and architects to identify pain-points and areas where custom solutions can improve PPAS
- Do post-silicon quality checks for key IP like PLLs, UCIE/PCIE, HBM
- Do post-silicon power measurements of jitter, sensor calibration, power and correlate with simulationWork/Life Balance
Mentorship and Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.Diverse Experiences
Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
- BS + 8yrs or MS + 6yrs or PhD + 3yr in EE/CS
- Expertise on circuit level analysis using tools like SPICE / SPECTRE
- Expertise in interconnect & transistor fundamentals in deep sub-micron processes
- Understanding of ASIC Physical Design from RTL-to-GDSII
- Understanding of other sign-off activities (ir/em, physical verification, timing closure, DFT)
- 3+ years of scripting experience with Tcl, Perl or Python
These jobs might be a good fit

Share
As a senior SDE in the pre-silicon team, you will be responsible for driving the pre-silicon hardware/software co-development for our machine learning chips.You will work with architecture, design and emulation teams to build new silicon functionality.You will write bare-metal software to verify the end-to-end functionality of the SoC and the functionality and performance of different subsystems in the SoC.
Work/Life Balance
Mentorship and Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.Diverse Experiences
Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
* 5+ YoE in software development
* Knowledge of HW/SW interfaces and computer architecture
* Proficiency in programming in C/C++, scripting in Bash/Python
* Proficiency in data structures and algorithms
* Knowledge in low level software such as firmware and device drivers
* Knowledge in SoC architecture
* Knowledge in IO(PCIE, AXI) , Memory(HBM, DDR), CPU architecture and Interconnects.
These jobs might be a good fit