

The Trainium Manufacturing, Quality and Reliability (MQR) Team is part of AWS Annapurna Labs. The team focuses on Machine Learning products and designs cutting-edge AI platforms for the world’s largest Cloud Services provider. As a Senior Reliability Engineer, you will engage with an experienced cross-disciplinary staff to conceive and design infrastructure technologies. You will work closely with an internal interdisciplinary team and outside partners to drive key aspects of product definition, execution and test in manufacturing. A successful candidate will be responsive, flexible and able to succeed within an open, collaborative peer environment. You will:
* Be responsible for the test validation of future technologies.
* Drive manufacturing process improvements to address reliability issues and concerns.
* Qualify manufacturing lines and mechanisms for mass production.
* Have a fundamental understanding of reliability statistics and reliability tests, and/or a solid understanding of computer systems, to influence design for reliability.
* Lead the identification and validation of product/component risks, work with design teams to mitigate them, and define the test methodology and test coverage to assure product reliability.
* Deep-dive into technologies aligned with the product roadmap.
* Provide technical leadership and mentor engineers.
* Perform Reliability prediction of failure mechanisms, products under development and products in the field.
* Work with multiple vendors and ODMs to standardize component manufacturing and reliability expectations.
Key job responsibilities
* Define reliability tests to be implemented during manufacturing.
* Drive manufacturing process improvements to address reliability issues and concerns.
* Perform Reliability prediction of failure mechanisms, products under development and products in the field.
* Work with multiple vendors and ODMs to standardize component manufacturing and reliability expectations.
- Bachelor's or Master’s degree in Reliability Engineering, Physics or related field, or equivalent experience
- 7+ years of Reliability Engineering work experience with server compute platforms or on high-tech hardware
More jobs that might interest you

OpsTech Solutions is looking for experienced and dedicated team members to help manage the administration, maintenance, and development of the ServiceNow platform for the organization. The ideal candidate is innovative and has great problem-solving skills. You must be very comfortable working on core application administration, creating workflows, data management, scripting and process automation. You will need to work with non-technical and technical teams to translate business cases to scalable solutions for multiple teams. In addition, you’ll need to be able to clearly and effectively communicate to larger non-technical audiences on system changes and enhancements. You will be working in a hyper-growth environment where priorities shift quickly. You must be flexible and adapt well to a wide range of tasks and technologies.
Key job responsibilities
- ITSM (IT Service Management)
- HAM (Hardware Asset Management)
- System Integrations
- Perform technical administration and maintenance of ServiceNow
- Interface with ServiceNow for all operational requests / issues / planning.
- Lead ServiceNow release upgrades including technical and stakeholder management activities.
- Work closely with Software Development Engineers to coordinate system updates, patches, and configuration changes.
- Provide user training related to ServiceNow processes, policies and procedures.
A day in the life
- Medical, Dental, and Vision Coverage
- Maternity and Parental Leave Options
- Paid Time Off (PTO)
- 401(k) Plan
- Bachelor's degree in Information Technology, Management Information Systems, Computer Science, Business or equivalent work experience.
- Certified ServiceNow Administrator (CSA) or other ServiceNow certification is required.
- 3+ years of direct experience with the ServiceNow platform as an administrator, developer and/or comparable role.
- 3+ years of direct experience working with customers to understand requirements and implement information technology solutions / initiatives.
- 5+ years of experience in multiple areas of technology, including application administration, configuration, and/or data services.
- Proven experience with Agile Scrum/Kanban methodology.
- Experience with JavaScript, Perl or PHP is strongly preferred.
- Experience customizing or developing reports from data residing in relational databases.
- Experience with web service integrations using REST, SOAP, etc.
- ITIL Foundation certified or related experience preferred.

This position can be located in Austin, Seattle, or Arlington (DC).
**Must be open to travel at least 30% including international**
Key job responsibilities
- Develop solutions that make the best use of AWS services such as EC2, EKS, ECS, SageMaker and other computing platforms for GenAI practice.
- Provide one-to-few and one-to-many training sessions to transfer knowledge to builders considering or already using AWS.
- Build deep relationships with senior technical individuals within partners to enable them to be cloud advocates.
- Develop proof-of-concepts for solutions involving AWS services.
- Drive product integrations between partner products and AWS services.
- Provide thought leadership in the form of blog posts, public speaking, white papers and reference architectures.
A day in the life
- Building and testing a Proof of Concept (PoC) or creating a code sample.
- Writing a blog post or white paper.
AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
- 8+ years of specific technology domain areas (e.g. software development, cloud computing, systems engineering, infrastructure, security, networking, data & analytics) experience
- 3+ years of design, implementation, or consulting in applications and infrastructures experience
- 10+ years of IT development or implementation/consulting in the software or Internet industries experience
- Recent and demonstrable hands-on experience with AI/ML workloads.
- 5+ years of infrastructure architecture, database architecture and networking experience
- Knowledge of AWS services, market segments, customer base and industry verticals
- Experience working with end user or developer communities

You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
As part of the global controls team, you will work with highly motivated experts and innovators in the data center industry. You will be responsible for troubleshooting, project management, and maintaining the building management system (BMS) and electrical power monitoring system (EPMS). Using Amazon leadership principles, you will develop new processes and standards while innovating in the controls space.
AWS data centers have multiple components such as generators, uninterruptible power sources, diesel generators, electrical switchgear, power distribution units, variable frequency drives, automatic/static transfer switches, chillers (air-cooled and water-cooled), pumps, cooling towers, heat exchangers, CRAHs, air economizers, etc. All these components have local control systems that interact with each other via open and/or proprietary communications protocols. The BMS is the primary method of control of all mechanical systems within a data center. The EPMS is the primary method of monitoring all electrical systems within a data center.
Key job responsibilities
As a Data Center Controls Engineer you will:
• Troubleshoot and perform Root Cause Analysis or Corrective Action for BMS and EPMS related issues in AWS data centers.
• Train and assist internal customers and stakeholders with the creation, design, configuration, validation, installation, commissioning and operation of BMS and EPMS systems.
• Provide technical assistance and support to operations during life cycle of the data center.
• Review results and action items from the quarterly maintenance of BMS and EPMS and take action to get them resolved.
• Develop the scope of work, schedule, budget, and level of effort (LOE) for BMS and EPMS projects requested by customers and stakeholders.
• Manage scope, schedule, finance and execution of BMS and EPMS improvement projects in AWS data centers.
• Assist in procurement-related activities, including requests for quotation/proposals, responding to requests for information, review of vendor proposals and issuance of purchase orders.
• Participate in AWS global on-call schedule to provide immediate BMS and EPMS technical support to in-service data centers.
• Attend project-related meetings, coordinate with project leaders and regularly report status to Controls and stakeholder management.
• Support commissioning activities related to Controls projects in the data centers.
• Review, implement, troubleshoot and iterate on the controls sequence of operation (SOO) and provide necessary feedback to the design team.
• Develop and modify controls logic programming and graphical user interfaces.
• Manage multiple stakeholder deliverables, requirements and navigate challenging situations.
• Financially manage BMS and EPMS service contracts.
• Frequently visit (locally) assigned in-operation data centers to troubleshoot, meet customers, and supervise vendors’ work to ensure compliance with the scope, design, SOO and applicable local codes.
Diverse Experiences
Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Work/Life Balance
Mentorship and Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
- Bachelor's degree in Electrical Engineering, Mechanical Engineering, or a related field
- Experience carrying new design concepts through exploration, development, and into deployment or mass production
- Experience in MS Excel, Word, and Windows Operating Systems
- Experience with power management and power monitoring systems
- 5+ years of project management in data centers or comparable critical infrastructure experience
- Knowledge of critical data center equipment
- Knowledge of engineering documentation, electrical diagrams and standard operating procedures
- Knowledge of building codes and regulations including Life Safety, BOCA, NFPA, NEC, or OSHA
- Experience in project management in data centers or comparable critical infrastructure
- Experience in Data Center Engineering Operations, with a deep understanding of electrical and mechanical data center infrastructure
- Experience reading and interpreting construction specifications and drawings for all domains

We are seeking a talented and motivated Manufacturing Engineering Lead with a proven track record of implementing best-in-class processes within a complex manufacturing environment. This role reports to the Director of Engineering and is an integral part of the Machine Learning Acceleration server development team. You will work closely with the hardware design and test engineers to implement, test and improve new product assembly processes on the most advanced machine learning servers.
You will participate in the early phase of manufacturing line development for our next generation servers and racks, improving our manufacturing flows and informing system design, manufacturing, and fleet operations.
You will manage early lifecycle changes, identify initial product quality improvements, and drive to technical root cause in supplier quality activities. The candidate will have experience in design or manufacturing and is capable of making wide-ranging business decisions on behalf of the organization.
Key job responsibilities
* Design and build an assembly line for machine learning acceleration servers.
* Test individual server components and multi-rack systems.
* Propose design changes that will enhance product manufacturability and testability.
* Guide best practices on manufacturing tolerances.
* Review external production processes, identify risk and inform improvements.
* Identify supplier quality problems; inform containment and root cause activities.
* Hire and mentor team members including design engineers and technicians.
* Evaluate production, supplier and field failures for root cause analysis and resolution.
* Travel to manufacturing and ODM development sites.
A day in the life
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
About AWS
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

This position can be located in San Francisco, Seattle, Arlington, New York, or Austin.
**Must be open to travel at least 30% including international**
Key job responsibilities
- Develop solutions that make the best use of AWS services such as EC2, EKS, ECS, SageMaker and other communications platforms for interactive voice response (IVR), automatic call distribution, and integration with other AWS services for data storage and analytics.
- Provide one-to-few and one-to-many training sessions to transfer knowledge to builders considering or already using AWS.
- Build deep relationships with senior technical individuals within partners to enable them to be cloud advocates.
- Have hands-on experience and be able to develop proof-of-concepts for solutions involving AWS services.
- Drive product integrations between partner products and AWS services.
- Provide thought leadership in the form of blog posts, public speaking, white papers and reference architectures.
A day in the life
- Building and testing a Proof of Concept (PoC) or creating a code sample.
- Writing a blog post or white paper.
AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
- 2+ years of design, implementation, or consulting in applications and infrastructures experience
- 3+ years of specific technology domain areas (e.g. software development, cloud computing, systems engineering, infrastructure, security, networking, data & analytics) experience
- 4+ years of IT development or implementation/consulting in the software or Internet industries experience
- 2+ years of experience with Unified Communications and Collaboration role involving the design, development, and implementation of complex voice/video solutions. Recent and demonstrable hands-on experience with AI/ML, GenAI workloads.
Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Key job responsibilities
About the team
Diverse Experiences
Amazon Security values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Training & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, training, and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
- 8+ years of specific technology domain areas (e.g. software development, cloud computing, systems engineering, infrastructure, security, networking, data & analytics) experience
- 3+ years of design, implementation, or consulting in applications and infrastructures experience
- 8+ years of IT development or implementation/consulting in the software or Internet industries experience
- 5+ years of infrastructure architecture, database architecture and networking experience
- Experience working with end user or developer communities
- Cloud Technology Certification (such as Solutions Architecture, Cloud Security Professional or Cloud DevOps Engineering)
- Experience communicating across technical and non-technical audiences, including executive level stakeholders or clients
