BS degree in Mechanical, Electrical, Industrial or Manufacturing Engineering
5+ years of technical engineering experience
5+ years of experience managing a hardware engineering team, with hardware reliability and operational experience
Strong people management, program management, time management, prioritization, and organizational skills
Experience with and understanding of troubleshooting processes and diagnostic methods throughout system development and deployment
Good communication skills for describing technical and risk findings
Preferred Qualifications:
MS degree in Mechanical, Electrical, Industrial or Manufacturing Engineering
Experience with cloud hardware, servers, networking equipment, liquid cooling infrastructure and IT racks
Responsibilities:
Responsible for Design for Reliability & Availability execution by managing a team of Reliability Engineers and supporting hardware design teams throughout the product life cycle.
Responsible for supporting Reliability tool kits, Design for Reliability & Availability principles, DFMEA, accelerated life testing, physics of failure, and failure analysis (FA) and root cause analysis (RCA) assessment.
Responsible for qualification test plans and their execution for new product introductions and second sources. Ensure Cloud Hardware is developed and delivered to Microsoft’s datacenters to meet specified use conditions and reliability performance under applicable stresses.
Coordinate and facilitate Reliability execution in conjunction with program management and engineering teams, as well as with Supply Chain and suppliers.
Responsible for creating or revising reliability engineering guidelines to improve product field performance through design enhancements.
Responsible for coordinating the use of performance evaluation and prediction principles to improve the reliability and maintainability of Cloud Infrastructure equipment, including compute and storage systems, power equipment, and network equipment.
Responsible for performing PCB stack-up design reviews, providing design analysis for reliability, manufacturability, and quality, and defining the technology envelope.
Responsible for developing models that represent the expected environment and operational conditions.
Responsible for selecting, analyzing, and interpreting results of various test methods used during product development.
Collaborate with other development functional teams and internal stakeholders regarding the application of Design for Reliability principles to ensure products meet customer expectations.