Required Qualifications:
- Master's Degree in Electrical Engineering, or related field AND 3+ years technical engineering experience
- OR Bachelor's Degree in Electrical Engineering, or related field AND 5+ years technical engineering experience
- OR equivalent experience.
- 5+ years of work experience in managing product quality in the electronic industry.
- 5+ years of direct engineering experience in hardware system issue resolution for GPU Servers.
- Versed in filtering through applicable debug data, like telemetry and logs to identify and investigate HW failure signatures.
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Preferred Qualifications:
- Bachelor's Degree in electrical and systems engineering, or related field AND 7+ years experience in a large scale manufacturing and/or data center environment/repair
- OR Master's Degree in manufacturing, material, mechanical, electrical, and industrial engineering, or related field AND 6+ years experience in a high-volume manufacturing environment
- OR Doctorate in manufacturing, material, mechanical, electrical, and industrial engineering, or related field AND 3+ years experience in a manufacturing environment/repair
- OR 9+ years equivalent experience.
- Patent or track record of engineering excellency.
- Experience with Liquid Cooling Systems in Data Centers
- 12+ years of experience in working with the modern server architectures – includes understanding of GPU, CPU methods for failure analysis, debugging or validation.
- 8+ years of system level server debugging with anunderstanding of power, system and network environments
- 3+ years of direct GPU related engineering experience in issue debug/test log review.
- Leadership skills and ability to collaborate with diverse teams and drive a call to action.
- Experience in root cause analysis and corrective action methods to identify contributing factors of production defects.
- Ability to analyze large data sets, extract key insights, and effectively present and communicate the results.
- Proficient communication and project management skills.