Finding the best job has never been easier
Share
You will play a critical role in delivering and maintaining the infrastructure for our cloud supercomputers and enabling the revolution of AI. You will be responsible for owning the delivery and burn-in of clusters into Azure independently, ensuring that the hardware is stable for customers to run their applications. This will involve working closely with hardware vendors and other teams to ensure that the clusters are properly configured and optimized for performance across CPU (Central Processing Unit), accelerators, and network infrastructure as well as tracking progress during all the stages of the process.In addition, you will be responsible for automating the quality process and debugging issues as they arise, ensuring successful resolution. This will involve developing and maintaining tools and processes to automate testing and ensure that quality is built into every step of the development process. You will also work closely with other teams to diagnose and resolve issues, and to ensure that our customers have seamless experience using our cloud supercomputers, as well as becoming the voice of the customer to represent their issues.Your attention to detail will be critical in this role, as you will be responsible for ensuring that quality is always front and center as well as having the desire to identify and isolate potential issues in the early phases of the project. This will involve reviewing system level specification, code and configurations, and working with other teams to identify and address any issues that arise. You will also be responsible for documenting processes and procedures, and for ensuring that our team is following the industry’s best practices and standards for software development and deployment.
Required Qualifications:
Other Requirements:
Preferred Qualifications:
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
These jobs might be a good fit