Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Microsoft Member Technical Staff Pre-training Platform Program Manager 
United States, California, Mountain View 
610898772

03.12.2024

We are looking for an outstandingMember of Technical Staff, Pre-training Platform Technical Program Managerwho would be the driving force behind all the activities mentioned above, track and manage the priorities across the various engineering teams delivering both hardware capacity and software components for pretraining. The individual should be excited and be proud about contributing to the next generation of systems that will transform the field. We are looking for candidates who:

  • Are passionate about managing high stakes time-sensitive large-scale programs
  • Are keenly aware and able to manage timelines and schedules of mission critical cluster hardware, software and services
  • Have experience in navigating the timelines of datacenter capacity deploying thousands of GPUs and enable large-scale AI model training or inference clusters
  • Will thrive in a highly collaborative, fast-paced environment
  • Have a high degree of craftsmanship and pay close attention to details
  • Demonstrate a proactive attitude and enthusiasm for exploring new methods and technologies
  • Effectively manage multiple responsibilities and can adjust to shifting priorities.

By applying to this position, you are required to be local to the Montain View, California OR Redmond, Washington area.

Required Qualifications:

  • Bachelor's Degree AND 4+ years experience in engineering, product/technical program management, data analysis, or product development
    • OR equivalent experience.
  • 2+ years experience managing cross-functional and/or cross-team projects.

Preferred Qualifications:

  • Bachelor's Degree AND 8+ years experience in engineering, product/technical program management, data analysis, or product development
    • OR equivalent experience.
  • 6+ years experience managing cross-functional and/or cross-team projects.
  • 1+ year(s) experience reading and/or writing code (e.g., sample documentation, product demos).
  • 2+ yrs oftracking and managing data center bring-up.
  • 1+ yrs of tracking and managing capacity deployment, validation and benchmarking of GPU clusters for AI training.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:Microsoft will accept applications for the role until December 23, 2024.


Responsibilities
  • Deeply understand, track and manage the timelines of datacenter construction and bring-up
  • Deeply understand, track and manage node, rack and cluster validation processes
  • Hold execution-focused meetings with various stake holders to accelerate GPU delivery timelines
  • Track and manage the capacity deployment, validation and benchmarking of AI supercomputers
  • Collaborate with the product team and other engineers and researchers across Microsoft and other vendors to identify gaps and drive timelines towards resolutions and mitigations
  • Working with a cross-disciplined crew across design, research, engineering, and data analysis to deliver a high-quality product and evaluate success towards business goals.
  • Embody our and .