Expoint - all jobs in one place

המקום בו המומחים והחברות הטובות ביותר נפגשים

Limitless High-tech career opportunities - Expoint

Microsoft Software Engineer II 
Canada, British Columbia, Vancouver 
818399299

04.02.2025

As a Software Engineer II on the MAIA System Infrastructure team, you will play a crucial role in building and optimizing the infrastructure that underpins our observability and data flow infrastructure for MAIA AI accelerators. Your primary focus will be on developing and enhancing the data flows that support our complex data flows across hosts and networks, ensuring they provide accurate and actionable insights into the complex operations of our AI hardware. This role involves working closely with senior engineers to design and implement data flow mechanisms that are efficient, scalable, and capable of handling the intricacies of our advanced accelerator architecture.

Required Qualifications

  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience.
  • 2+ years experience in system-level programming (C/C++), with a focus on building and optimizing low-level infrastructure.

Other Requirements

  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check:
    • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications

  • Proficient foundation in system-level programming (C/C++), with a focus on building and optimizing low-level infrastructure.
  • Experience or a keen interest in data flow management, particularly in the context of high-performance computing systems and hardware accelerators.
  • Understanding of or willingness to learn about high performance communication patterns over PCIe, eBPF, Networking, and various memory fabrics within and across our hardware and software stacks.
  • Proven problem-solving skills with the ability to tackle complex technical challenges related to data flow efficiency and infrastructure optimization.
  • A track record of working on high-impact projects, demonstrating a passion for building robust, high-performance systems.
  • Excellent collaboration and communication skills, with a drive to work alongside top-tier engineers to push the boundaries of AI acceleration tooling.
  • Familiarity with performance optimization and debugging tools is a plus, with a desire to contribute to the development of such tools.
  • Familiarity with Linux kernel and eBPF tooling (e.g., BCC, bpftrace) is a plus, demonstrating an ability to utilize eBPF for real-time data analysis and system diagnostics.

Software Engineering IC3 - The typical base pay range for this role across Canada is CAD $83,600 - CAD $159,600 per year.

Find additional pay information here:
Microsoft will accept applications for the role until February 10, 2025.

Responsibilities

In this position, you'll be hands-on in developing and optimizing the infrastructure that enables our observability and debugging tools to function seamlessly across multi-chip, multi-server environments. Your work will directly contribute to how developers interact with, analyze, and optimize AI workloads on our accelerators, ensuring that data transfer and processing are handled with maximum efficiency.

Develop and Optimize Tooling Infrastructure:

Work on the core infrastructure that supports our observability tools, focusing on the data flows and the efficient management of information between host systems and MAIA accelerators.
Implement and refine data transfer mechanisms, ensuring they are optimized for speed, reliability, and scalability across a distributed system of accelerators.

Participate in Building Robust Systems:

Assist in building and maintaining the infrastructure that allows seamless interaction between the tooling stack and the MAIA chips, ensuring reliable data collection and analysis.
Contribute to the development of internal APIs and libraries that facilitate data transfer, processing, and storage, supporting a high-performance observability ecosystem.


Engage with High-Performance Systems Design:

Other

  • Embody our and