Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Red hat Principal Software Engineering AI Model Serving 
United States, Massachusetts, Boston 
822443822

08.12.2024

What you will do:

  • Be an influencer and leader in MLOps-related open source communities to help build an active MLOps open source ecosystem for Open Data Hub and OpenShift AI

  • Act as an MLOps SME within Red Hat by supporting customer-facing discussions, presenting at technical conferences, and evangelizing OpenShift AI within the internal community of practices

  • Architect and design new features for open-source MLOps communities such as KubeFlow and KServe

  • Provide technical vision and leadership on critical and high-impact projects

  • Mentor, influence, and coach a team of distributed engineers

  • Ensure non-functional requirements including security, resiliency, and maintainability are met

  • Write unit and integration tests and work with quality engineers to ensure product quality

  • Use CI/CD best practices to deliver solutions as productization efforts into RHOAI

  • Contribute to a culture of continuous improvement by sharing recommendations and technical knowledge with team members

  • Collaborate with product management, other engineering, and cross-functional teams to analyze and clarify business requirements

  • Communicate effectively to stakeholders and team members to ensure proper visibility of development efforts

  • Give thoughtful and prompt code reviews

  • Represent RHOAI in external engagements including industry events, customer meetings, and open-source communities

What you will bring

  • An existing contributor in one or more MLOps open source projects such as KubeFlow, KServe, RayServe, and vLLM.

  • Recent hands-on experience in deploying and maintaining machine learning models in production environments

  • Passion for writing and maintaining reliable code

  • Experience with monitoring and alerting tools such as Prometheus and Grafana

  • Excellent written and verbal communication skills; fluent English language skills

  • Advanced experience developing applications in Python and Go

  • Advanced level of experience in Kubernetes or OpenShift

  • Ability to quickly learn and guide others on using new tools and technologies

  • Experience with source code management tools such as Git

  • Proven ability to innovate and a passion for staying at the forefront of technology.

  • Excellent system understanding and troubleshooting capabilities

  • Autonomous work ethic, thriving in a dynamic, fast-paced environment.

  • Technical leadership acumen in a global team environment

The following will be considered a plus:

  • Bachelor's degree in statistics, mathematics, computer science, operations research, or a related quantitative field, or equivalent expertise; Master’s or PhD is a big plus

  • Understanding of how Open Source and Free Software communities work

  • Experience with development for public cloud services (AWS, GCE, Azure)

  • Experience in engineering, consulting or another field related to model serving and monitoring, model registry, explainable AI, deep neural networks, in a customer environment or supporting a data science team

  • Highly experienced in OpenShift

  • Familiarity with popular Python machine learning libraries such as PyTorch, Tensorflow, and Hugging Face

The salary range for this position is $163,420.00 - $269,640.00. Actual offer will be based on your qualifications.

Pay Transparency

● Comprehensive medical, dental, and vision coverage

● Flexible Spending Account - healthcare and dependent care

● Health Savings Account - high deductible medical plan

● Retirement 401(k) with employer match

● Paid time off and holidays

● Paid parental leave plans for all new parents

● Leave benefits including disability, paid family medical leave, and paid military leave