RHOAI Principal Software Engineer - Workload Orchestration
Job Summary:
You will contribute as a core engineer on the Workload Orchestration team. In this role, you will work on solutions that enhance Kubernetes-native workload orchestration, focusing on AI/ML jobs and enabling enterprise customers to scale and manage their AI-driven applications across the hybrid cloud. You will also help build SDKs that simplify the integration of tools like Ray, Kueue, and Kubeflow via IDEs such as Jupyter Notebooks, empowering data scientists and ML engineers to streamline complex workflows with ease.
What you will do
Architect and lead implementation of new features and solutions for OpenShift AI
Innovate and collaborate closely with upstream communities, contributing meaningfully to the open-source efforts around Kubernetes workload orchestration.
Provide technical vision and leadership on critical and high-impact projects
Ensure non-functional requirements including security, resiliency, and maintainability are met
Write high-quality code, including unit and integration tests, and collaborate with quality engineers to ensure top-tier product quality.
Use CI/CD best practices to deliver solutions as productization efforts into RHOAI
Contribute to a culture of continuous improvement by sharing recommendations and technical knowledge with team members
Collaborate with product management, other engineering and cross-functional teams to analyze and clarify business requirements
Communicate effectively to stakeholders and team members to ensure proper visibility of development efforts
Give thoughtful and prompt code reviews
Represent OpenShift AI in external engagements including industry events, customer meetings, and open source communities
Mentor, influence, and coach a distributed team of engineers
What you will bring
Advanced experience developing applications in Go, Python, or another language
Advanced experience in Kubernetes, OpenShift or other cloud-native technologies
Ability to quickly learn and guide others on using new tools and technologies
Experience with source code management tools such as Git
Proven ability to innovate and a passion for staying at the forefront of technology.
Excellent understanding of distributed systems and strong troubleshooting capabilities
Autonomous work ethic, thriving in a dynamic, fast-paced environment.
Technical leadership acumen in a global team environment and the ability to execute on team vision
Excellent written and verbal communication skills
The following will be considered a plus:
While a Bachelor’s degree or higher in computer science or a related discipline is valued, we prioritize practical experience and technical prowess
Understanding of how Open Source and Free Software communities work
Experience with development for public cloud services (AWS, GCE, Azure)
Experience working with or deploying MLOps platforms
Familiarity with data science workflows, machine learning pipelines, batch processing, and/or gang scheduling
Experience writing Kubernetes/OpenShift controllers and operators
Experience writing user-facing SDK packages in Python or similar