

What you will do
Testing the performance and accuracy of LLMs for vLLM and llama.cpp inference on different accelerators.
Making awesome test plans and cases to hit product requirements.
Doing all sorts of testing: functional, performance, regression, you name it, to make sure the product is solid.
Writing test code and frameworks to automate testing.
Monitoring, analyzing, and reporting test results and failures.
Sharing your knowledge and recommendations to help the team keep getting better.
Keeping everyone in the loop about quality efforts.
Giving good and quick code reviews.
What you will bring
At least 3 years of software testing experience.
Solid experience evaluating LLMs for performance on accelerators and accuracy (think HellaSwag, MMLU, Chatbot Arena, TruthfulQA, etc.).
Being super comfortable with Python and PyTest is a must.
Familiarity with Git, GitHub, or GitLab.
Strong experience with API and performance testing, especially for C++ and Python.
You should be a pro with Docker, Podman, and Kubernetes or Openshift.
Highly experienced in setting up CI/CD processes like Jenkins and GitHub Actions.
Understanding of core Machine Learning algorithms and basics
Bonus points if you have:
A Bachelor’s degree (or higher) in computer science, math, or a related field is cool, but practical experience and technical skills are what really matter.
Knowing how to excel in Open Source communities.
Understanding how application build pipelines work (like how code becomes binaries or Python wheels).
A track record of contributing to the vLLM community is ahugeplus!
The salary range for this position is $133,650.00 - $220,680.00. Actual offer will be based on your qualifications.
Pay Transparency
● Comprehensive medical, dental, and vision coverage
● Flexible Spending Account - healthcare and dependent care
● Health Savings Account - high deductible medical plan
● Retirement 401(k) with employer match
● Paid time off and holidays
● Paid parental leave plans for all new parents
● Leave benefits including disability, paid family medical leave, and paid military leave
משרות נוספות שיכולות לעניין אותך

Job Summary
You’ll help accelerate the development of AI prototypes by ensuring seamless platform integration, CI/CD pipelines, and other critical infrastructure to enable high-speed experimentation and iteration.
What you’ll do
Platform Support and Optimization: Design and maintain scalable, secure, and efficient platforms to support AI Catalyst team initiatives, ensuring smooth integration of AI models and workflows.
Infrastructure Management: Provide expertise in Kubernetes and cloud platforms (GCP, AWS, Azure) for container orchestration, scalable deployments, and real-time operations.
Partner with the AI Catalyst team to identify bottlenecks, remove blockers, and optimize workflows for faster delivery of AI prototypes.
Technical Leadership: Lead the implementation of critical systems (APIs, orchestration, observability, deployment) to ensure speed, reliability, and maintainability.
Cross-Functional Collaboration: Work closely with engineering, product, and design teams to align technical priorities and drive impactful AI initiatives.
Mentorship: Guide and mentor engineers, fostering a culture of technical excellence, collaboration, and rapid execution.
Demonstrate proficiency in Kubernetes for container orchestration and scalable deployments.
Mentor senior engineers and contribute to a culture of technical excellence, velocity, and pragmatic decision-making
Proactively utilize AI-assisted development tools (e.g., GitHub Copilot, Cursor, Claude Code) for code generation, auto-completion, and intelligent suggestions to accelerate development cycles and enhance code quality.
Explore and experiment with emerging AI technologies relevant to software development, proactively identifying opportunities to incorporate new AI capabilities into existing workflows and tooling.
What you’ll bring
10+ years of software engineering experience
Strong background in Python and background in C, C++, Go or Rust.
Proficiency in RHEL or other Linux distributions.
Communication Skills: Strong ability to communicate technical tradeoffs and bring clarity to ambiguous situations
Passion for AI Innovation: Enthusiasm for enabling AI initiatives that drive real-world impact and accelerate prototyping efforts.
Ability to move fast without compromising quality, thriving in environments where rapid iteration and high ownership are the norm
PoC Experience: Proven ability to work on and deliver successful Proof of Concepts or initiatives, showcasing the ability to rapidly prototype and validate ideas.
Nice to have
Experience with cloud platforms such as GCP, AWS, or Azure.
experience with building and packaging Python projects, package managers (dnf, pip), and build systems (cmake, meson)
experience in working with upstream projects and Open Source communities.
Experience in early-stage product incubation or 0→1 product delivery
Contributions to internal AI platforms, model evaluation frameworks, or observability for AI systems
This is a rare opportunity to help shape how our company brings AI innovation to life - bridging research and real-world usage at a moment when speed, safety, and product quality matter most. If you're energized by rapid iteration, high autonomy, and making AI tangible for millions, we’d love to talk.
Previous experience with hardware acceleration, either generic GPU experience or specific ones, such as CUDA and ROCm
Knowledge of AI frameworks, such as PyTorch and/or TensorFlow
Familiarity with containerization and orchestration
Understanding of Open Source development models
Experience with test-base development and agile/scrum methodologies

What you will do:
Examine new project opportunities, identify the right approach to meeting or exceeding the requirements for these projects and develop solutions with an eye toward quality, security, maintainability, supportability, performance and resilience
Work closely with Engineering, Product Management and Support stakeholders to prioritize features and bugs during all phases of development
Participate in the interaction with relevant hardware partners with a focus on getting key functionality included in their roadmap
Communicate architectural concepts and decisions to various audiences
Be a leader and mentor for more junior members of the team and help expand their skill sets
Participate in upstream AI/ML communities with a focus on learning more about the various technologies and how they might be used within our offerings
What you will bring:
Strong experience with RHEL or other Linux distributions
Strong experience with software development with programming languages such as Python, Go or similar
Problem solving and troubleshooting skills with a focus on root cause analysis
Experience with container technologies, such as Kubernetes/OpenShift and Podman
Hands-on learning and demonstrable experience with implementing and owning complex features individually and in collaboration with others
Nice to have:
Previous experience with hardware acceleration, either generic GPU experience or specific ones, such as CUDA and ROCm
Knowledge of AI frameworks, such as PyTorch and/or TensorFlow
Familiarity with containerization and orchestration
Understanding of Open Source development models
Experience with test-base development and agile/scrum methodologies

This team focuses on a few key project areas:
Developing & enhancing Backstage plugins
Improving product security
Deployment targets & installation methods
Contributing to the Backstage communities by delivering features and fixes to the upstream projects
Creating Software Templates & Actions
What you will do
Develop a deep understanding of the technologies and frameworks used within the Red Hat Developer Hub and related projects
Understand the product security concepts
Create and maintain technical documentation for new and existing functionality
Design and implement automation frameworks, including automated tests and quality checks, to support robust CI/CD pipelines
Operate effectively in a fast-paced, agile environment where both timely delivery and long-term vision are valued
Coordinate with team leads, architects, and other engineers to design and implement scalable, maintainable solutions
Perform code reviews and provide constructive feedback to peers
Actively participate in Scrum ceremonies and contribute to an agile development process
Contribute to upstream projects by submitting and reviewing patches for bug fixes and feature requests to and from the community
Coordinate and communicate effectively with engineering and leadership teams across global time zones
Help establish and refine processes that enhance release quality, consistency, and automation
Advocate for the team’s work through blog posts, community updates, and conference presentations
What will you bring
Strong understanding of software development processes and methodologies (Agile, DevOps)
Knowledge of cloud security principles and securing cloud environments
Experience with programming languages relevant to the products (e.g., typescript, Python, Go, Node.js)
Strong analytical, problem-solving, and critical thinking skills
Bachelor's degree in computer science or a related field, or equivalent working experience
Effective English Communication: Experience communicating effectively with other teams and departments across a broad organization
The following are considered a plus:
Good understanding of common security vulnerabilities, (e.g. OWASP Top Ten) including how to detect, demonstrate, mitigate and resolve them.Knowledge of Security tools (SAST, DAST, SCA, vulnerability scanners, penetration testing)
Experience in parsing and rendering YAML/JSON using tools such as jq and yq
A passion for open source technologies, especially around data solutions
Familiarity with design-thinking concepts and implementations

We are looking for a Senior Machine Learning Research Engineer with a strong research background and hands-on experience in building and optimizing deep learning models. In this role, you will explore and develop cutting-edge techniques in model compression, including pruning, quantization, knowledge distillation, and speculative decoding. You will help design and evaluate novel algorithms that bridge theory and real-world deployment.
Your Role and Responsibilities
As a core member of our ML research team, you will:
Design and conduct experiments to evaluate model compression strategies for large-scale deep learning models.
Develop scalable and modular research code in Python.
Work closely with software engineers and product teams to translate research into deployable systems.
Explore emerging techniques in efficient inference and help define future directions for model optimization.
Collaborate on publications in top-tier ML/AI conferences and contribute to open-source initiatives.
Benchmark models across hardware configurations, contributing to the broader understanding of how model optimizations affect performance in real-world deployment scenarios.
Participate in reading groups, internal workshops, and mentoring activities.
Required Qualifications
PhD in Machine Learning, Computer Science, Electric Engineering, Applied Mathematics, or a related field.
Strong foundation in machine learning algorithms and numerical optimization.
Proficiency in Python and deep learning frameworks such as PyTorch, TensorFlow, or JAX.
Strong analytical and problem-solving skills.
Experience with experimental design and empirical research, including model evaluation and benchmarking.
Excellent written and verbal communication skills, including the ability to explain complex ideas to a technical audience.
Preferred Qualifications
Familiarity with model compression techniques such as quantization, pruning, knowledge distillation, or speculative decoding.
Experience contributing to open-source machine learning projects.
Experience optimizing model performance for inference efficiency, particularly on GPUs or specialized accelerators.
Publication record in top-tier conferences (e.g., NeurIPS, ICML, ICLR, CVPR).
Comfortable navigating large codebases and collaborating in a research-oriented engineering team.
What We Offer
A dynamic and intellectually stimulating environment with opportunities to shape the future of efficient ML systems.
A collaborative team that values curiosity, creativity, and impact.
Support for academic engagement (publishing, conference travel, workshops).
Access to high-performance computing resources and state-of-the-art ML infrastructure.
Comprehensive benefits, flexible work arrangements, and opportunities for career growth.
The salary range for this position is $170,770.00 - $281,770.00. Actual offer will be based on your qualifications.
Pay Transparency
● Comprehensive medical, dental, and vision coverage
● Flexible Spending Account - healthcare and dependent care
● Health Savings Account - high deductible medical plan
● Retirement 401(k) with employer match
● Paid time off and holidays
● Paid parental leave plans for all new parents
● Leave benefits including disability, paid family medical leave, and paid military leave

What You Will Do
Support the user experience design process from concept through implementation.
Collaborate with product managers, analysts, project managers, copywriters, engineers, and researchers to define and meet design goals.
Translate business requirements and user insights into actionable and effective design solutions.
Create and deliver UX artifacts including journey maps, wireframes, high-fidelity mockups, interactive prototypes, and production-ready specifications.
Advocate for user needs and accessibility throughout the design process.
Evaluate and improve existing interfaces by identifying usability issues and proposing enhancements.
Articulate and explain design rationale to stakeholders.
Evaluate solutions against desired outcomes.
What You Will Bring
5+ years of experience designing user experiences for applications, especially for technical or enterprise audiences.
Skilled in designing user interfaces for diverse platforms, including GUIs, CLIs, APIs, and configuration workflows.
Proven ability to design for technical users (e.g., developers, sysadmins, platform engineers)
Strong technical acumen and ability to quickly grasp and simplify complex domains
Strong portfolio showcasing end-to-end UX design work and high-quality deliverables.
Deep understanding of user-centered design principles and accessibility standards.
Proficiency in design tools such as Figma, Sketch, Adobe XD, or similar.
Excellent communication and presentation skills with the ability to advocate for design solutions.
Ability to work independently and manage multiple priorities in a fast-paced environment.
Comfort working with ambiguity and driving clarity through design.
The Following Are Considered a Plus
Experience designing user experiences for AI/ML-powered applications or features.
Familiarity with common AI/ML concepts and their application in user interfaces.
Understanding of ethical AI design principles and considerations.
Experience working in open source software environments.
Familiarity with agile methodologies and design systems.
Knowledge of front-end development practices (HTML, CSS, JS) or collaboration with developers.
Experience conducting user research or usability testing.
The salary range for this position is $116,270.00 - $191,840.00. Actual offer will be based on your qualifications.
Pay Transparency
● Comprehensive medical, dental, and vision coverage
● Flexible Spending Account - healthcare and dependent care
● Health Savings Account - high deductible medical plan
● Retirement 401(k) with employer match
● Paid time off and holidays
● Paid parental leave plans for all new parents
● Leave benefits including disability, paid family medical leave, and paid military leave

Job Responsibilities
Work with Red Hat engineers and research project teams to develop, test, deploy and operate software for distributed research environments built with OpenShift, OpenStack, OpenShift AI, InstructLab and other open source software.
Work with Red Hat product development teams to explore and help transition selected new functionality into supported products
Develop, deploy, upgrade, monitor and troubleshoot software in research environments such as the Mass Open Cloud Alliance, as well as other university research computing environments in North America
Identify, track and resolve issues as part of a worldwide development team analyzing distributed systems and data using GitOps techniques and tools
Contribute software to open source projects to help advance research computing
As part of the CTO office, write, speak and promote software development research projects, as well as student-oriented development and education activities such as hackathons, tutorials and independent student projects.
Requirements
Software development experience with multiple programming languages (C++, Python, Go)
Experience with software development for distributed systems and AI systems, particularly accelerators, virtual machines and containers
Deep expertise in at least one broad technical area (e.g. operating systems), with a demonstrated understanding of subsystems and their interactions in real-world use
Ability to decompose large complex systems and development tasks and work as a technical leader in a distributed team to release new functionality and resolve issues with deployed systems
Experience maintaining and contributing to linux software (Red Hat Enterprise Linux (RHEL), CentOS, or Fedora preferred)
Detailed understanding of Agile software development processes
Detailed knowledge of development tools, repository management, and CI/CD platforms such as Ansible
Experience working with users and design engineers in a research or production computing environment
Demonstrated ability to work with independence on software design and implementation, while providing technical leadership and some mentoring to a larger team of developers and system engineers
Good oral and written communications
PhD, Master’s or Bachelor’s degree, with work or academic project experience
The salary range for this position is $111,260.00 - $183,580.00. Actual offer will be based on your qualifications.
Pay Transparency
● Comprehensive medical, dental, and vision coverage
● Flexible Spending Account - healthcare and dependent care
● Health Savings Account - high deductible medical plan
● Retirement 401(k) with employer match
● Paid time off and holidays
● Paid parental leave plans for all new parents
● Leave benefits including disability, paid family medical leave, and paid military leave

What you will do
Testing the performance and accuracy of LLMs for vLLM and llama.cpp inference on different accelerators.
Making awesome test plans and cases to hit product requirements.
Doing all sorts of testing: functional, performance, regression, you name it, to make sure the product is solid.
Writing test code and frameworks to automate testing.
Monitoring, analyzing, and reporting test results and failures.
Sharing your knowledge and recommendations to help the team keep getting better.
Keeping everyone in the loop about quality efforts.
Giving good and quick code reviews.
What you will bring
At least 3 years of software testing experience.
Solid experience evaluating LLMs for performance on accelerators and accuracy (think HellaSwag, MMLU, Chatbot Arena, TruthfulQA, etc.).
Being super comfortable with Python and PyTest is a must.
Familiarity with Git, GitHub, or GitLab.
Strong experience with API and performance testing, especially for C++ and Python.
You should be a pro with Docker, Podman, and Kubernetes or Openshift.
Highly experienced in setting up CI/CD processes like Jenkins and GitHub Actions.
Understanding of core Machine Learning algorithms and basics
Bonus points if you have:
A Bachelor’s degree (or higher) in computer science, math, or a related field is cool, but practical experience and technical skills are what really matter.
Knowing how to excel in Open Source communities.
Understanding how application build pipelines work (like how code becomes binaries or Python wheels).
A track record of contributing to the vLLM community is ahugeplus!
The salary range for this position is $133,650.00 - $220,680.00. Actual offer will be based on your qualifications.
Pay Transparency
● Comprehensive medical, dental, and vision coverage
● Flexible Spending Account - healthcare and dependent care
● Health Savings Account - high deductible medical plan
● Retirement 401(k) with employer match
● Paid time off and holidays
● Paid parental leave plans for all new parents
● Leave benefits including disability, paid family medical leave, and paid military leave
משרות נוספות שיכולות לעניין אותך