Job opening: Senior UX Designer - RHEL AI at Red Hat, United States, Boston

What you will do

Testing the performance and accuracy of LLMs for vLLM and llama.cpp inference on different accelerators.
Making awesome test plans and cases to hit product requirements.
Doing all sorts of testing: functional, performance, regression, you name it, to make sure the product is solid.
Writing test code and frameworks to automate testing.
Monitoring, analyzing, and reporting test results and failures.
Sharing your knowledge and recommendations to help the team keep getting better.
Keeping everyone in the loop about quality efforts.
Giving good and quick code reviews.

What you will bring

At least 3 years of software testing experience.
Solid experience evaluating LLMs for performance on accelerators and accuracy (think HellaSwag, MMLU, Chatbot Arena, TruthfulQA, etc.).
Being super comfortable with Python and PyTest is a must.
Familiarity with Git, GitHub, or GitLab.
Strong experience with API and performance testing, especially for C++ and Python.
You should be a pro with Docker, Podman, and Kubernetes or OpenShift.
Highly experienced in setting up CI/CD processes with tools like Jenkins and GitHub Actions.
Understanding of core Machine Learning algorithms and basics.

Bonus points if you have:

A Bachelor’s degree (or higher) in computer science, math, or a related field is cool, but practical experience and technical skills are what really matter.
Knowing how to excel in Open Source communities.
Understanding how application build pipelines work (like how code becomes binaries or Python wheels).
A track record of contributing to the vLLM community is a huge plus!

The salary range for this position is $133,650.00 - $220,680.00. Actual offer will be based on your qualifications.

Pay Transparency

● Comprehensive medical, dental, and vision coverage

● Flexible Spending Account - healthcare and dependent care

● Health Savings Account - high deductible medical plan

● Retirement 401(k) with employer match

● Paid time off and holidays

● Paid parental leave plans for all new parents

● Leave benefits including disability, paid family medical leave, and paid military leave

More jobs that may interest you

Nvidia Senior Software Engineer LLM Inference China, Shanghai

Nvidia Software Engineer LLM Inference China, Shanghai

Nvidia DL Performance Software Engineer - LLM Inference United States, California

Red Hat Senior Performance Resilience Engineer - LLM Inference United States, Colorado, Denver

05.09.2025

Red Hat Principal Software Engineer AI Catalyst Platform team United States, Massachusetts, Boston

Job Summary

You’ll help accelerate the development of AI prototypes by ensuring seamless platform integration, CI/CD pipelines, and other critical infrastructure to enable high-speed experimentation and iteration.

What you’ll do

Platform Support and Optimization: Design and maintain scalable, secure, and efficient platforms to support AI Catalyst team initiatives, ensuring smooth integration of AI models and workflows.
Infrastructure Management: Provide expertise in Kubernetes and cloud platforms (GCP, AWS, Azure) for container orchestration, scalable deployments, and real-time operations.
Partner with the AI Catalyst team to identify bottlenecks, remove blockers, and optimize workflows for faster delivery of AI prototypes.
Technical Leadership: Lead the implementation of critical systems (APIs, orchestration, observability, deployment) to ensure speed, reliability, and maintainability.
Cross-Functional Collaboration: Work closely with engineering, product, and design teams to align technical priorities and drive impactful AI initiatives.
Mentorship: Guide and mentor engineers, fostering a culture of technical excellence, collaboration, and rapid execution.
Demonstrate proficiency in Kubernetes for container orchestration and scalable deployments.
Mentor senior engineers and contribute to a culture of technical excellence, velocity, and pragmatic decision-making
Proactively utilize AI-assisted development tools (e.g., GitHub Copilot, Cursor, Claude Code) for code generation, auto-completion, and intelligent suggestions to accelerate development cycles and enhance code quality.
Explore and experiment with emerging AI technologies relevant to software development, proactively identifying opportunities to incorporate new AI capabilities into existing workflows and tooling.

What you’ll bring

10+ years of software engineering experience
Strong background in Python, plus experience with C, C++, Go, or Rust.
Proficiency in RHEL or other Linux distributions.
Communication Skills: Strong ability to communicate technical tradeoffs and bring clarity to ambiguous situations
Passion for AI Innovation: Enthusiasm for enabling AI initiatives that drive real-world impact and accelerate prototyping efforts.
Ability to move fast without compromising quality, thriving in environments where rapid iteration and high ownership are the norm
PoC Experience: Proven ability to work on and deliver successful Proof of Concepts or initiatives, showcasing the ability to rapidly prototype and validate ideas.

Nice to have

Experience with cloud platforms such as GCP, AWS, or Azure.
Experience with building and packaging Python projects, package managers (dnf, pip), and build systems (cmake, meson).
Experience in working with upstream projects and Open Source communities.
Experience in early-stage product incubation or 0→1 product delivery
Contributions to internal AI platforms, model evaluation frameworks, or observability for AI systems

This is a rare opportunity to help shape how our company brings AI innovation to life - bridging research and real-world usage at a moment when speed, safety, and product quality matter most. If you're energized by rapid iteration, high autonomy, and making AI tangible for millions, we’d love to talk.

Previous experience with hardware acceleration, either generic GPU experience or specific ones, such as CUDA and ROCm
Knowledge of AI frameworks, such as PyTorch and/or TensorFlow
Familiarity with containerization and orchestration
Understanding of Open Source development models
Experience with test-based development and agile/scrum methodologies

19.07.2025

Red Hat Senior Principal Staff Software Engineer Generalist AI Engin... United States, Massachusetts, Boston

What you will do:

Examine new project opportunities, identify the right approach to meeting or exceeding the requirements for these projects and develop solutions with an eye toward quality, security, maintainability, supportability, performance and resilience
Work closely with Engineering, Product Management and Support stakeholders to prioritize features and bugs during all phases of development
Participate in the interaction with relevant hardware partners with a focus on getting key functionality included in their roadmap
Communicate architectural concepts and decisions to various audiences
Be a leader and mentor for more junior members of the team and help expand their skill sets
Participate in upstream AI/ML communities with a focus on learning more about the various technologies and how they might be used within our offerings

What you will bring:

Strong experience with RHEL or other Linux distributions
Strong experience with software development with programming languages such as Python, Go or similar
Problem solving and troubleshooting skills with a focus on root cause analysis
Experience with container technologies, such as Kubernetes/OpenShift and Podman
Hands-on learning and demonstrable experience with implementing and owning complex features individually and in collaboration with others

Nice to have:

Previous experience with hardware acceleration, either generic GPU experience or specific ones, such as CUDA and ROCm
Knowledge of AI frameworks, such as PyTorch and/or TensorFlow
Familiarity with containerization and orchestration
Understanding of Open Source development models
Experience with test-based development and agile/scrum methodologies

11.07.2025

Red Hat Senior Software Engineer - Ansible Automation Platform United States, Massachusetts, Boston

This team focuses on a few key project areas:

Developing & enhancing Backstage plugins
Improving product security
Deployment targets & installation methods
Contributing to the Backstage communities by delivering features and fixes to the upstream projects
Creating Software Templates & Actions

What you will do

Develop a deep understanding of the technologies and frameworks used within the Red Hat Developer Hub and related projects
Understand the product security concepts
Create and maintain technical documentation for new and existing functionality
Design and implement automation frameworks, including automated tests and quality checks, to support robust CI/CD pipelines
Operate effectively in a fast-paced, agile environment where both timely delivery and long-term vision are valued
Coordinate with team leads, architects, and other engineers to design and implement scalable, maintainable solutions
Perform code reviews and provide constructive feedback to peers
Actively participate in Scrum ceremonies and contribute to an agile development process
Contribute to upstream projects by submitting and reviewing patches for bug fixes and feature requests to and from the community
Coordinate and communicate effectively with engineering and leadership teams across global time zones
Help establish and refine processes that enhance release quality, consistency, and automation
Advocate for the team’s work through blog posts, community updates, and conference presentations

What you will bring

Strong understanding of software development processes and methodologies (Agile, DevOps)
Knowledge of cloud security principles and securing cloud environments
Experience with programming languages relevant to the products (e.g., TypeScript, Python, Go, Node.js)
Strong analytical, problem-solving, and critical thinking skills
Bachelor's degree in computer science or a related field, or equivalent working experience
Effective English Communication: Experience communicating effectively with other teams and departments across a broad organization

The following are considered a plus:

Good understanding of common security vulnerabilities (e.g., OWASP Top Ten), including how to detect, demonstrate, mitigate, and resolve them.
Knowledge of security tools (SAST, DAST, SCA, vulnerability scanners, penetration testing)
Experience in parsing and rendering YAML/JSON using tools such as jq and yq
A passion for open source technologies, especially around data solutions
Familiarity with design-thinking concepts and implementations

05.07.2025

Red Hat Senior Machine Learning Research Engineer United States, Massachusetts, Boston

We are looking for a Senior Machine Learning Research Engineer with a strong research background and hands-on experience in building and optimizing deep learning models. In this role, you will explore and develop cutting-edge techniques in model compression, including pruning, quantization, knowledge distillation, and speculative decoding. You will help design and evaluate novel algorithms that bridge theory and real-world deployment.

Your Role and Responsibilities

As a core member of our ML research team, you will:

Design and conduct experiments to evaluate model compression strategies for large-scale deep learning models.
Develop scalable and modular research code in Python.
Work closely with software engineers and product teams to translate research into deployable systems.
Explore emerging techniques in efficient inference and help define future directions for model optimization.
Collaborate on publications in top-tier ML/AI conferences and contribute to open-source initiatives.
Benchmark models across hardware configurations, contributing to the broader understanding of how model optimizations affect performance in real-world deployment scenarios.
Participate in reading groups, internal workshops, and mentoring activities.

Required Qualifications

PhD in Machine Learning, Computer Science, Electrical Engineering, Applied Mathematics, or a related field.
Strong foundation in machine learning algorithms and numerical optimization.
Proficiency in Python and deep learning frameworks such as PyTorch, TensorFlow, or JAX.
Strong analytical and problem-solving skills.
Experience with experimental design and empirical research, including model evaluation and benchmarking.
Excellent written and verbal communication skills, including the ability to explain complex ideas to a technical audience.

Preferred Qualifications

Familiarity with model compression techniques such as quantization, pruning, knowledge distillation, or speculative decoding.
Experience contributing to open-source machine learning projects.
Experience optimizing model performance for inference efficiency, particularly on GPUs or specialized accelerators.
Publication record in top-tier conferences (e.g., NeurIPS, ICML, ICLR, CVPR).
Comfortable navigating large codebases and collaborating in a research-oriented engineering team.

What We Offer

A dynamic and intellectually stimulating environment with opportunities to shape the future of efficient ML systems.
A collaborative team that values curiosity, creativity, and impact.
Support for academic engagement (publishing, conference travel, workshops).
Access to high-performance computing resources and state-of-the-art ML infrastructure.
Comprehensive benefits, flexible work arrangements, and opportunities for career growth.

The salary range for this position is $170,770.00 - $281,770.00. Actual offer will be based on your qualifications.

Pay Transparency

● Comprehensive medical, dental, and vision coverage

● Flexible Spending Account - healthcare and dependent care

● Health Savings Account - high deductible medical plan

● Retirement 401(k) with employer match

● Paid time off and holidays

● Paid parental leave plans for all new parents

● Leave benefits including disability, paid family medical leave, and paid military leave

05.07.2025

Red Hat Senior UX Designer – Lightspeed United States, Massachusetts, Boston

What You Will Do

Support the user experience design process from concept through implementation.
Collaborate with product managers, analysts, project managers, copywriters, engineers, and researchers to define and meet design goals.
Translate business requirements and user insights into actionable and effective design solutions.
Create and deliver UX artifacts including journey maps, wireframes, high-fidelity mockups, interactive prototypes, and production-ready specifications.
Advocate for user needs and accessibility throughout the design process.
Evaluate and improve existing interfaces by identifying usability issues and proposing enhancements.
Articulate and explain design rationale to stakeholders.
Evaluate solutions against desired outcomes.

What You Will Bring

5+ years of experience designing user experiences for applications, especially for technical or enterprise audiences.
Skilled in designing user interfaces for diverse platforms, including GUIs, CLIs, APIs, and configuration workflows.
Proven ability to design for technical users (e.g., developers, sysadmins, platform engineers)
Strong technical acumen and ability to quickly grasp and simplify complex domains
Strong portfolio showcasing end-to-end UX design work and high-quality deliverables.
Deep understanding of user-centered design principles and accessibility standards.
Proficiency in design tools such as Figma, Sketch, Adobe XD, or similar.
Excellent communication and presentation skills with the ability to advocate for design solutions.
Ability to work independently and manage multiple priorities in a fast-paced environment.
Comfort working with ambiguity and driving clarity through design.

The Following Are Considered a Plus

Experience designing user experiences for AI/ML-powered applications or features.
Familiarity with common AI/ML concepts and their application in user interfaces.
Understanding of ethical AI design principles and considerations.
Experience working in open source software environments.
Familiarity with agile methodologies and design systems.
Knowledge of front-end development practices (HTML, CSS, JS) or collaboration with developers.
Experience conducting user research or usability testing.

The salary range for this position is $116,270.00 - $191,840.00. Actual offer will be based on your qualifications.

Pay Transparency

● Comprehensive medical, dental, and vision coverage

● Flexible Spending Account - healthcare and dependent care

● Health Savings Account - high deductible medical plan

● Retirement 401(k) with employer match

● Paid time off and holidays

● Paid parental leave plans for all new parents

● Leave benefits including disability, paid family medical leave, and paid military leave

04.07.2025

Red Hat Senior Engineer - Research United States, Massachusetts, Boston

Job Responsibilities

Work with Red Hat engineers and research project teams to develop, test, deploy and operate software for distributed research environments built with OpenShift, OpenStack, OpenShift AI, InstructLab and other open source software.
Work with Red Hat product development teams to explore and help transition selected new functionality into supported products
Develop, deploy, upgrade, monitor and troubleshoot software in research environments such as the Mass Open Cloud Alliance, as well as other university research computing environments in North America
Identify, track and resolve issues as part of a worldwide development team analyzing distributed systems and data using GitOps techniques and tools
Contribute software to open source projects to help advance research computing
As part of the CTO office, write, speak and promote software development research projects, as well as student-oriented development and education activities such as hackathons, tutorials and independent student projects.

Requirements

Software development experience with multiple programming languages (C++, Python, Go)
Experience with software development for distributed systems and AI systems, particularly accelerators, virtual machines and containers
Deep expertise in at least one broad technical area (e.g. operating systems), with a demonstrated understanding of subsystems and their interactions in real-world use
Ability to decompose large complex systems and development tasks and work as a technical leader in a distributed team to release new functionality and resolve issues with deployed systems
Experience maintaining and contributing to Linux software (Red Hat Enterprise Linux (RHEL), CentOS, or Fedora preferred)
Detailed understanding of Agile software development processes
Detailed knowledge of development tools, repository management, and CI/CD platforms such as Ansible
Experience working with users and design engineers in a research or production computing environment
Demonstrated ability to work with independence on software design and implementation, while providing technical leadership and some mentoring to a larger team of developers and system engineers
Good oral and written communication skills
PhD, Master’s or Bachelor’s degree, with work or academic project experience

The salary range for this position is $111,260.00 - $183,580.00. Actual offer will be based on your qualifications.

Pay Transparency

● Comprehensive medical, dental, and vision coverage

● Flexible Spending Account - healthcare and dependent care

● Health Savings Account - high deductible medical plan

● Retirement 401(k) with employer match

● Paid time off and holidays

● Paid parental leave plans for all new parents

● Leave benefits including disability, paid family medical leave, and paid military leave

Red Hat Senior Software Engineer LLM Evaluation vLLM Inference

United States, Massachusetts, Boston

760693565

05.09.2025
