

What you will do:
Own the resilience testing roadmap for vLLM and llm-d: define resilience indicators, prioritize fault scenarios, and establish go/no-go gates for releases and CI/CD
Design GPU/accelerator-aware fault experiments that target vLLM and the stack beneath it (drivers, GPU Operator/DevicePlugin, NCCL/collectives, storage/network paths, NUMA/topology)
Build an automated harness (preferably extending krkn-chaos (https://github.com/krkn-chaos/krkn) ) to run controlled experiments with scoped blast radius, and evidence capture (logs, traces, metrics)
Integrate fault signals into pipelines (GitHub Actions or otherwise) as resilience gates alongside performance gates
Develop detection and diagnostics: dashboards and alerts for pre-fault signals (e.g., vLLM queue depth, GPU throttling, P2P downgrades, KV-cache pressure, allocator fragmentation)
Triage and root-cause resilience regressions from field/customer issues; upstream bugs and fixes to vLLM and llm-d
Explore and experiment with emerging AI technologies relevant to software development and testing, proactively identifying opportunities to incorporate new AI capabilities into existing workflows and tooling.
Publish learnings (internal/external): failure patterns, playbooks, SLO templates, experiment libraries, and reference architectures; present at internal/external forums
What you will bring:
3+ years in reliability, and/or performance engineering on large-scale distributed systems
Expertise in systems‑level software design
Expertise with Kubernetes and modern LLM inference server stack (e.g., vLLM, TensorRT-LLM, TGI)
Observability & forensics skills with experience with Prometheus/Grafana, OpenTelemetry tracing, eBPF/BPFTrace/perf, Nsight Systems, PyTorch Profiler; adept at converting raw signals into actionable narratives.
Fluency in Python (data & ML), strong Bash/Linux skills
Exceptional communication skills - able to translate raw data into customer value and executive narratives
Commitment to open‑source values and upstream collaboration
The following is considered a plus:
Master’s or PhD in Computer Science, AI, or a related field
History of upstream contributions and community leadership, public talks or blogs on resilience, or chaos engineering
Competitive benchmarking and failure characterization at scale.
The salary range for this position is $127,890.00 - $211,180.00. Actual offer will be based on your qualifications.
Pay Transparency
● Comprehensive medical, dental, and vision coverage
● Flexible Spending Account - healthcare and dependent care
● Health Savings Account - high deductible medical plan
● Retirement 401(k) with employer match
● Paid time off and holidays
● Paid parental leave plans for all new parents
● Leave benefits including disability, paid family medical leave, and paid military leave
משרות נוספות שיכולות לעניין אותך

What You Will Do
• Control project requirements, scope, and change management issues
• Determine, monitor, and review all project economics to include costs, operational budgets, staffing requirements, resources, and risk
• Ensure adherence to legally binding requirements
• Facilitate development of recommended project control solutions to be used for planning, scheduling, and tracking projects through integration of various project management tools
• Manage project forecast projections to both client and internally within Red Hat
• Work with management on project proposals, bids, contracts, estimates, and schedules and contribute to project estimation process
• Plan, schedule, monitor, and report on activities related to the project, including subcontractor monitoring
• Establish appropriate metrics for measuring key project criteria
• Develop project control and reporting procedures and manage changes in operational plan
• Undertake status review meetings among project team members and clients
• Integrate and use specific industry PM and technical delivery methodologies (e.g. PM methodologies based on Agile, Red Hat Services Project Management Methodology, Systems/Software Development, Product Development)
• Assist in the training of the project team on application of appropriate procedures and motivate team members to accomplish project goals, meet established schedules and resolve technical and operational issues
• Maintain awareness on emerging technologies and Project Management techniques
• Travel to work alongside regional accounts and service provider partners in person
What You Will Bring
• 3-5 years of relevant program and project management experience; preferably managing large, complex enterprise IT projects
• Experience using various project management and agile tools, frameworks, and methodologies
• Track record of successfully managing projects from inception to closure
• Experience with establishing and contributing to process improvements
• Experience in dealing with projects and multicultural and multidisciplinary teams
• Ability to work effectively across a widely diverse management team, as well as a broad spectrum of internal and external partners
• Exceptional written and verbal communication skills, inclusive of the ability to communicate and integrate with executive level customer personas
• Solid organizational skills and attention to detail; ability to handle multiple priorities in a fast-paced environment
• Healthy balance of business and technical background
• Ability to articulate the Project Management value proposition and be an evangelist for the Red Hat Services PM methodology and techniques
• Bachelor's degree in a related field or equivalent experience
• Scrum master, lean, product management, and project management certifications and computer science training are a plus
• Experience with and knowledge of open source software technologies preferred
• Ability to travel frequently to work alongside customers and visit customer sites - up to 75% annually
The salary range for this position is $105,860.00 - $169,340.00. Actual offer will be based on your qualifications.
Pay Transparency
● Comprehensive medical, dental, and vision coverage
● Flexible Spending Account - healthcare and dependent care
● Health Savings Account - high deductible medical plan
● Retirement 401(k) with employer match
● Paid time off and holidays
● Paid parental leave plans for all new parents
● Leave benefits including disability, paid family medical leave, and paid military leave
משרות נוספות שיכולות לעניין אותך

About The Job
We’re not just looking for candidates who meet all the requirements, we’re looking for people who are excited about working with us and growing their career at Red Hat. We want to be transparent about what would make you most successful in the role but if you are excited by reading the job description and feel like you are right for the role, we encourage you to apply. This position could lead to regular on-site work with clients across North America, so a willingness to travel to customer locations up to 30-40 weeks per year is required. Applicants must reside within close proximity to a primary airport.
What You Will Do
•Assist in supporting customers in building enterprise technology infrastructures that are scalable, optimally managed, and adaptable to technological improvements using Red Hat technological solutions
•Focus on customer IT Automation and Enterprise Cloud Infrastructure solutions through deep technical hands-on work in these fields
•Continuously learn, grow, and adapt to new skills and technologies
•Work alongside leading financial services, retail, telecommunication, and institutional customers, though virtual and on-site collaboration
After joining Red Hat, you will go through an intensive customized training program on Red Hat technologies and Consulting solutions. Here's how your skill set will evolve and what you'll learn during your first year in the role:
•A baseline understanding of how to build technical solutions, integrated with existing enterprise systems, with technical guidance
•Learn technologies and consulting skills to enhance your abilities through enablements designed and taught by Red Hat experts and Red Hat certification
•Gain exposure and collaboration within Red Hat Services & the larger organization through everyday networking and community events
Within 3 months, be ready to deliver a project by attaining the following:
•Knowledge of how a customer use case can be developed into a project plan and how those requirements align with Red Hat’s technologies
•Continue expanding your knowledge and network, both internal and external, through enablement, communities, customers, and meetups
Within 6 months, begin to demonstrate technical leadership by accomplishing the following:
•Successfully implementing enterprise solutions in customer environments as part of delivery team
•Engage and share with our internal and external communities of practice on lessons learned, best practices, and how-tos
What You Will Bring
•Experience with delivering an technical implementation as part of a project or team
•Capable of contributing to technical projects through sustained teamwork and collaboration, ensuring the development of practical solutions.
•Ability to be well-organized in a fast-paced, ever-changing environment
•Ability to interact directly with customers across roles and organizations and clearly communicate technical and non-technical concepts
•Demonstrates ability to adapt quickly to new and unknown situations, ranging from managing deliverables to learning new technologies.
•Practical experience with at least one coding or scripting language. Examples include but are not limited to Java, Python, C++, YAML, Bash, JavaScript, React, etc.
•Familiarity with backend software development methodologies, frameworks, and development principles, including Agile, Code Management (Git), Software Development Life Cycle, etc.
•Interest in diving deep into backend software development, IT automation, cloud infrastructure, CI/CD, DevOps, and Artificial Intelligence
•Knowledge of and some experience with at least one Red Hat technology such as Red Hat Enterprise Linux, Red Hat OpenShift, or Red Hat Ansible is a plus
•Prior experience working in a customer-facing role is preferred
•Familiarity with open source software and open source as a business model is a plus
•Knowledge of Red Hat's product portfolio and subscription business model is a plus
The salary range for this position is $75,320.00 - $120,480.00. Actual offer will be based on your qualifications.
Pay Transparency
● Comprehensive medical, dental, and vision coverage
● Flexible Spending Account - healthcare and dependent care
● Health Savings Account - high deductible medical plan
● Retirement 401(k) with employer match
● Paid time off and holidays
● Paid parental leave plans for all new parents
● Leave benefits including disability, paid family medical leave, and paid military leave
משרות נוספות שיכולות לעניין אותך

About the job:
The program manager will be responsible for executing the tooling strategy and management system for the Red Hat Tech Sales Practices team. This role will align very closely to the regional and global Red Hat tech sales functions, as well as APM and Operations team. You will be responsible for leading the adoption management system: Defining metrics, target methodology, and reporting process. This includes developing reports and analyzing data sets to provide guidance to our APM and Operations team. You will be responsible for leading the strategy definition and facilitating execution around tooling to make the Tech Sales team more productive, with a focus on Red Hat Sales Cloud.. This includes preparing briefings to senior leadership, aligning stakeholders, facilitating decision-making, tracking and reporting progress, and identifying and mitigating risks. Being highly organized is a must as you will be managing multiple initiatives at a time. You’ll need to have clear written and verbal communication and documented planning skills to succeed in this role.
This position has a direct reporting structure to the Global Sr Director, Tech Sales Practices.
What you will do:
Execute the Productivity and Tooling strategy to ensure SA&A associates and leaders are able to efficiently execute their role.
Use of Red Hat Sales Cloud, analytics tools and AI to solve tech sales challenges.
Ensure the defined management system and metrics are consistently executed across all regions, focusing on product adoption
Provide data analytics and insights to leadership
Identify and execute task automation to make the tech sales more efficient
SUCCESS MEASURES AND KPIs
Increased product consumption and adoption maturity
Account Revenue Growth
What you will bring:
Analytical and process-oriented mindset; ability to mine data to make data-driven decisions
Experience with account management tooling, eg Red Hat Sales Cloud
Experience with reporting and analytics tooling, eg, Snowflake, Tableau, Smartsheet
Experience and tenacity to drive transformational change through a large, worldwide organization
Demonstrated ability to work in a distributed, multicultural organization in a global and geo matrix.
Very organized with strong project management skills; ability to implement structure (processes, frameworks) into daily functions
Comfortable managing and solving complex business problems with no clear solution
Strong communication skills, both oral and written and presentation skills
Customer service focus, including the ability to deliver multiple priority projects with high customer satisfaction in a rapidly changing environment.
The salary range for this position is $86,770.00 - $138,850.00. Actual offer will be based on your qualifications.
Pay Transparency
● Comprehensive medical, dental, and vision coverage
● Flexible Spending Account - healthcare and dependent care
● Health Savings Account - high deductible medical plan
● Retirement 401(k) with employer match
● Paid time off and holidays
● Paid parental leave plans for all new parents
● Leave benefits including disability, paid family medical leave, and paid military leave
משרות נוספות שיכולות לעניין אותך

About The Job
Applicants with the U.S. Eastern Time or Central or Time Zones are preferred and U.S. citizenship is required to fulfill a government contract.
What You Will Do
• Support enterprise customers in implementing automated and containerized cloud application platform solutions
• Learn new technologies quickly, including topics like container orchestration, container registries, container build strategies, and microservices on container platforms
• Perform technical reviews and share knowledge to proactively identify and prevent issues
• Gain understanding of customer technical infrastructures and environments, hardware, and offerings
• Collaborate with internal engineering, development, product management, and technical support teams to resolve issues
• Manage customer cases and maintain clear and concise case and customer documentation; craft customer engagement plans
• Build trust with customers and serve as their advocate within Red Hat; analyze and present periodic reviews of operational performance to customer leadership
• Manage and grow customer relationships by delivering attentive, relationship-based support
• Provide hands on keyboard support as needed
• Travel as needed to visit customers
What You Will Bring
• Must be a U.S. citizen to fulfill government contract
• A strong combination of technical and customer-facing skills and a willingness to embrace and further develop both
• Knowledge of Platform-as-a-Service (PaaS) cloud solutions like Red Hat OpenShift
• Experience with Docker containers and Kubernetes container cluster manager
• Experience with cloud management, like Red Hat Cloud Suite, and IT automation, including Red Hat Ansible Automation Platform
• Competent comprehension of enterprise architecture and strategic business drivers
• Direct experience with a variety of hardware vendors
• Ability to manage multiple issues and projects with shifting priorities and timelines
• Outstanding written and verbal communication skills; ability to convey complex information to customers clearly and concisely
• Comprehension of continuous integration (CI) and continuous delivery (CD) concepts
• Familiarity with source code management tools like Git or SVN
• Software engineering background is a plus; experience with RPM-based Linux and Java technologies
• Willingness to provide after hours coverage, including on-call duty
• Ability to make occasional on-site customer visits
The following are considered a plus:
• Bachelor's degree or equivalent in a technology-related discipline, ideally computer science or engineering
• U.S. security clearance
• Red Hat Certified Engineer (RHCE) certification
• Experience working in DevOps environments
• Experience deploying applications in cloud environments and developing containerized applications
• Experience with cloud computing and different cloud providers like Microsoft Azure, Amazon Web Services, Google Compute Platform, or IBM Cloud
• Experience with Red Hat technologies like Red Hat Enterprise Linux, Satellite, OpenShift Container Platform, OpenStack, Ansible Automation Platform, etc.
The salary range for this position is $94,550.00 - $151,170.00. Actual offer will be based on your qualifications.
Pay Transparency
● Comprehensive medical, dental, and vision coverage
● Flexible Spending Account - healthcare and dependent care
● Health Savings Account - high deductible medical plan
● Retirement 401(k) with employer match
● Paid time off and holidays
● Paid parental leave plans for all new parents
● Leave benefits including disability, paid family medical leave, and paid military leave
משרות נוספות שיכולות לעניין אותך

About The Job
This position requires regular on-site work with clients across North America, so a willingness to travel to customer locations 30-40 weeks per year is required. Applicants must reside within close proximity to a primary airport.
What You Will Do
• Implement automated, containerized cloud application platform solutions with a focus on infrastructure concerns including networking, storage, virtualization, security, logging, monitoring, and high availability and system resilience
• Learn new technologies quickly, including container orchestration, container registries, container build strategies, cloud storage, and software-defined networks
• Travel frequently to work alongside leading financial services, retail, telecommunication, and institutional customers
After joining Red Hat, you will go through an intensive training program on Kubernetes and OpenShift technologies and related DevOps and GitOps topics. Here's how your skill set will evolve and what you'll learn during your first year in the role:
• Understanding of how to build production-ready container and virtualization platforms, integrated with existing enterprise systems
• Knowledge of how to deploy source code into running, scalable containers and virtual machines in automated fashion at enterprise scale
• Practical experience with our offerings
Within 6 months, be ready to implement a routine container platform project by attaining the following:
• Successful, collaborative delivery of customer requirements using Red Hat OpenShift
• Knowledge of how a customer use case can be developed into a project plan and how those requirements align with Red Hat’s technologies
• Understanding of how Red Hat’s technologies can transform software delivery (DevOps/GitOps) practices at large organizations
Within 12 months, begin to demonstrate technical leadership in container platforms by accomplishing the following:
• Successfully implementing complex, large-scale container platform solutions in challenging customer environments
• Helping other peers learn DevOps/GitOps paired with container technologies
• Contributing lessons learned, best practices, and how-tos to our internal and external communities of practice
• Applying new technologies, frameworks, or methodologies to container platforms
What You Will Bring
• Experience leading successful modern cloud platform consulting engagements
• Broad and deep technical experience with VMware ESXi software, including vCenter, VM lifecycle operations using VMware tools
• Logging and alerting functions for SRE operations available from VMware and how they integrate with non-VMware enterprise systems, such as Splunk
• Expert level with VMware VM foundational technologies for networks ([S/DV]Switches/NSX)
• Expert level with VMware VM foundational technologies for storage (Datastores, vSAN, vVols)
• Knowledge of common Add-Ons or third party tools, including Aria/vRealize Suite, SRM, Backup software, Performance tools)
• Experience with technologies including OpenStack, Red Hat Virtualization, Microsoft Hyper-V, Amazon Web Services, and Microsoft Azure a plus
• Experience across one or more vertical industry areas
• Demonstrated track record of working in a strategic advisory role to senior IT and business executives
• Applied knowledge and experience working in agile, scrum, and DevOps teams
• Excellent written, verbal communication and presentation skills
• Willingness to travel to customer locations about 30-40 weeks per year on average across North America
• Degree in computer science or a technical discipline
The salary range for this position is $111,260.00 - $183,580.00. Actual offer will be based on your qualifications.
Pay Transparency
● Comprehensive medical, dental, and vision coverage
● Flexible Spending Account - healthcare and dependent care
● Health Savings Account - high deductible medical plan
● Retirement 401(k) with employer match
● Paid time off and holidays
● Paid parental leave plans for all new parents
● Leave benefits including disability, paid family medical leave, and paid military leave
משרות נוספות שיכולות לעניין אותך

What you will do:
Own the resilience testing roadmap for vLLM and llm-d: define resilience indicators, prioritize fault scenarios, and establish go/no-go gates for releases and CI/CD
Design GPU/accelerator-aware fault experiments that target vLLM and the stack beneath it (drivers, GPU Operator/DevicePlugin, NCCL/collectives, storage/network paths, NUMA/topology)
Build an automated harness (preferably extending krkn-chaos (https://github.com/krkn-chaos/krkn) ) to run controlled experiments with scoped blast radius, and evidence capture (logs, traces, metrics)
Integrate fault signals into pipelines (GitHub Actions or otherwise) as resilience gates alongside performance gates
Develop detection and diagnostics: dashboards and alerts for pre-fault signals (e.g., vLLM queue depth, GPU throttling, P2P downgrades, KV-cache pressure, allocator fragmentation)
Triage and root-cause resilience regressions from field/customer issues; upstream bugs and fixes to vLLM and llm-d
Explore and experiment with emerging AI technologies relevant to software development and testing, proactively identifying opportunities to incorporate new AI capabilities into existing workflows and tooling.
Publish learnings (internal/external): failure patterns, playbooks, SLO templates, experiment libraries, and reference architectures; present at internal/external forums
What you will bring:
3+ years in reliability, and/or performance engineering on large-scale distributed systems
Expertise in systems‑level software design
Expertise with Kubernetes and modern LLM inference server stack (e.g., vLLM, TensorRT-LLM, TGI)
Observability & forensics skills with experience with Prometheus/Grafana, OpenTelemetry tracing, eBPF/BPFTrace/perf, Nsight Systems, PyTorch Profiler; adept at converting raw signals into actionable narratives.
Fluency in Python (data & ML), strong Bash/Linux skills
Exceptional communication skills - able to translate raw data into customer value and executive narratives
Commitment to open‑source values and upstream collaboration
The following is considered a plus:
Master’s or PhD in Computer Science, AI, or a related field
History of upstream contributions and community leadership, public talks or blogs on resilience, or chaos engineering
Competitive benchmarking and failure characterization at scale.
The salary range for this position is $127,890.00 - $211,180.00. Actual offer will be based on your qualifications.
Pay Transparency
● Comprehensive medical, dental, and vision coverage
● Flexible Spending Account - healthcare and dependent care
● Health Savings Account - high deductible medical plan
● Retirement 401(k) with employer match
● Paid time off and holidays
● Paid parental leave plans for all new parents
● Leave benefits including disability, paid family medical leave, and paid military leave
משרות נוספות שיכולות לעניין אותך