You will formulate and execute performance test plans, and investigate performance tuning knobs for cloud infrastructure, on-prem hardware, RHEL, OpenShift, and OpenShift AI. In addition, you will triage and potentially fix performance issues, create new benchmarking tests and automation tools as needed, and share performance results on a regular basis. This role needs an engineer who thinks creatively, adapts to rapid change, and is willing to learn and apply new technologies. You will join a vibrant open source culture and help promote performance and innovation in this Red Hat engineering team.
What you will do
Execute performance and scalability benchmarks against various components of the OpenShift AI platform to drive improvements and detect regressions
Develop tools and automation to aid the performance benchmarking work
Collaborate with other teams to resolve performance issues
Triage, debug, and solve customer cases related to AI performance
Submit performance benchmarking results to industry consortia
Publish results, conclusions, recommendations, and best practices via internal test reports, presentations, and external blogs to support our partners and customers
Participate in internal and external conferences about your work and results
Provide technical leadership and guidance to the wider team
What you will bring
Experience in running performance tests, data capture, data analysis, and visualization
Experience with systems performance engineering and metrics collection tools such as iostat, vmstat, sar, perf, and Prometheus
Experience with container technologies (Podman, Kubernetes, Docker)
Programming experience in Python
Experience working with the Linux operating system (RHEL, Fedora or CentOS preferred)
Experience with AI technologies and frameworks (PyTorch, Transformers, etc.)
Excellent written and verbal language skills in English
The following are considered a plus
Bachelor’s degree or equivalent experience
Knowledge of AI benchmarking suites such as MLPerf
Experience with software-defined storage and networking as they pertain to Kubernetes
Experience working with hardware accelerators such as NVIDIA GPUs
Experience working on an MLOps platform
The salary range for this position is $104,080.00 - $166,320.00. Actual offer will be based on your qualifications.
Pay Transparency
● Comprehensive medical, dental, and vision coverage
● Flexible Spending Account - healthcare and dependent care
● Health Savings Account - high deductible medical plan
● Retirement 401(k) with employer match
● Paid time off and holidays
● Paid parental leave plans for all new parents
● Leave benefits including disability, paid family medical leave, and paid military leave