Red Hat Principal Software Quality Engineer, InstructLab
United States, Massachusetts, Boston 
817271998

25.09.2024

Primary Job Responsibilities

  • Execute manual and automated tests for InstructLab and deliver clear status updates in a timely manner.

  • Explore, identify, and document unwanted behavior, output, and bias in InstructLab models. Advocate for the resolution of critical issues and communicate the impact on customers to the development team(s).

  • Monitor and participate in upstream AI/ML communities, evaluating new AI/ML-related technologies in the space and considering potential integrations and collaborations upstream.

  • Conduct new feature research and design test cases with emphasis on model performance, scalability, automation, and bias reduction.

  • Design, develop, and maintain automation frameworks, scripts, and performance benchmarking tools with Python for testing and implementation.

  • Run technical initiatives that grow, improve, and scale our existing processes.

  • Contribute to the development of the open-source projects that comprise Red Hat’s AI family of products.

  • Regularly communicate with project stakeholders including other teams of Red Hat engineers, product managers, consultants, management, and senior leadership.

Required Skills

  • 3+ years of experience as a Software Quality Engineer, Software Engineer, or in a similar role (5+ years of experience for Senior Software Quality Engineer).

  • Extensive experience scripting and creating automation in Python and Bash.

  • Experience with AI and machine learning platforms, tools, and frameworks, such as PyTorch, llama.cpp, vLLM, FSDP, DeepSpeed, Kubeflow, and TensorFlow.

  • Experience creating automation for GitHub, using GitHub Actions or related continuous integration tools.

  • Experience with hardware accelerators and their software platforms, such as GPUs, CUDA, and ROCm.

  • Experience developing unit, functional, and end-to-end (E2E) test cases and automation.

  • Ability to quickly learn and use new tools and technologies.

Nice to Haves

  • Experience working with Kubernetes/OpenShift and containers, troubleshooting issues with them, and working with YAML, Kubernetes controllers, and operators.

  • Understanding of DevOps methodology, Scrum, and/or Jira.

  • Bachelor's degree or higher in computer science or a related discipline, or equivalent years of experience.

The salary range for this position is $127,890.00 - $211,180.00. Actual offer will be based on your qualifications.

Pay Transparency

● Comprehensive medical, dental, and vision coverage

● Flexible Spending Account - healthcare and dependent care

● Health Savings Account - high deductible medical plan

● Retirement 401(k) with employer match

● Paid time off and holidays

● Paid parental leave plans for all new parents

● Leave benefits including disability, paid family medical leave, and paid military leave