What you’ll be doing:
Develop test plan and orchestrate testing for Compute software releases on all new compute architecture platforms including Tesla GPUs, Nvidia turnkey systems and OEM systems.
Develop a robust test infrastructure incorporating advanced AI tools to significantly enhance our testing capabilities and streamlining operations for more efficient and accurate results.
Improve code coverage, elevating the overall quality of our codebase and reliability of our testing processes and develop roadmaps prioritizing software development schedule for full life-cycle of tool development, test, and deployment
Collaborate across teams to identify new features and lead developers in definition, automation implementation, and productization of those features in timely manner
Build and operate key pieces of a complete infrastructure for automation framework development, as well as, lead and develop automation support and participate in automation of manual test cases, working closely with automation infrastructure
Focus on an efficient customer experience by improving both usability and ease to attain optimal performance
Test both software functionality and internal code/structure and run regression tests for existing CUDA/Driver features.
Work in a dynamic agile software development team with very high production quality standards.
What we need to see:
BS or MS in Engineering (or equivalent experience) with 5+ years testing SW development cycle.
Solid understanding of embedded systems, Linux, Python, C and C++.
Very good knowledge of Linux and Windows packages.
Experience with rpm/deb/run packages is a big plus
Proven experience with AI tools for automation and test plan development directly applied to daily tasks. This expertise is crucial for enhancing performance, developing robust frameworks, and increasing test coverage.
Strong technical skills, with deep understanding of orchestration & automation systems, data centers and cloud architecture combined.
Solid understanding in QA methodology and pay attention to details.
Knowledge in Cluster and cluster management
Experience in developing test strategies, high quality test plans and test execution
Proficient in building test setups and fine tuning in HW and SW
Ways to stand out from the crowd:
Expertise in packaging software in Linux (rpms, debs) and knowledge in Linux distribution (Centos, Ubuntu, SLES, RedHat, Fedora)
Apply AI-powered tools to improve efficiency and quality, including test case/plan/script generation, defect detection, CBTP, bug fixing and day to day assistance
Experience with Configuration and deployment management (Ansible), Containers (Docker) and Virtualization infrastructure software (Xen, KVM)
Good understanding of C/C++ toolchain in Linux including cross-compilation (C, C++, automake/autoconf, cmake, meson).
Background with parallel programming, ideally CUDA C/C++ and OpenACC
You will also be eligible for equity and .
משרות נוספות שיכולות לעניין אותך