
Nvidia Senior System Software Engineer Build Test Infrastructure 
China, Guangdong Province, Shenzhen 
86907360

Today
Locations: China, Shenzhen; China, Shanghai
Time type: Full time
Posted on: Posted 5 Days Ago

What you'll be doing:

  • Lead the design and implementation of our cloud CI/CD infrastructure and pipelines. You'll streamline the CI development workflow, providing faster feedback to developers and significantly enhancing their overall experience. Your efforts will be crucial in boosting CI throughput.

  • Debug and resolve complex build errors, caching issues, and remote execution challenges. You'll leverage your deep understanding of Git workflows, Bazel build configurations, and Buildbarn caching/infrastructure to keep our systems running smoothly.

  • Guide the release team through daily integration builds. You'll help streamline development phases, minimize context switching for developers, and ultimately shorten the end-to-end integration duration.

  • Apply your full-stack development expertise to build and maintain critical infrastructure tools and services. This includes creating intuitive UI components for large-scale builds, code quality metrics, and architectural reporting, ensuring transparency and efficiency.

  • Develop robust processes and tools for crash analysis. You'll enable immediate insights into system crashes by integrating automated signal handlers and controlled crash logging in debug builds.

  • Integrate cutting-edge AI concepts like Prompt Engineering, Fine-tuning, MCP, RAG, and Agent technologies to develop and deploy powerful AI applications within our infrastructure.

  • Continuously research and embrace new technologies to enhance and evolve our build and test systems, keeping us at the forefront of innovation.

What we need to see:

  • BS or MS in Computer Science, Computer Engineering, Electrical Engineering, or a related field, or equivalent experience.

  • Over 10 years of comprehensive software development experience spanning various architectures and systems, with a demonstrated track record of effectively bringing projects from feasibility to completion.

  • Expertise in a wide range of programming languages, including C++, Python, Golang, Rust, JavaScript, and Groovy.

  • Deep understanding of C/C++ compilation, linking, and toolchains, especially in complex cross-compilation environments (e.g., AArch64 vs. Linux x86).

  • Profound knowledge of Bazel rules and hands-on experience implementing custom build solutions. You should also have direct experience with Buildbarn or similar remote caching and execution infrastructure.

  • Demonstrated success in engineering distributed build systems and CI pipelines on platforms like Kubernetes and Jenkins. You'll have a proven ability to craft and maintain robust CI/CD pipelines for automated builds, comprehensive testing, coverage analysis, and early regression detection.

  • Strong debugging capabilities, including proficiency with local and remote GDB sessions, stack trace analysis, signal handling, and crash logging.

  • A solid understanding of AI-related technologies, including Prompting, Fine-tuning, MCP, RAG, and Agent-based systems, along with their practical application in development and deployment scenarios.

  • A passion for continuous learning and research into new technologies, evidenced by a well-structured personal knowledge management system.