Deep Learning Performance Architect jobs at Nvidia in China, Shanghai
Discover your perfect match with Expoint. Search for job opportunities as a Deep Learning Performance Architect in China, Shanghai and join the network of leading companies in the high tech industry, like Nvidia. Sign up now and find your dream job with Expoint.
66 jobs found
27.10.2025
Nvidia System Software Engineer AI Performance Efficiency China, Shanghai
Build internal profiling and analysis tools for performance and power analysis of real-world applications on systems from small to large scale. Build infrastructure or services for data visualization, mining, and management. Work with our users...
Develop highly optimized deep learning kernels for inference. Do performance optimization, analysis, and tuning. Work with cross-collaborative teams across automotive, image understanding, and speech understanding to develop innovative solutions. Occasionally...
Computer Architecture experience in one or more of these focus areas: GPU Architecture, CPU Architecture, Deep Learning, GPU Computing, Parallel Programming, or High-Performance Computing Systems. GPU Computing (CUDA, OpenCL, OpenACC), GPU...
Benchmark, profile, and analyze the performance of AI workloads specifically tailored for large-scale LLM training and inference, as well as High-Performance Computing (HPC) on NVIDIA supercomputers and distributed systems. Aggregate...
Design and develop the architecture, interface and features of the GPU kernel library. Keep improving the quality and performance of the library and its GPU kernels. Explore and expand the...
Writing highly tuned compute kernels to perform core deep learning operations (e.g. matrix multiplies, convolutions, normalizations). Following general software engineering best practices including support for regression testing and CI/CD flows....