Ai Computing Performance Architect Perf Analysis Kernel Dev jobs at Nvidia in China, Shanghai
Discover your perfect match with Expoint. Search for job opportunities as a Ai Computing Performance Architect Perf Analysis Kernel Dev in China, Shanghai and join the network of leading companies in the high tech industry, like Nvidia. Sign up now and find your dream job with Expoint
Company (1)
Job type
Job categories
Job title (1)
China
Shanghai
53 jobs found
Yesterday
N
Nvidia Deep Learning Performance Architect China, Shanghai
Analyze performance and efficiency of various machine learning/deep learning algorithms on different architectures. Identify architecture and software performance bottlenecks and propose optimizations. Explore new features and hardware capabilities on deep...
Design, develop, and optimize major layers in LLM (e.g attention, GEMM, inter-GPU communication) for NVIDIA's new architectures. Implement and fine-tune kernels to achieve optimal performance on NVIDIA GPUs. Conduct in-depth...
Lead and manage a high-performing team of software engineers and SRE engineers, guiding their professional growth and project execution while fostering a culture of innovation and excellence. Oversee factory automation...
Analyze state of the art DL networks (LLM etc.), identify and prototype performance opportunities to influence SW and Architecture team for NVIDIA's current and next gen inference products. Develop analytical...
Design, build, and optimize containerized inference execution for LLM applications, ensuring efficiency and scalability. These applications may run in container orchestration platforms like Kubernetes to enable scalable and robust deployment....
Develop and maintain simulation environments built on frameworks like MuJoCo, and Isaac Lab to support robotics research. Implement and test control algorithms and XR teleoperation interfaces for simulated robots. Build...
Design, implement, and optimize scalable ML training pipelines for training multimodal foundation models for robotics. Collaborate with researchers to integrate cutting-edge model architectures into scalable training pipelines. Implement scalable data...
Analyze performance and efficiency of various machine learning/deep learning algorithms on different architectures. Identify architecture and software performance bottlenecks and propose optimizations. Explore new features and hardware capabilities on deep...
Find your dream job in the high tech industry with Expoint. With our platform you can easily search for Ai Computing Performance Architect Perf Analysis Kernel Dev opportunities at Nvidia in China, Shanghai. Whether you're seeking a new challenge or looking to work with a specific organization in a specific role, Expoint makes it easy to find your perfect job match. Connect with top companies in your desired area and advance your career in the high tech field. Sign up today and take the next step in your career journey with Expoint.