Expoint – all jobs in one place
מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

דרושים Gpu Graphics Performance Architect - Ncg ב-אנבידיה ב-China, Shanghai

מצאו את ההתאמה המושלמת עבורכם עם אקספוינט! חפשו הזדמנויות עבודה בתור Gpu Graphics Performance Architect - Ncg ב-China, Shanghai והצטרפו לרשת החברות המובילות בתעשיית ההייטק, כמו Nvidia. הירשמו עכשיו ומצאו את עבודת החלומות שלך עם אקספוינט!
חברה (1)
אופי המשרה
קטגוריות תפקיד
שם תפקיד (1)
China
Shanghai
נמצאו 82 משרות
24.11.2025
N

Nvidia Deep Learning Performance Architect - New College Grad China, Shanghai

Limitless High-tech career opportunities - Expoint
Analyze state of the art DL networks (LLM etc.), identify and prototype performance opportunities to influence SW and Architecture team for NVIDIA's current and next gen inference products. Develop analytical...
תיאור:
China, Shanghai
China, Beijing
time type
Full time
posted on
Posted 4 Days Ago
job requisition id

What you’ll be doing:

  • Analyze state of the art DL networks (LLM etc.), identify and prototype performance opportunities to influence SW and Architecture team for NVIDIA's current and next gen inference products.

  • Develop analytical models for the state of the art deep learning networks and algorithm to innovate processor and system architectures design for performance and efficiency.

  • Specify hardware/software configurations and metrics to analyze performance, power, and accuracy in existing and future uni-processor and multiprocessor configurations.

  • Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software, and product teams.

What we need to see:

  • BS or higher degree in a relevant technical field (CS, EE, CE, Math, etc.).

  • Strong programming skills in Python, C, C++.

  • Strong background in computer architecture.

  • Experience with performance modeling, architecture simulation, profiling, and analysis.

  • Prior experience with LLM or generative AI algorithms.

Ways to stand out from the crowd:

  • GPU Computing and parallel programming models such as CUDA and OpenCL.

  • Architecture of or workload analysis on other deep learning accelerators.

  • Deep neural network training, inference and optimization in leading frameworks (e.g. Pytorch, TensorRT-LLM, vLLM, etc.).

  • Open-sourceAIcompilers (OpenAI Triton, MLIR, TVM, XLA, etc.).

and proud to be an

Show more
24.11.2025
N

Nvidia Performance Engineer Intern Deep Learning HPC - China, Shanghai

Limitless High-tech career opportunities - Expoint
Benchmark, profile, and analyze the performance of AI workloads specifically tailored for large-scale LLM training and inference, as well as High-Performance Computing (HPC) on NVIDIA supercomputers and distributed systems. Aggregate...
תיאור:
China, Shanghai
time type
Full time
posted on
Posted 4 Days Ago
job requisition id

You will be part of global Performance Lab team, improving our capacity to expertly and accurately benchmark state-of-the-art datacenter applications and products. We also work to develop infrastructures and solutions that enhance the team’s ability to gather data through automation and designing efficient processes for testing a wide variety of applications and hardware. The data that we collect drives marketing/sales collaterals as well as engineering studies for future products. You will have the opportunity to work with multi-functional teams and in a dynamic environment where multiple projects will be active at once and priorities may shift frequently.

What you’ll be doing:

  • Benchmark, profile, and analyze the performance of AI workloads specifically tailored for large-scale LLM training and inference, as well as High-Performance Computing (HPC) on NVIDIA supercomputers and distributed systems.

  • Aggregate and produce written reports with the testing data for internal sales, marketing, SW, and HW teams.

  • Develop Python scripts to automate the testing of various applications.

  • Collaborate with internal teams to debug and improve performance issues.

  • Assist with the development of tools and processes that improve our ability to perform automated testing.

  • Setup and configure systems with appropriate hardware and software to run benchmarks.

What we need to see:

  • Currently pursuing a bachelor's degree (or higher) in Computer Science, Electrical Engineering, or a related field.

  • Experienced in programming and debugging with scripting languages such as Python or Unix shell.

  • Strong data analysis skills and the ability to summarize findings in a written report.

  • Hands-on experience with Linux based systems. Familiarity using a container platform such as Docker or Singularity. Experience with compiling and running software from source code.

  • Good English verbal and written skills to improve collaboration with coworkers.

  • Fast and self-learning capabilities.

Ways to stand out from the crowd:

  • Experience with CI/CD pipelines and modern DevOps practices. Familiar with cloud provisioning and scheduling tools (Kubernetes, SLURM).

  • Curiosity about GPUs, TPUs, cloud and performance benchmarking.

  • Familiar with ML/DL techniques, algorithms and frameworks like TensorFlow or PyTorch. Experience in AI model inference deployment and training launching.

  • Background of system-level problem solving.

Show more

משרות נוספות שיכולות לעניין אותך

22.11.2025
N

Nvidia Graphics Tools Software Engineering Intern - China, Shanghai

Limitless High-tech career opportunities - Expoint
Create developer tools features for NVIDIA GPUs that enables developers to quickly iterate on optimizations to build fast graphics applications. Write fast, effective, maintainable, reliable and well documented code. Effectively...
תיאור:
China, Shanghai
time type
Full time
posted on
Posted 2 Days Ago
job requisition id

What you'll be doing:

As a valued member of the team, you will be involved in the technical design and implementation of numerous features working in an agile environment. In this role you can expect to:

  • Create developer tools features for NVIDIA GPUs that enables developers to quickly iterate on optimizations to build fast graphics applications.

  • Write fast, effective, maintainable, reliable and well documented code.

  • Effectively estimate and prioritize tasks in order to build a realistic delivery schedule.

  • Provide peer reviews to other engineers including feedback on performance, scalability and correctness.

  • Drive technology discussions and provide valuable feedback about the architecture as a domain expert.

  • Document requirements and designs, and review documents with stakeholders.

  • Demonstrate growth in technical and non-technical abilities.

  • Meet with the QA Department to develop a test plan for new features.

What we need to see:

  • Pursuing BS or MS degree in one of the areas of Electrical Engineering, Computer Engineering, Computer Science.

  • Excellent C++ programming skills and ability to articulate key aspects of Object-Oriented Programming.

  • Proficient in at least one graphics programming API such as Direct3D, OpenGL and Vulkan.

  • Able to work effectively with a team of engineers in a fast paced and dynamic environment.

  • Excellent written and verbal communication skills.

  • Able to estimate effectively to ensure delivery of software on time.

Ways to stand out from the crowd:

  • Knowledge of 3D Graphics Algorithms and GPU Architectures.

  • Strong grasp of heterogeneous computing, multithreading and a deep understanding of streaming multiprocessors, warp scheduling etc...

  • Experience with GPU low-level performance tuning/optimization, including profiling and debugging.

  • Solid understanding of User Experience (UX) design, GUI development and the Qt framework is a huge plus.

Show more

משרות נוספות שיכולות לעניין אותך

22.11.2025
N

Nvidia GPU Compiler LLVM Backend Intern - China, Shanghai

Limitless High-tech career opportunities - Expoint
Understand, modify, and improve an NVIDIA proprietary GPU compiler and assembler written in C++. Design and develop new compiler passes and optimizations to produce best-in-class, robust, supportable compiler and tools....
תיאור:
China, Shanghai
time type
Full time
posted on
Posted 5 Days Ago
job requisition id

What You Will Be Doing:

  • Understand, modify, and improve an NVIDIA proprietary GPU compiler and assembler written in C++

  • Design and develop new compiler passes and optimizations to produce best-in-class, robust, supportable compiler and tools

  • Work with global compiler, hardware and application teams to oversee improvements and problem resolutions

  • Be part of a team that is at the center of deep-learning compiler technology spanning architecture design and support through functional languages

What We Need To See:

  • Pursuing B.S. or higher degree in Computer Science/Engineering

  • Excellent hands-on C++ programming skills

  • Strong background in software engineering principles with a focus on crafting robust and maintainable solutions to challenging problems

  • Good communication and documentation skills and self-motivated

Ways To Stand Out From The Crowd:

  • Masters or PhD preferred, with some compiler code generation experience

  • Experience in developing CUDA, DirectX, OpenGL/Vulkan applications

  • Experience in performance analysis and compile time analysis tools

  • Exposure to various ML techniques

Show more

משרות נוספות שיכולות לעניין אותך

17.11.2025
N

Nvidia Senior GPU Communications Libraries Test Development Enginee... China, Shanghai

Limitless High-tech career opportunities - Expoint
Be responsible for running test cases to validate NVIDIA GPU Communications Libraries (NCCL, NVSHMEM, UCX, GDRCopy, GPUDirect RDMA etc). Be responsible to automate test cases and maintain the automation scripts....
תיאור:
China, Shanghai
time type
Full time
posted on
Posted 6 Days Ago
job requisition id

What you’ll be doing:

  • Be responsible for running test cases to validate NVIDIA GPU Communications Libraries (NCCL, NVSHMEM, UCX, GDRCopy, GPUDirect RDMA etc).

  • Be responsible to automate test cases and maintain the automation scripts.

  • Collaborate with Developer, PM, marketing, and engineering teams on crafting test plan and implementing validation.

  • You will assist in the architecture, crafting and implementing of SWQA test frameworks.

  • Be responsible for code coverage improvement and code complexity optimization.

What we need to see:

  • BS or higher degree in CS/EE/CE or equivalent experience

  • 5+ years of relevant experience

  • Seasoned software QA or software testing background; test infrastructure and strong analysis skills

  • Be proficient in scripting language (Python, Perl, bash)

  • Solid experience with AI development tools for test development and automation

  • Knowledge of basic networking concepts

  • UNIX/Linux experience is required

  • Experiences in C/C++ is required

  • Ability to work independently and leadership skillsas well as experience in using quality mindset to drive improvements

  • Proficient oral and written English

Ways to stand out from the crowd:

  • Experience with CUDA programming and NVIDIA GPUs

  • Knowledge of high-performance networks like InfiniBand, RoCE,etc

  • Experience with CSPs(AWS, Google Cloud, Oracle Cloud Infrastructure, Microsoft Azure), andHPC cluster,slurm, ansible, etc

  • Prior experience with virtualization technologies (KVM, HyperV, VMWARE, OpenStack, Docker, Kubernetes)

  • Experience with Deep Learning Frameworks such as PyTorch, TensorFlow, etc

Show more

משרות נוספות שיכולות לעניין אותך

16.11.2025
N

Nvidia Director - Applied Neural Graphics China, Shanghai

Limitless High-tech career opportunities - Expoint
Establish and scale NVIDIA's neural graphics organization in China, building research and engineering teams. Define and implement our neural graphics strategy for China and its integration into NVIDIA's graphics platform....
תיאור:
China, Shanghai
time type
Full time
posted on
Posted 6 Days Ago
job requisition id

NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for more than 25 years, driven by exceptional technology and outstanding individuals. Today, we're tapping into the unlimited potential of AI to define the next era of computing, where our GPU acts as the brains of immersive digital worlds and creative applications. As an NVIDIAN, you'll be immersed in a diverse, encouraging environment where everyone is inspired to do their best work.

What you'll be doing
  • Establish and scale NVIDIA's neural graphics organization in China, building research and engineering teams

  • Define and implement our neural graphics strategy for China and its integration into NVIDIA's graphics platform

  • Build strategic partnerships with China's leading VLM and neural graphics research institutions and talent

  • Lead recruitment of exceptional talent in neural rendering, vision-language models, and differentiable programming

  • Drive research-to-product pipeline, translating China's VLM innovations into shipping products and platforms globally

  • Architect system-level integration of neural graphics technologies across NVIDIA's platform ecosystem

  • Represent NVIDIA across the world's graphics and AI research community, establishing thought leadership

  • Partner with executive leadership to develop company-wide neural graphics strategy and roadmap

What we need to see
  • Degree in Computer Science, Computer Graphics, Machine Learning, or equivalent experience that is outstanding

  • 12+ years experience, 6+ years leading teams

  • Deep connections in the neural graphics research community

  • Proven track record of building organizations and scale teams in fast paced technical domains

  • Strong understanding of ML frameworks, VLM architectures, and AI technologies with experience bringing them to production

  • Experience defining platform strategies with demonstrated research-to-product transfers and measurable business impact across global markets

  • Excellent interpersonal and communication skills to lead across research, product, and executive teams in both Chinese and English

Ways to stand out from the crowd
  • Established thought leadership in neural graphics, VLMs, or AI-powered rendering with industry recognition

  • Experience building successful research-industry partnerships in China's AI ecosystem

  • Track record shipping graphics or AI platforms used by developers worldwide

  • Conference presentations, publications, or awards in graphics, VLMs, or AI

  • Experience establishing new technical organizations or regional presences

Show more

משרות נוספות שיכולות לעניין אותך

16.11.2025
N

Nvidia GPU Power Analysis Engineer - New College Grad China, Shanghai

Limitless High-tech career opportunities - Expoint
Use internally developed tools and industry standard pre-silicon gate-level and RTL power analysis tools, to help improve product power efficiency. Develop and share best practices for performing pre-silicon power analysis,...
תיאור:
China, Shanghai
time type
Full time
posted on
Posted 5 Days Ago
job requisition id

What you'll be doing:

  • Use internally developed tools and industry standard pre-silicon gate-level and RTL power analysis tools, to help improve product power efficiency.

  • Develop and share best practices for performing pre-silicon power analysis, Enhance internal power tools and automate best practices

  • Perform comparative power analysis, to spot trends and anomalies, that warrant more scrutiny.

  • Interact with architects and RTL designers to help them interpret their power data and identify power bugs; drive them to implement fixes.

  • Select and run a wide variety of workloads for power analysis, Collaborate with performance and architecture teams to validate performance of the workloads

  • Prototype a new architectural feature in Verilog and analyze power.

What we need to see:

  • EE, MS or PhD in related fields, or equivalent experience.

  • Basic understanding of concepts of energy consumption, estimation, and low power design.

  • Familiarity with Verilog and ASIC design principles, including knowledge of logic cells.

  • Good verbal/written English and interpersonal skills; much collaboration with design teams is expected.

  • Strong coding skills, preferably in Python, C++.

  • Ability to formulate and analyze algorithms, and comment on their time complexity and memory consumption.

  • Desire to bring data-driven decision-making and analytics to improve our products.

Ways to stand out from the crowd:

  • Familiar with the power tools/flow development is a big plus

Show more

משרות נוספות שיכולות לעניין אותך

Limitless High-tech career opportunities - Expoint
Analyze state of the art DL networks (LLM etc.), identify and prototype performance opportunities to influence SW and Architecture team for NVIDIA's current and next gen inference products. Develop analytical...
תיאור:
China, Shanghai
China, Beijing
time type
Full time
posted on
Posted 4 Days Ago
job requisition id

What you’ll be doing:

  • Analyze state of the art DL networks (LLM etc.), identify and prototype performance opportunities to influence SW and Architecture team for NVIDIA's current and next gen inference products.

  • Develop analytical models for the state of the art deep learning networks and algorithm to innovate processor and system architectures design for performance and efficiency.

  • Specify hardware/software configurations and metrics to analyze performance, power, and accuracy in existing and future uni-processor and multiprocessor configurations.

  • Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software, and product teams.

What we need to see:

  • BS or higher degree in a relevant technical field (CS, EE, CE, Math, etc.).

  • Strong programming skills in Python, C, C++.

  • Strong background in computer architecture.

  • Experience with performance modeling, architecture simulation, profiling, and analysis.

  • Prior experience with LLM or generative AI algorithms.

Ways to stand out from the crowd:

  • GPU Computing and parallel programming models such as CUDA and OpenCL.

  • Architecture of or workload analysis on other deep learning accelerators.

  • Deep neural network training, inference and optimization in leading frameworks (e.g. Pytorch, TensorRT-LLM, vLLM, etc.).

  • Open-sourceAIcompilers (OpenAI Triton, MLIR, TVM, XLA, etc.).

and proud to be an

Show more
בואו למצוא את עבודת החלומות שלכם בהייטק עם אקספוינט. באמצעות הפלטפורמה שלנו תוכל לחפש בקלות הזדמנויות Gpu Graphics Performance Architect - Ncg בחברת Nvidia ב-China, Shanghai. בין אם אתם מחפשים אתגר חדש ובין אם אתם רוצים לעבוד עם ארגון ספציפי בתפקיד מסוים, Expoint מקלה על מציאת התאמת העבודה המושלמת עבורכם. התחברו לחברות מובילות באזור שלכם עוד היום וקדמו את קריירת ההייטק שלכם! הירשמו היום ועשו את הצעד הבא במסע הקריירה שלכם בעזרת אקספוינט.