

Share
doing:
Developing scalable library software using modern tools and languages for various numerical method.
Performance tuning, optimization, and benchmarking of algorithms on various architectures.
Working closely with leadership team and other internal and external partners to understand feature and performance requirements and contribute to the technical roadmaps of libraries.
Providing technical leadership and guidance to library engineers working with you.
Find opportunities to improve user experience and library performance.
What we need to see:
PhD or MSc’s degree in Computational Science, Computer Science, Applied Math, or related science or engineering field of study is preferred (or equivalent experience).
5+ years experience developing, debugging, and optimizing high-performance parallel numerical applications on modern computing platforms, with GPU acceleration using CUDA.
C/C++ programming and software development skills.
Proven experience in leading and completing software development projects.
Strong collaboration, communication, and documentation habits.
Ways to stand out from the crowd:
Good knowledge of CPU and/or GPU hardware architecture,
Experience with software development practices such as CI/CD systems and project management tools such as JIRA,
Experience with working in a distributed organization,
Debugging, profiling, and testing skills for accuracy and performance,
Fluency with Python.
These jobs might be a good fit

Share
What you’ll be doing:
Collaborate closely with our partners and the open-source community to deliver their flagship models as highly optimized NVIDIA Inference Microservices (NIM).
Research and develop innovative deep learning methodologies to accurately evaluate new model families across diverse domains.
Analyze, influence, and enhance AI/DL libraries, frameworks, and APIs, ensuring consistency with the best engineering practices.
Research, prototype, and build robust tools and infrastructure pipelines to support our ground-breaking AI initiatives.
What we need to see:
BS, MS, or PhD in Computer Science, AI, Applied Math, or a related field, or equivalent experience.
10+ years of hands-on experience in AI for natural language processing (NLP) and large language models (LLMs).
Strong problem-solving, debugging, performance analysis, test design, and documentation skills.
Solid mathematical foundations and expertise in AI/DL algorithms.
Excellent written and verbal communication skills, with the ability to work both independently and collaboratively in a fast-paced environment.
Ways to stand out from the crowd:
Experience in accuracy evaluation of LLMs (OpenLLM Leaderboard or HELM).
Hands-on experience with inference and deployment environments like TensorRT, ONNX, or Triton.
Passion for DevOps/MLOps practices in deep learning product development.
Experience running large-scale workloads in high-performance computing (HPC) clusters.
Strong understanding of Linux environments and containerization technologies like Docker.
These jobs might be a good fit

Share
doing:
Developing scalable library software using modern tools and languages for various numerical method.
Performance tuning, optimization, and benchmarking of algorithms on various architectures.
Working closely with leadership team and other internal and external partners to understand feature and performance requirements and contribute to the technical roadmaps of libraries.
Providing technical leadership and guidance to library engineers working with you.
Find opportunities to improve user experience and library performance.
What we need to see:
PhD or MSc’s degree in Computational Science, Computer Science, Applied Math, or related science or engineering field of study is preferred (or equivalent experience).
5+ years experience developing, debugging, and optimizing high-performance parallel numerical applications on modern computing platforms, with GPU acceleration using CUDA.
C/C++ programming and software development skills.
Proven experience in leading and completing software development projects.
Strong collaboration, communication, and documentation habits.
Ways to stand out from the crowd:
Good knowledge of CPU and/or GPU hardware architecture,
Experience with software development practices such as CI/CD systems and project management tools such as JIRA,
Experience with working in a distributed organization,
Debugging, profiling, and testing skills for accuracy and performance,
Fluency with Python.
These jobs might be a good fit