

Serve as the primary technical expert between NVIDIA and our customers, understanding their technology and provide the best AI solutions/ guidance on training process in terms of tools and methodology
Build proof-of-concepts and demonstrations that highlight the power of NVIDIA AI platforms in robotics
Partner with developers, researchers, technology specialists, IT professionals, and executives to facilitate the integration of NVIDIA technology
Work with Engineering, Product and Sales teams to develop, plan best suitable solutions for customers. Enable development and growth of product features through customer feedback and proof-of-concept evaluations.
MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering fields.
Deep expertise in AI/Deep Learning, with hands-on experience in training or optimizing VLMs for production
Expertise with deep learning frameworks for training VLMs (PyTorch, Nemo), and/or experience with such model's optimization methods and tools (TensorRT and Triton Inference Server).
Excellent verbal, written communication, and technical presentation skills in English.
5+ years' work or research experience with Python/ C++ / other software development
AI passionate with a growth mindset, ability to collaborate effectively with different teams (Engineering, Product, Sales, Marketing) in a rapid evolving environment while continuously learning and sharing insights.
Familiarity with Cosmos (e.g. Cosmos-Reason) and Isaac GR00T family of models.
Familiarity with robotics simulation environments (Isaac Sim, Isaac Lab, MuJoco, etc.).
Experience in large scale training / understanding of modern model, data, expert, context parallel training techniques.
Track record in Neural Networks inference optimization for Physical AI use cases
משרות נוספות שיכולות לעניין אותך

What you will be doing:
Be an inspiring leader on integrating NVIDIA technology into IT architectures to support scientific and engineering applications
Conduct regular technical customer meetings to discuss projects, present product portfolio and roadmaps, and deliver technical training
Manage technical aspects of complex datacenter solutions and deployments, including design-in opportunities and responding to RFP/RFI proposals, and interact with NVIDIA OEM partners to understand their product roadmap and positioning using NVIDIA technology
Interact with end-users in academia and industry, develop a keen understanding of their goals and needs, define and deliver high-value solutions that meet these needs
Communicate customer requirements to NVIDIA Engineering to foster product improvements
What we need to see:
Passion for HPC and AI
A postgraduate degree in a STEM related discipline or equivalent experience
5 years in the technical pre-sales of datacenter solutions
Significant expertise in complex system architecture and network topologies, GPU computing, parallel filesystems, cluster operations, workload schedulers, datacenter engineering, etc.
Action oriented with strong analytical skills, good organization skills to work in a heavily multi-tasked environment, self-motivated to work independently with minimal day-to-day direction
Strong collaboration and social skills, ability to communicate effectively with customers and across organizations (Engineering, Sales, Support), fluent in English both oral and written
Ways to stand out from the crowd:
Experience with NVIDIA software platforms for HPC and AI like CUDA, DOCA, NeMo, NIM, etc.
Experience working on EuroHPC-class procurements, benchmarks, performance analysis and projections

and pharmafocusing on the ground-breaking potential of Large Language Models (LLMs) and generative AI. At NVIDIA, we have a group of exceptional developers and scientists who thrive on working with the latest GPU hardware and software. Our platform for, and start-ups.
individual tous in exploring new opportunities in the AI-powereddigital biologyrevolution. As a Solution Architect, you willwork asindustries, who envision accelerated computing and artificial intelligence as a transformative force in their field. Join us on this thrilling journey and advance your career while empowering top organizations and institutions worldwide.
What you will be doing:
GenAI adoption—from requirements gathering and proof-of-concept development to deployment, integration,benchmarkingand ongoing optimization
Collaborate with our business/account team toidentifytechnical needs,customer goals,andstrategies.Your responsibilities will include enabling customer adoption of NVIDIA technologyby mappingour solutions to their use casesand driving positive relationships with our technology partners, making NVIDIA an integral part of end-user solutions.
Keep up to date on AI advancementsindrug discoveryincluding agentic AI techniques, foundation models for protein, small molecules, and genomics,as well asrelevantNVIDIA technology thatenable thisinnovation.
Design, develop, andoptimizesolutions tailored for healthcare and life science applications, such as AI scientists and autonomous lab or drug discovery.
Engaging with developers, researchers, data scientists, IT managers, and senior leadersinternally and externallyis an essential part of the Solutions Architect role to gain experience in various technical areas.
Document what you know and teach others. This can vary from building targeted training for partners and other Solutions Architects to writing whitepapers, blogs, and wiki articles to simply working through hard problems with a customer on a whiteboard.
What we need to see:
MS or PhD (or equivalent experience) inComputer Science,Computational Biology, Computational ChemistryorComputational Physics,orrelated fieldswith strong applied experience in these domains.
5+ years of work-related experiencewith hands-onexpertiseinAI/MLforhealthcare or life sciences.
Proven experience withPython and AI/ML frameworks (PyTorch,Langchain, or building custom framework)and application to scientific questions.
Strong time-management and organizational skills for coordinating multiple initiatives, priorities, and implementations ofnew technologyand products intovery complexprojects.
Motivated self-starter with an equal balance of strong problem-solving skills and customer-facing communication skills -especially in effectively presenting complex technical information.Mustenjoy engaging with innovative individuals,continuouslearning, and staying at the forefront of the field.
Ways to stand out from the crowd:
Experience building, deploying, andoptimizingagentic AI systems for healthcare and lifesciencesend-to-end includingdata ingestion, preprocessing, model training, agentic tool development, pipelinedeploymentand evaluation,especially for scientific software vendors and data platforms.
Experiencedeveloping,trainingand customizing Transformer models for healthcare and life sciences applications, especially using libraries like Transformer EngineorMegatron-LM.
Familiarity with AI deployment/inference technologies such as TensorRT, TRT-LLM.
Experience deploying and scaling agentic AI solutions in cloud environments (AWS Bedrock, Azure AI foundry, Vertex AI, etc).
Experience in the pharmaceutical industryorestablishedthought leadership through publications or presentations on AI/ML applications in healthcare and life science.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices)on the basis of, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us.

What you'll be doing:
Work with financial institutions and their technology ecosystem in leveraging NVIDIA's advanced technologies
Provide technical guidance and support to internal teams and external stakeholders, ensuring effective ideation, research and developing solution prototypes, followed by a successful deployment.
Work closely with product management, engineering, applied research and sales teams to develop and deliver comprehensive solutions.
Stay updated on industry trends and advancements in hardware technologies to continually improve and innovate NVIDIA's software and data center solutions.
What we need to see:
BS/MS/PhD degree in Machine Learning, Computer Science, or related technical field.
Minimum of 5 years of experience in AI and accelerated technologies.
Background working within Financial Services firms.
Expertise in writing code for training and/or inference for NVIDIA GPUs.
Capable of working in a constantly evolving environment without losing focus.
Self-starter with a passion for continuous learning and sharing findings across the team.
Ways to stand out from the crowd:
Demonstrated understanding of how GPU acceleration can be applied to financial workloads.
Knowledge of software-defined infrastructure and orchestration tools (Kubernetes, OpenStack).
Proficiency in cloud platforms (AWS, Azure, Google Cloud) and hybrid cloud solutions.
Understanding of HPC systems: data center design, high speed interconnect InfiniBand, Cluster Storage and Scheduling related design and/or management experience.

Collaborating with NVIDIA’s training framework developers and product teams to stay ahead of the latest features and help partners to adopt them effectively.
Assisting with deployment, debugging, and improving the efficiency of AI workloads on extensive NVIDIA platforms.
Benchmarking new framework features, analyzing performance, and sharing actionable insights with both customers and internal teams.
Working directly with external customers to solve cluster performance and stability issues, identify bottlenecks, and implement effective solutions.
Build expertise and guide customers in scaling workloads efficiently and reliably on the latest generation of NVIDIA GPUs.
Contributing to Europe’s Sovereign AI initiative by helping customers implement advanced resiliency features within AI training pipelines.
BS, MS, PhD or equivalent experience in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or a related engineering field—or equivalent practical experience.
8+ years of experience in accelerated computing technologies at cluster scale, ideally including work with NVIDIA platforms.
Strong programming skills in at least one of the following languages: C, C++, or Python.
Practical experience identifying and resolving bottlenecks in large-scale training workloads or parallel applications.
Hands-on experienced in profiling and debugging large parallel applications.
Solid understanding of CPU and GPU architectures, CUDA, parallel filesystems, and high-speed interconnects.
Experienced in working with large compute clusters with an understanding of their internal scheduling and resource management mechanisms (e.g. SLURM or Cloud based clusters).
Proficient knowledge of training pipelines and frameworks, encompassing their internal operations and performance attributes.
Experience in debugging training pipelines running on thousands of GPUs in production environment.
Hands-on experience with performance profiling and optimizations using tools like Nsight Systems, Nsight Compute and good understanding of NCCL, MPI and low-level communication libraries.
Ability to debug stability issues across the entire stack: parallel application, training frameworks, runtime libraries, schedulers, and hardware.
Solid understanding of the internal workings of LLM frameworks such as PyTorch, Megatron-LM, or NeMo, and how they affect compute layers like CPUs, GPUs, network and storage or understanding of inference tools such as vLLM, Dynamo, TensorRT-LLM, RedHat Inference Server or SGLang.

What you will be doing:
Work directly with key customers to understand their technology and provide the best AI solutions.
Perform in-depth analysis and optimization to ensure the best performance on GPU architecture systems (in particular Grace/ARM based systems). This includes support in optimization of large scale inference pipelines.
Partner with Engineering, Product and Sales teams to develop, plan best suitable solutions for customers. Enable development and growth of product features through customer feedback and proof-of-concept evaluations.
What we need to see:
Excellent verbal, written communication, and technical presentation skills in English.
MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics,other Engineeringfields.
5+ years workor research experience with Python/ C++ / other software development
Work experience and knowledge of modern NLPincluding good understandingof transformer, state space, diffusion, MOE model architectures. This can includeeither expertise intraining oroptimization/compression/operationof DNNs.
Understanding of key libraries used for NLP/LLM training (such as Megatron-LM,NeMo, DeepSpeed etc.)and/or deployment(e.g. TensorRT-LLM, vLLM,Triton Inference Server).
Enthusiastic about collaborating with various teams and departments—such as Engineering, Product, Sales, and Marketing—this person thrives in dynamic environments and stays focused amid constant change.
Self-starter with demeanor for growth, passion forcontinuous learning andsharing findings across the team.
Ways to Stand Out from The Crowd:
Demonstrated experience in running and debugging large-scale distributed deep learning training or inferenceprocesses.
Experience working with larger transformer-based architectures for NLP, CV, ASR or other.
Applied NLP technology in productionenvironments.
Proficient with DevOps tools including Docker, Kubernetes, andSingularity.
Understanding of HPC systems: data center design, high speed interconnect InfiniBand, Cluster Storage and Scheduling related design and/or management experience.

What you will be doing:
Work on the architecture for new media solutions targeting the broadcasting industry.
Develop and maintain SW technologies for enabling and supporting NVIDIA's GPU and DPU hardware.
Lead an entire software lifecycle
Work with and influence other internal worldwide teams (AI, software, hardware, OEM support).
Represent NVIDIA at Industry Standard bodies.
What we need to see:
Bachelor's/Masters in Computer Science, Computer Engineering, or Electrical Engineering, or equivalent experience.
12+ years "hands on" experience developing and architecting solutions for the broadcasting industry.
Knowledge in SMPTE 2110, AMWA NMOS and related standards.
Cloud-focused experience (containers, Kubernetes, OAuth, building applications for cloud)
Strong software engineering skills combined with a drive to solve hard problems are a must.
Excellent programming skills in Python, C or C++.
Ways to stand out from the crowd:
Strong written and oral communication skills to collaborate with other engineers, worldwide.
Knowledge in video encoding, decoding and streaming technologies.
The candidate should be able to work independently with minimal direction.
Knowledge of graphics APIs, like Vulkan and OpenGL.

Serve as the primary technical expert between NVIDIA and our customers, understanding their technology and provide the best AI solutions/ guidance on training process in terms of tools and methodology
Build proof-of-concepts and demonstrations that highlight the power of NVIDIA AI platforms in robotics
Partner with developers, researchers, technology specialists, IT professionals, and executives to facilitate the integration of NVIDIA technology
Work with Engineering, Product and Sales teams to develop, plan best suitable solutions for customers. Enable development and growth of product features through customer feedback and proof-of-concept evaluations.
MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering fields.
Deep expertise in AI/Deep Learning, with hands-on experience in training or optimizing VLMs for production
Expertise with deep learning frameworks for training VLMs (PyTorch, Nemo), and/or experience with such model's optimization methods and tools (TensorRT and Triton Inference Server).
Excellent verbal, written communication, and technical presentation skills in English.
5+ years' work or research experience with Python/ C++ / other software development
AI passionate with a growth mindset, ability to collaborate effectively with different teams (Engineering, Product, Sales, Marketing) in a rapid evolving environment while continuously learning and sharing insights.
Familiarity with Cosmos (e.g. Cosmos-Reason) and Isaac GR00T family of models.
Familiarity with robotics simulation environments (Isaac Sim, Isaac Lab, MuJoco, etc.).
Experience in large scale training / understanding of modern model, data, expert, context parallel training techniques.
Track record in Neural Networks inference optimization for Physical AI use cases
משרות נוספות שיכולות לעניין אותך