דרושים Model-as-a-service Tech Lead ב-אנבידיה ב-ארהב

Working with tech giants to develop and demonstrate solutions based on NVIDIA’s groundbreaking software and hardware technologies. Partnering with Sales Account Managers and Developer Relations Managers to identify and secure...

US, WA, Seattle

time type: Full time

posted on: Posted Today

job requisition id

What you’ll be doing:

Working with tech giants to develop and demonstrate solutions based on NVIDIA’s groundbreaking software and hardware technologies.
Partnering with Sales Account Managers and Developer Relations Managers to identify and secure business opportunities for NVIDIA products and solutions.
Serving as the main technical point of contact for customers engaged in the development of intricate AI infrastructure, while also offering support in understanding performance aspects related to tasks like large scale LLM training and inference.
Conducting regular technical customer meetings for project/product details, feature discussions, introductions to new technologies, performance advice, and debugging sessions.
Collaborating with customers to build Proof of Concepts (PoCs) for solutions to address critical business needs and support cloud service integration for NVIDIA technology on hyperscalers.
Analyzing and developing solutions for customer performance issues for both AI and systems performance.

What we need to see:

BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other Engineering fields or equivalent experience.
4+ years of engineering(performance/system/solution)experience.
Hands-on experience building performance benchmarks for data center systems, including large scale AI training and inference.
Understanding of systems architecture including AI accelerators and networking as it relates to the performance of an overall application.
Effective engineering program management with the capability of balancing multiple tasks.
Ability to communicate ideas clearly through documents, presentations, and in external customer-facing environments.

Ways to stand out from the crowd:

Hands-on experience with Deep Learning frameworks (PyTorch, JAX, etc.), compilers (Triton, XLA, etc.), and NVIDIA libraries (TRTLLM, TensorRT, Nemo, NCCL, RAPIDS, etc.).
Familiarity with deep learning architectures and the latest LLM developments.
Background with NVIDIA hardware and software, performance tuning, and error diagnostics.
Hands-on experience with GPU systems in general including but not limited to performance testing, performance tuning, and benchmarking.
Experience deploying solutions in cloud environments including AWS, GCP, Azure, or OCI as well as knowledge of DevOps/MLOps technologies such as Docker/containers, Kubernetes, data center deployments, etc. Command line proficiency.

You will also be eligible for equity and .

משרות נוספות שיכולות לעניין אותך

Nvidia Senior Solutions Architect GPU - Cloud Service Providers United States, Texas

Nvidia Solutions Architect Networking - Cloud Service Providers United States, Texas

Yesterday

Nvidia Model-as-a-Service Tech Lead United States, California

שיתוף

Serve as the primary, high-impact contributor on complex features. Dedicate significant time to producing production code across the full stack, including UI, APIs, services, and infrastructure. Code Review Leadership &...

time type: Full time

posted on: Posted 6 Days Ago

job requisition id

What you'll be doing:

Serve as the primary, high-impact contributor on complex features. Dedicate significant time to producing production code across the full stack, including UI, APIs, services, and infrastructure.
Code Review Leadership & Quality Assurance: Lead the code review process, setting and implementing thorough coding standards, performance benchmarks, and architectural integrity to ensure all merged code is high-quality, maintainable, and robust.
Architectural Ownership & Portability: Define and own the long-term technical roadmap, architecture, and design. This includes the required assurance that the deployment pipelines and services are platform-agnostic and easily deployable across the broader NVIDIA ecosystem, deliberately avoiding internal infrastructure dependencies.
Foundation Model Deployment Strategy: Lead the strategic implementation of web services and efficient batch processing queues to seamlessly integrate and operationalize our world foundation models into the customer-facing platform.
System Performance & Reliability: Implement and make sure standards for production-grade performance, monitoring, and fault tolerance across all services. Proactively identify and resolve systemic technical debt and scalability bottlenecks.
Deployment & Operational Excellence: Take ultimate ownership of the CI/CD pipelines, container orchestration strategy (Kubernetes/Helm), and operational readiness, ensuring seamless scalability and reliability in production.
Team Mentorship & Guidance: Mentor and guide the engineering team on advanced practices in full-stack development, distributed systems design, performance optimization, and clean, portable code architecture.
Multi-functional Partnership: Act as the key technical liaison, translating complex requirements from Product Managers, ML Engineers, and Data Scientists into robust, portable, and implementable designs.

What we need to see:

This role requires a proven track record of significant experience and technical mastery:

Minimum 12+ years of hands-on experience developing and deploying scalable full-stack web services in a cloud environment.
Proven Tech Lead or equivalent Senior/Staff level experience with demonstrated ability to define system architecture, mentor engineers, and take end-to-end technical ownership of a major platform while remaining deeply active in coding and code reviews.
Expert-level proficiency in designing and scaling distributed microservices architectures using gRPC and REST APIs.
Deep expertise in modern frontend frameworks and building highly responsive, data-intensive UIs capable of managing high-frequency data flows.
Direct experience designing and deploying containerized applications that use a GPU (e.g., NVIDIA Container Toolkit).
Experience with MaaS (Model-as-a-Service) patterns and serving large machine learning models as high-throughput endpoints.
Mastery of container orchestration, including Kubernetes and Helm for sophisticated, portable, multi-service production deployments.
Proficiency in backend languages such as Python and/or Go, and TypeScript for the frontend.
Strong practical experience with Cloud Infrastructure (AWS S3) and running complex data storage/access patterns (SQL, key-value stores).
Expertise in CI/CD practices (GitLab, Jenkins) with a focus on automation, testing, and improving deployment velocity and stability.
Bachelor's degree (B.S.) or equivalent experience in Computer Science, Software Engineering, Electrical Engineering, or a closely related technical field; Master's degree (M.S.) preferred

Ways to stand out from the crowd:

These skills represent a strong alignment with our specific domain challenges:

Experience in data querying platforms such as Apache Druid, ClickHouse, or Elasticsearch.
Familiarity with autonomous vehicle simulation environments (e.g., Carla) and synthetic data generation pipelines using foundational models.

You will also be eligible for equity and .

משרות נוספות שיכולות לעניין אותך

16.11.2025

Nvidia Senior DevOps Service Reliability Operations Engineer - DGX ... United States, Texas

שיתוף

The team will provide their services 24/7 with a follow-the-sun environment which will span continents. You will report directly to a manager in the United States. Some CIS shifts require...

US, Remote

time type: Full time

posted on: Posted 2 Days Ago

job requisition id

What you will be doing:

The team will provide their services 24/7 with a follow-the-sun environment which will span continents. You will report directly to a manager in the United States.
Some CIS shifts require either a Saturday or Sunday each week. The hours worked may include an early or late start (10hrs-per-day x 4 days-per-week schedule) to ensure that the combination the US and India teams provide 24/7 coverage.
Every CIS team member will use alerts and alarms to help prevent issues and incidents when possible. You may also work with the developer community to develop and implement predictive support or diagnostic routines.
Perform systems administration tasks, network administration tasks, security incident monitoring to drive our actions.
CIS team members will work with developers to learn how the service works, then translate that understanding into runbooks which the entire team will use. As new features and functionality are added, you will also update and evolve the runbooks as needed.
Help discover incidents and issues, including initiating the incident management procedure.
Bring in subject matter authorities or service owners as needed to resolve issues. Feedback will help us continually improve our service.
Your interpersonal skills will help keep the team engaged through resolution and ensure our clients believe we value their time and effort. May perform other tasks that will help us provide extraordinary service levels for our customers.

What we need to see:

Highly motivated with strong communication skills, you have the ability to work successfully with multi-functional teams, principles, and architects, coordinating effectively across organizational boundaries and geographies.
5+ years of experience administering large-scale production systems. 3+ years of experience in high-availability Internet, Cloud, or Data Center environments (Systems Administration, SRE, or NOC).
BS in Computer Science, Engineering, Physics, Mathematics, or equivalent experience.
Expert-level knowledge of Linux system administration and automation using Ansible and/or Python.
Strong experience with shell scripting, DNS, DHCP, storage systems, and core networking (IP Tables, routing, firewalls).
Experience with at least one workload manager (Slurm preferred) or job scheduling system in a production environment.
Strong experience troubleshooting and maintaining large-scale bare-metal infrastructure. Strong cross-team collaboration, documentation, and mentoring skills.
Experience improving processes for automation, reliability, and operational excellence.
Expertise using monitoring tools and problem ticketing systems. Strong problem-solving, analytical, and troubleshooting abilities.

Ways to Stand Out from the Crowd:

Advanced hands-on experience with Kubernetes, SLURM, and large-scale cluster management.
Familiarity with GPU hardware and high-performance computing environments.
Experience with observability and incident management tools (Grafana, OpenTelemetry, PagerDuty, JIRA). Cloud experience (AWS, Azure, GCP) is a plus; strong preference for on-prem expertise.

You will also be eligible for equity and .

משרות נוספות שיכולות לעניין אותך

15.11.2025

Nvidia Router Testing Tech Lead United States, Washington

שיתוף

Participate in an international team of software engineers working on products for testing NVIDIA products. Oversee the design, implementation, and maintenance of scalable test automation frameworks. Manage, mentor, and guide...

US, WA, Redmond

time type: Full time

posted on: Posted 2 Days Ago

job requisition id

What you’ll be doing:

Participate in an international team of software engineers working on products for testing NVIDIA products.
Oversee the design, implementation, and maintenance of scalable test automation frameworks.
Manage, mentor, and guide a team of automation engineers.
Design and implement robust, maintainable, and efficient automation test suite.
Work with Continuous integration systems and regression tools, automate builds, and test suites, generate test reports, isolate and classify failures and review new degradation.
Promote a culture of innovation, quality, and accountability. Bring SONiC NOS to shine in customer's view.

What we need to see:

B.Sc. degree or equivalent experience in Engineering/Computer Science/related field.
8+ overall years of experience in software development and testing. 2+ years of experience in a leadership role.
Proven experience in a leadership role, with a track record of successfully leading scrums and projects.
Intrinsically motivated with a desire for automation programming.
Strong programming skills in Python.
Strong technical abilities, problem-solving skills, coding, and design skills.
Ability to lead feature development, take full ownership and deliver independently.
Linux knowledge: have a general understanding of Linux operation system concepts.

Ways to stand out from the crowd:

Strong communication and interpersonal skills, with the ability to motivate and inspire others.
Knowledge in one or more Networking areas: Ethernet, VLANs, TCP/UDP/IP, QoS, L2-L3 protocols.
Extensive experience with automation frameworks (e.g., Selenium, Robot Framework, PyTest) and scripting languages (e.g., Python, Java).

You will also be eligible for equity and .

משרות נוספות שיכולות לעניין אותך

08.11.2025

Nvidia Lead Senior Software Engineer Agentic AI Applications United States, Texas

שיתוף

Design, develop, and implement agentic AI blueprints (applications) that show enterprises how to utilize and deploy this technology. Lead technical reviews and provide mentorship, guiding the engineering team in building...

time type: Full time

posted on: Posted 9 Days Ago

job requisition id

What you'll be doing:

Design, develop, and implement agentic AI blueprints (applications) that show enterprises how to utilize and deploy this technology.
Lead technical reviews and provide mentorship, guiding the engineering team in building production-grade workflows and extending core GenAI SDK capabilities.
Develop proof-of-concept workflows rooted in first principles that apply modern data science techniques to GenAI use cases.
Collaborate cross-functionally with product, research, and infrastructure teams to evolve NVIDIA's agentic ecosystem, including integrations between the NeMo Agent Toolkit and other NVIDIA products and services such as the NeMo Framework, NIMs, and NVIDIA Blueprints.
Drive performance optimization for agentic applications across the data center, focusing on improving accuracy, reducing latency, and growing efficiency.
Establish engineering standards and best practices for developing, testing, and deploying agentic AI applications across distributed environments.

What we need to see:

BS in Computer Engineering, Computer Science, Data Science, or a related field, or equivalent experience; MS or PhD preferred
8+ years of software engineering experience, including 2+ years as tech lead.
Proficient in Python, with at least 6+ years of experience building Python libraries or applications for enterprise customers.
Experience with GenAI application development using LLM frameworks (e.g., Langchain, Llamaindex, or AutoGen), evaluation systems (e.g., RAGAs), and observability platforms (e.g., Arize Phoenix, W&B Weave, or LangSmith).
Experience using and understanding of agentic frameworks.
Proficient in distributed orchestration and communication frameworks (e.g., Kafka, Ray).
Ability to quickly learn and apply new technologies and libraries.
Self-starter with a proactive work ethic, capable of working independently and successfully within a distributed team.
Excellent communication and collaboration skills across distributed, cross-functional teams.

Ways to stand out from the crowd:

Demonstrated leadership in building and scaling agentic AI applications in production.
Experience developing your own agents in Python or a similar language (e.g., Go).
Concrete examples/code of how you have profiled code in the past to identify performance bottlenecks and examples of how you mitigated these.
Experience developing for GPU platforms and familiarity with NVIDIA technologies (e.g., CUDA, TensorRT, Triton, NeMo) and LLM serving frameworks (e.g., Dynamo, vLLM, SGLang).
Experience with RAG systems and communication protocols (e.g., MCP, A2A).

You will also be eligible for equity and .

משרות נוספות שיכולות לעניין אותך

08.11.2025

Nvidia Manager Large Language Model Inference United States, California

שיתוף

Lead and grow a team responsible for specialized kernel development, runtime optimizations, and frameworks for LLM inference. Drive the design, development, and delivery of production inference software, targeting NVIDIA's next-generation...

time type: Full time

posted on: Posted 9 Days Ago

job requisition id

What You’ll Be Doing:

Lead and grow a team responsible for specialized kernel development, runtime optimizations, and frameworks for LLM inference.
Drive the design, development, and delivery of production inference software, targeting NVIDIA's next-generation enterprise and edge hardware platforms.
Integrating cutting-edge technologies developed at NVIDIA and offering an intuitive developer experience for LLM deployment.
Lead software development execution, with responsibility for project planning, milestone delivery, and cross-functional coordination.

What We Need to See:

MS, PhD, or equivalent experience in Computer Science, Computer Engineering, AI, or a related technical field.
7+ overall years of overall software engineering experience, including 3+ years of technical leadership experience.
Proven ability to lead and scale high-performing engineering teams, especially across distributed and cross-functional groups.
Strong background in C++ or Python, with expertise in software design and delivering production-quality software libraries.
Demonstrated expertise in large language models (LLM) and/or vision language models (VLM).

Ways to Stand Out from the Crowd:

Deep understanding of GPU architecture, CUDA programming, and system-level performance tuning.
Background in LLM inference or working with frameworks such as TensorRT-LLM, vLLM, or SGLang.
Passion for building scalable, user-friendly APIs and enabling developers in the AI ecosystem.
Have a proven track record of growing and managing a team that encourages idea sharing, empowers team members, and provides opportunities for professional growth.

You will also be eligible for equity and .

משרות נוספות שיכולות לעניין אותך

26.10.2025

Nvidia Enterprise Partner Marketing Specialist - AI Natives Model B... United States, California

שיתוף

Help to grow the relationships with the marketing organizations of emerging NVIDIA AI Partners. Take charge of coordinating and implementing measurable, end-to-end campaigns with partners to ensure NVIDIA's full-stack solution...

time type: Full time

posted on: Posted 3 Days Ago

job requisition id

What you’ll be doing:

Help to grow the relationships with the marketing organizations of emerging NVIDIA AI Partners.
Take charge of coordinating and implementing measurable, end-to-end campaigns with partners to ensure NVIDIA's full-stack solution maintains prominence in our collaborative marketing communications and initiatives.
Responsible for defining co-marketing objectives, plans and activities that drive brand awareness and generate leads.
Work closely with internal stakeholders, including field sales, product, and product marketing teams, to ensure alignment and integration of partner marketing strategies with overall business goals.
Drive partner asset review & MDF approval process.
Ability to present to senior leaders internally and within our partners organization to influence marketing opportunities that meet both companies' objectives.
Be prepared to roll up your sleeves and actively lead NVIDIA's presence at all partner and industry events.

What we need to see:

3-5 years in marketing with preferred background in technology, open models, AI & cloud technology
Consistent track record of planning and completing end-to-end marketing campaigns.
Positive relationship building and influencing skills in the area of co-marketing & partner alliance marketing
Broad experience across various marketing fields: PR, digital, paid, social media, content marketing, physical & virtual events
Outstanding project management skills with outstanding attention to detail and the capability to lead all aspects of various projects, deadlines, and collaborators concurrently.
Ability to connect the dots across complex conversations and align stakeholders toward common goals.
Bachelor’s degree or equivalent experience in marketing, communications, business or related fields.

You will also be eligible for equity and .

NvidiaSenior Solutions Architect GPU - Cloud Service Providers

משרות נוספות שיכולות לעניין אותך

1 2 3 4 5 6

United States, Texas

971215478

Today

שיתוף

תיאור:

US, CA, Santa Clara

US, WA, Seattle

time type: Full time

posted on: Posted Today

job requisition id

What you’ll be doing:

Working with tech giants to develop and demonstrate solutions based on NVIDIA’s groundbreaking software and hardware technologies.
Partnering with Sales Account Managers and Developer Relations Managers to identify and secure business opportunities for NVIDIA products and solutions.
Serving as the main technical point of contact for customers engaged in the development of intricate AI infrastructure, while also offering support in understanding performance aspects related to tasks like large scale LLM training and inference.
Conducting regular technical customer meetings for project/product details, feature discussions, introductions to new technologies, performance advice, and debugging sessions.
Collaborating with customers to build Proof of Concepts (PoCs) for solutions to address critical business needs and support cloud service integration for NVIDIA technology on hyperscalers.
Analyzing and developing solutions for customer performance issues for both AI and systems performance.

What we need to see:

BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other Engineering fields or equivalent experience.
4+ years of engineering(performance/system/solution)experience.
Hands-on experience building performance benchmarks for data center systems, including large scale AI training and inference.
Understanding of systems architecture including AI accelerators and networking as it relates to the performance of an overall application.
Effective engineering program management with the capability of balancing multiple tasks.
Ability to communicate ideas clearly through documents, presentations, and in external customer-facing environments.

Ways to stand out from the crowd:

Hands-on experience with Deep Learning frameworks (PyTorch, JAX, etc.), compilers (Triton, XLA, etc.), and NVIDIA libraries (TRTLLM, TensorRT, Nemo, NCCL, RAPIDS, etc.).
Familiarity with deep learning architectures and the latest LLM developments.
Background with NVIDIA hardware and software, performance tuning, and error diagnostics.
Hands-on experience with GPU systems in general including but not limited to performance testing, performance tuning, and benchmarking.
Experience deploying solutions in cloud environments including AWS, GCP, Azure, or OCI as well as knowledge of DevOps/MLOps technologies such as Docker/containers, Kubernetes, data center deployments, etc. Command line proficiency.

You will also be eligible for equity and .