The point where experts and best companies meet
Share
You will apply your expertise to develop highly available services that make effective use of the thousands of GPU involved in this operation. Your services provide the best-in-class performance, accuracy and availability. We are looking for technical talent to design, build, operate and improve our capabilities to produce NIMs at scale, including the underlying infrastructure, pipelines, inference backends, Docker build, test harness, metrics, performance engineering, log ingestion, and more.
What you'll be doing:
Design, build, and optimize containerized inference execution for a variety of applications, ensuring efficiency and scalability. These applications may run in container orchestration platforms like Kubernetes to enable scalable and robust deployment.
Drive the adoption of a large number of optimized models through microservices developed in Python or Go. Ensure the performance and scalability of NIMs through comprehensive performance measurement and optimization.
Apply container expertise to create and optimize the basic building blocks of NIMs, influencing the development of many models and related products within NVIDIA.
Collaborate, brainstorm, and improve the designs of inference solutions with a broad team of software engineers, researchers, SREs, and product management.
Mentor and collaborate with team members and other teams to foster growth and development. Demonstrate a history of learning and enhancing both personal skills and those of colleagues.
What we need to see:
A history of using advanced programming skills to build distributed compute systems, backend services, microservices, and cloud technologies.
Effective experience working with multi-functional teams, principals, and architects across organizational boundaries.
Mentorship and the ability to grow teams and team members.
Deep technical expertise in distributed containerized applications using Docker, Kubernetes, Cloud Endpoints, Helm, and Prometheus.
Passion for building scalable and performant microservice applications.
Excellent interpersonal skills and the flexibility to lead multi-functional efforts.
Proven experience debugging and analyzing the performance of distributed microservices or cloud systems.
A degree in Computer Science, Computer Engineering, or a related field (BS or MS) or equivalent experience.
6+ years of demonstrated experience in developing performant microservices, cloud software, and/or tooling roles.
Ways to stand out from the crowd:
Experience with multiple container engines, internals of the container image and runtime.
Prior experience in building and deploying containers for Microservices, Cloud, and On-prem deployments, along with their associated CI/CD pipelines.
Previous work in large-scale full-stack development.
You will also be eligible for equity and .
These jobs might be a good fit