

What you'll be doing:
Development of Kubernetes integration in our Linux-based cluster management software product. You will allow customers to set up, manage and monitor Kubernetes deployments on their BCM clusters.
Integrating other NVIDIA components into Base Command Manager.
Ensuring that various types of workload can easily utilize GPUs through Kubernetes or other workload management systems such as Slurm.
Development of various Kubernetes operators to facilitate different types of workload in Kubernetes.
Following the latest developments in the area of Kubernetes.
Assisting the support team with Kubernetes specific support tickets that require specific expertise.
Working with the latest hardware (e.g. GPUs, AI accelerators, high-speed interconnects such as InfiniBand and Spectrum X) and software technologies such as parallel filesystems (e.g. Lustre, GPFS, BeeGFS, WekaIO), Jupyter, various ML frameworks and tools, and Ceph.
What we need to see:
Degree in Computer Science or related field.
Fluency in C++ and/or Python
Experience with concurrent programming techniques
7+ years of relevant experience, ideally in the area of systems programming
In-depth knowledge of Linux and Kubernetes
Ways to stand out from the crowd:
Experience with high-performance computing and system administration would be an asset
Experience with Slurm
Background with GoLang would be beneficial
משרות נוספות שיכולות לעניין אותך

What you’ll be doing:
Own and maintain CI/CD pipelines using GitLab, Jenkins, and internal tools.
Design and implement automated solutions in Python to streamline development and operations.
Support a container-based environment (Docker/Kubernetes) for building and testing distributed microservices.
Build internal Python tools to help developers test, and debug their code in a CI/CD environment.
Collaborate closely with developers to improve reliability, efficiency, and visibility of software delivery workflows.
Drive adoption of best practices in automation, testing, and release processes.
What we need to see:
Degree in Computer Science or related field (or equivalent experience).
3+ years of experience in DevOps, Automation, or Infrastructure Engineering roles.
Proficiency in Python with a focus on backend tooling and automation scripts.
Strong knowledge of GitLab CI, Jenkins, or similar pipeline systems.
Hands-on experience with Linux systems and IP networking in production environments.
Understanding of containers, microservices, and distributed systems.
Ways to stand out from the crowd:
Strong Python, Linux, and Networking skills.
Passion for helping other developers through tooling and infrastructure improvements.
Familiarity with Grafana, Prometheus, or similar monitoring tools.
A track record of reducing build times, flakiness, or pipeline costs in a large-scale environment.
Contributions to internal developer platforms or open-source DevOps tooling.
משרות נוספות שיכולות לעניין אותך

What you'll be doing:
Development of Kubernetes integration in our Linux-based cluster management software product. You will allow customers to set up, manage and monitor Kubernetes deployments on their BCM clusters.
Integrating other NVIDIA components into Base Command Manager.
Ensuring that various types of workload can easily utilize GPUs through Kubernetes or other workload management systems such as Slurm.
Development of various Kubernetes operators to facilitate different types of workload in Kubernetes.
Following the latest developments in the area of Kubernetes.
Assisting the support team with Kubernetes specific support tickets that require specific expertise.
Working with the latest hardware (e.g. GPUs, AI accelerators, high-speed interconnects such as InfiniBand and Spectrum X) and software technologies such as parallel filesystems (e.g. Lustre, GPFS, BeeGFS, WekaIO), Jupyter, various ML frameworks and tools, and Ceph.
What we need to see:
Degree in Computer Science or related field.
Fluency in C++ and/or Python
Experience with concurrent programming techniques
7+ years of relevant experience, ideally in the area of systems programming
In-depth knowledge of Linux and Kubernetes
Ways to stand out from the crowd:
Experience with high-performance computing and system administration would be an asset
Experience with Slurm
Background with GoLang would be beneficial
משרות נוספות שיכולות לעניין אותך