Participate in an international team developing software platforms for cluster job execution and monitoring across a variety of NVIDIA data center solutions.
Develop algorithms and software defining the quality standards for future data center technologies.
Collaborate with partners in the design, research, and implementation of required targets.
Work closely with networking architects, driver engineers, and other NVIDIA teams.
Prepare and deliver internal presentations on technical topics.
What we need to see:
B.Sc. degree in Computer Science or equivalent experience.
8+ years of relevant software development experience.
Familiarity with software design patterns, principles, and best practices.
Familiarity with version control systems (e.g. Git) and CI/CD pipelines.
Proficiency in Linux and Windows operating systems.
Passion for automation programming.
Strong debugging and analytical skills.
Excellent interpersonal and communication skills.
Ways to stand out from the crowd:
M.Sc. in Computer Science.
Experience with cluster management and related tools, including Docker containers, Slurm, Kubernetes, and Ansible.
Knowledge of RESTful API design and development.
Experience with microservices architecture and with cloud infrastructure platforms such as AWS, Azure, or Google Cloud.
Specialization in Python programming and familiarity with networking protocols (InfiniBand, Ethernet).