Share
In this role, you will contribute to the development and optimization of advanced machine learning software and frameworks. You will also play a critical role in enhancing our CI/CD pipelines and DevOps practices to ensure efficient and reliable delivery of our software solutions. We are looking for someone with extensive experience in machine learning, strong leadership skills, and a deep understanding of CI/CD and cloud infrastructure.
What you’ll be doing:
Contribute the development and maintenance of advanced machine learning software and frameworks, optimizing for performance and scalability.
Enhance CI/CD pipelines to streamline the development, testing, and deployment of large-scale machine learning models.
Implement and manage cloud infrastructure for continuous integration, delivery, and deployment, ensuring high availability and scalability.
Collaborate with cross-functional teams, including engineering, QA, and research, to improve development workflows and enhance software delivery speed and quality.
Troubleshoot and resolve complex issues related to software development, containerization, and cloud infrastructure in production environments.
Write and maintain robust documentation for development and deployment processes.
Communicate effectively with technical and non-technical stakeholders to set shared expectations and ensure visibility around the release and deployment process.
Lead code reviews, testing, and debugging to ensure high-quality code and efficient workflows.
Mentor and guide junior engineers, fostering professional growth and enhancing team capabilities.
What we need to see:
Bachelor’s or master’s degree or equivalent experience in Computer Science, Information Systems, Engineering, or other related fields.
7+ years of experience in software engineering, with hands-on experience in CI/CD, cloud infrastructure, and advanced machine learning frameworks.
Proficiency with automation and orchestration tools, including Docker, Kubernetes, Jenkins, and Terraform or Ansible, or similar CI Tools.
Experience with cloud platforms like AWS, Azure, or GCP.
Strong programming skills in Python and/or other relevant languages.
Experience in developing and deploying scalable software solutions.
Strong analytical and problem-solving skills with a focus on practical and scalable solutions.
Ability to work effectively in a collaborative environment and manage multiple tasks and projects.
Familiarity with version control systems and configuration management.
Ways to stand out from the crowd:
Demonstrated ability to learn quickly and adapt to new technologies.
Strong academic record, with achievements in competitive programming or related areas.
Extensive experience with advanced AI tools and frameworks, including LLMs and NVIDIA Blueprints.
Contributions to open-source projects, demonstrating a collaborative and innovative mindset.
Experience in deploying machine learning models on edge devices or platforms.
You will also be eligible for equity and .
These jobs might be a good fit