Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Nvidia Senior MLOps Automation Engineer - AV Infrastructure 
United States, Texas 
966261908

04.04.2024


What you will be doing:

  • Craft, maintain, and enhance CI/CD tools for both on-prem and cloud deployment of our software, with a laser focus on minimizing cognitive load and creating lightning-fast feedback loops for our software developers.

  • Collaborate closely with development, operations, and quality assurance teams to optimize MLOps practices, tools, and infrastructure, ensuring a seamless developer experience with minimal toil, and boosting the efficiency of continuous integration, continuous delivery (CI/CD), and software release management.

  • Dive into the world of automation, streamlining build, deployment, and release processes using infrastructure as code (IaC) tools like Terraform. Make these processes a breeze, reducing manual effort and freeing up creative brainpower for our developers.

  • Instrument, measure, and evaluate our developer practices with a focus on reducing the time it takes to get code into production for our software developers.

  • Collaborate with operations and security teams to ensure proper configuration and management of infrastructure resources, including containers, databases, and networking, following standard processes for security, scalability, and cost optimization.

  • Stay up-to-date with the latest advancements in MLOps tools, technologies, and standard methodologies, and provide recommendations for continuous improvement of our software development and deployment processes.

What we need to see:

  • Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field (or equivalent experience).

  • 8+ years of hands-on experience in large and complex software development environments and Experience with large code bases is a plus.

  • Proven experience as a Software Engineer, with a strong focus on enhancing developer productivity through streamlined processes and tools.

  • Systems thinker with a willingness to learn new technologies and propose innovative solutions that cross team and system boundaries.

  • Programming and scripting skills in languages such as Python, Java, Shell, or PowerShell, and proficiency in version control systems such as Git.

  • Familiarity with popular CI/CD tools (e.g., Jenkins, GitLab CI, Travis CI, CircleCI), build systems (e.g., CMake, Bazel, Gradle), and version control systems (e.g., Git, Perforce).

  • Knowledge of infrastructure as code (IaC) tools and concepts, including Terraform, and experience with cloud computing platforms.

  • Familiarity with containerization technologies such as Docker and container orchestration platforms such as Kubernetes.

  • Solid understanding of software testing principles and experience with automated testing frameworks and tools.

  • Knowledge of release management practices and tools, covering versioning, branching, merging, and tagging.

Ways to stand out from the crowd

  • Experience with GPU-accelerated applications or technologies.

  • Thrive in a dynamic, multi-tasking environment with ever-evolving priorities.

  • Strong background with Jenkins on k8s at scale.

  • Previous success stories with a large-scale operations team.

  • Outstanding interpersonal skills and communication with all levels of management.

You will also be eligible for equity and .