Finding the best job has never been easier
Share
What you'll be doing:
Work with Configuration Management tools and Workflow management tool like Ansible and Stackstorm to manage and deploy COLO multi-platform server clusters in Nvidia Data Centers around the world.
You will design and develop different infrastructures, tools, and automation scripts to improve automation support for varies activities such as image management, switch management, deployment, data analytics, automated testing, logging, monitors and alerts for different micro-services.
Work with data centers infrastructure tools like netbox, foreman and Mellanox switches to manage DCs at scale.
Work on the latest infrastructure management like Kubernetes and docker for fast and consistent delivery.
You will be the owner of infrastructure for micro-services and provide operation support to application teams such as automation service, infrastructure and security improvement, and live service troubleshooting.
What we need to see:
BS in ComputerScience/Engineering/Math/Physics
3+ years of proven experience
Excellent scripting: Python, bash, Groovy, GOLANG
Outstanding debugging skills
Cloud experience with AWS Compute, Containers, and Networking services are preferable.
CI/CD experience with Jenkins and Jenkins pipeline
Experience with Configuration Management such as Ansible is a big plus.
Experience with Packer, Terraform and StackStorm is a plus.
Experience with Kubernetes, docker and Helm is a huge plus
These jobs might be a good fit