Build and improve our Python training infrastructure for more stable and faster training
Build the tooling and infrastructure for reporting and visualizing model metrics and performance
Build the pipelines to run and validate our PyTorch models
Manage, analyze, and visualize our training and test datasets
Coordinate with the team managing the hardware cluster to maintain high availability and job throughput for Machine Learning
Build and improve tooling to deploy trained neural nets to Tesla hardware
Practical experience programming in Python and/or C++
Proficient in system-level software, particularly hardware-software interactions and resource utilization
Understanding of modern machine learning concepts and state-of-the-art deep learning
Experience working with training frameworks, ideally PyTorch
Demonstrated experience scaling neural network training jobs across clusters of GPUs
Optional: Previous experience in deep learning deployment
Optional: Profiling and optimizing CPU-GPU interactions (pipelining compute/transfers, etc.)