Own a machine learning vertical, including deciding what data we should collect and how to label it, designing the network architecture, training the model at scale on one of the largest GPU clusters in the world, and driving and evaluating your model in an engineering Tesla vehicle
Work with a world-class team on cutting-edge techniques in large multimodal models, multi-task learning, video networks, generative models, imitation learning, semi-supervised learning, and self-supervised learning
Have an outsized impact deploying foundation models to millions of Tesla’s robotic platforms across the world
What You’ll Bring
Strong software engineering skills: much of modern deep learning success comes down to the quality of the implementation, and strong engineering ability is non-negotiable
Demonstrated excellence and a proven track record of solving difficult software engineering problems
An “under the hood” knowledge of deep learning: layer details, loss functions, optimization, etc.
Experience with PyTorch, or another major deep learning framework such as JAX or TensorFlow