Share
- 5+ years of non-internship professional software development experience
- 5+ years of programming with at least one software programming language experience
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience as a mentor, tech lead or leading an engineering team
- Design, build, and optimize machine learning infrastructure for large-scale training and inference.
- Apply PyTorch, Python, and C++ skills to engineer modular, scalable ML systems.
- Evaluate and implement parallelism techniques such as data, tensor, model, and pipeline parallelism.
- Monitor and optimize GPU memory and throughput for training large models efficiently.
- Collaborate cross-functionally with research, data infra teams to integrate new models and features
- Deep understanding of LLM algorithm and deep learning framework like PyTorch
- Mathematics and Statistics: Strong understanding of linear algebra, calculus, probability, and statistics.
- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalentPursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
These jobs might be a good fit