Share
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language
- Design, build, and optimize machine learning infrastructure for large-scale training and inference.
- Apply PyTorch, Python, and C++ skills to engineer modular, scalable ML systems.
- Evaluate and implement parallelism techniques such as data, tensor, model, and pipeline parallelism.
- Monitor and optimize GPU memory and throughput for training large models efficiently.
- Collaborate cross-functionally with research, data infra teams to integrate new models and features.
- Deep understanding of LLM algorithm and deep learning framework like PyTorch.
- Mathematics and Statistics: Strong understanding of linear algebra, calculus, probability, and statistics.
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalentPursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
These jobs might be a good fit