מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר
Key job responsibilities
What will you do?
* Developing End to End pipelines from model training to model running on device with exception automation.
* Building tools and dashboards to auto test these pipelines and provide health metrics
* Scaling to more H100 clusters, Trainiums, and test frameworks like Pytorch ligtining
* Simplifying and reinventing systems, processes, and tools to make ML pipelines better for our Scientists, System Integration teams, Device OS teams
* Investigating technical issues scientifically and thoroughly, and assist in fixing them so they don't come back
* Providing technical solutions to real business problems in a global organization
- Experience leading the design, automation, deployment, and support of large-scale infrastructure
- Experience programming with at least one modern language such as Python, Ruby, Golang, Java, C++, C#, Rust
- Experience with Linux/Unix
- Experience with CI/CD pipelines build processes
- Experience with distributed systems at scale
משרות נוספות שיכולות לעניין אותך