Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Google Field Solution Developer II AI Infrastructure Google Cloud 
Canada, Ontario, Toronto 
302140047

29.01.2025
Info Note: By applying to this position you will have an opportunity to share your preferred working location from the following: Toronto, ON, Canada; Waterloo, ON, Canada; Montreal, QC, Canada.Note: By applying to this position you will have an opportunity to share your preferred working location from the following:.
Minimum qualifications:
  • Bachelor's degree in Computer Science, Mathematics, a related technical field, or equivalent practical experience.
  • 5 years of experience with cloud infrastructure (hardware shapes, sizes, auto-scaling, auto-provisioning, etc.), working with infrastructure as a service, platform as a service, or software as a service.
  • Experience with distributed training and optimizing performance versus costs.
  • Experience coding in Python, bash scripting, and using OSS frameworks such as TensorFlow, PyTorch, Jax, etc.
  • Experience with orchestrators such as Slurm or Kubernetes.
  • Experience building and operationalizing machine learning models.

Preferred qualifications:
  • Experience training and fine tuning large models (i.e., image, language, segmentation, recommendation, genomics) with accelerators.
  • Experience with containerization, K8s, Kubernetes on cloud.
  • Experience with running MLPerf benchmarks.
  • Experience with performance profiling tools (i.e., Tensorflow profiler, PyTorch profiler, Tensorboard).
  • Experience designing/architecting large-scale AI compute clusters.
  • Ability to debug distributed training/inferencing code running.