המקום בו המומחים והחברות הטובות ביותר נפגשים
You must have deep technical experience working with technologies related to large language models (LLM) including LLM architectures, distributed training and inference, model evaluation, and fine-tuning techniques.Key job responsibilities
You will help develop the industry’s best cloud-based solutions to grow the GenAI business. Working closely with our engineering teams, you will help enable new capabilities for our customers to develop and deploy GenAI workloads on AWS. You will facilitate the enablement of AWS technical community, solution architects and, sales with specific customer centric value proposition and demos about end-to-end GenAI on AWS cloud.You will possess a technical and business background that enables you to drive an engagement and interact at the highest levels with startups, Enterprises, and AWS partners. You will have the technical depth and business experience to easily articulate the potential and challenges of GenAI models and applications to engineering teams and C-Level executives. This requires deep familiarity across the stack – compute infrastructure (e.g., Amazon EC2, EKS, SageMaker, Lustre), ML frameworks PyTorch, JAX, orchestration layers Kubernetes and Slurm, parallel computing (NCCL, MPI), MLOPs, as well as target use cases in the cloud.You will drive the development of the GTM plan for building and scaling GenAI on AWS, interact with customers directly to understand their business problems, and help them with defining and implementing scalable GenAI solutions to solve them (often via proof-of-concepts). You will also work closely with account teams, research scientists, and product teams to drive model implementations and new solutions.This is an opportunity to be at the forefront of technological transformations, as a key technical leader. Additionally, you will work with the AWS GenAI product teams to shape product vision and prioritize features for AI/ML Frameworks and applications. A keen sense of ownership, drive, and being scrappy is a must.
- Bachelor's degree in computer science, engineering, mathematics or equivalent
- Experience developing technology solutions and evangelising end-to-end technology roadmaps that guide IT transformations toward cloud computing
- Experience in specific technology domain areas like software development, cloud computing, systems engineering, infrastructure, security, networking, data and analytics
- Experience communicating across technical and non-technical audiences and at C-level, including training, workshops, publications
- Practical experience in distributed training frameworks and inference servers. Orchestrators/schedulers (one or several of Kubernetes, EKS, Slurm), storage systems (S3, Lustre, POSIX). Experience working with GPUs or custom silicon, profiling and optimization.
- Knowledge of distributed systems design and implementation or equivalent
- Knowledge of large scale automation and workflow management or equivalent
- Knowledge of presentations and whiteboarding skills with a high degree of comfort speaking with internal and external executives, IT management, and developers
- Experience architecting, migrating, transforming or modernizing customer requirements to the cloud
- Practical experience in High Performance Computing (HPC) and/or distributed training, performance profiling and optimization.
- Experience in distributed training (PyTorch, Jax, NeMo) and/or inference (NIMS, TRT-LLM, TorchServe, Triton).Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
משרות נוספות שיכולות לעניין אותך