You thrive on the challenge of building and optimizing platforms at scale, are deeply passionate about leveraging cutting-edge technologies, and are dedicated to innovation and market success. Your role will involve leveraging your technical expertise and practical experience in AI and machine learning, with a focus on model serving and compute platforms. You will be responsible for designing and developing the platform to enable data scientists and ML solution engineers to build and deliver AI/ML solutions in a self-service
Essential Responsibilities:
- Delivers complete solutions spanning all phases of the Software Development Lifecycle (SDLC) (design, implementation, testing, delivery and operations), based on definitions from more senior roles.
- Advises immediate management on project-level issues
- Guides junior engineers
- Operates with little day-to-day supervision, making technical decisions based on knowledge of internal conventions and industry best practices
- Applies knowledge of technical best practices in making decisions
Minimum Qualifications:
- Minimum of 5 years of relevant work experience and a Bachelor's degree or equivalent experience.
- track recordof over-achieving engineering and platform delivery and scaling targets in high volume, innovative, and fast-paced high-pressure environment; proven results in delivery the platform products.
- Familiar with AI/ML concepts, algorithms, and techniques, with hands-on experience in delivering AI/ML solutions.
- Practical knowledge of infrastructure components includingcompute, networking, and storage for modern cloud environments, especially with Kubernetes.
- Solid understanding of the AI/ML development lifecycle, with familiarity inMLOpspractices such as model deployment, serving, and monitoring.
- Proficient in Programming languages such as Java or Python.
- Experience with LLM inferencing frameworks (vLLM,SGLang,TensorRT-LLM) and LLM model optimization (nice to have).
- Experience with ML model serving and orchestration frameworks such asMLflow, Seldon, Triton Inference Server, or Ray Serve (nice to have).
Travel Percent:
The total compensation for this practice may include an annual performance bonus (or other incentive compensation, as applicable), equity, and medical, dental, vision, and other benefits. For more information, visit .
The US national annual pay range for this role is $123,500 to $212,850
Our Benefits:
Any general requests for consideration of your skills, please