What you'll do...
- Develop a Real Time Inference system, which will be used for deploying deep learning models or traditional ML models and running inference on demand at low latency. This system should be capable of scaling horizontally and will be deployed onto a Kubernetes-based cluster. The platform will support different ML frameworks such as TensorRT, TensorFlow, PyTorch, ONNX, and custom frameworks. This Real Time Inference system plays an important role in providing recommendations based on current user context.
- Develop capabilities to support inferencing of Gen AI models, and support other LLM techniques such as RAG, model fine-tuning, and building a semantic cache.
- Develop and implement best practices in MLOps to streamline the development, deployment, and monitoring of ML models. This includes complete model lifecycle management, ensuring model reproducibility, and facilitating seamless model versioning, rollout, monitoring and updating.
- Collaborate with product managers, data scientists, and other engineering teams to understand their needs and translate them into ML platform capabilities. This collaborative approach ensures that the ML infrastructure effectively supports product features and business objectives.
- Lead architectural and design discussions to ensure our platform is modular, scalable, fault tolerant, and sustainably built.
- Mentor and grow team members while working in an agile environment and ensure best engineering practices are followed.
You will also receive PTO and/or PPTO that can be used for vacation, sick leave, holidays, or other purposes. The amount you receive depends on your job classification and length of employment. It will meet or exceed the requirements of paid sick leave laws, where applicable.
For information about PTO, see .
Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to a specific plan or program terms.
For information about benefits and eligibility, see .
Hoboken, New Jersey US-10279/Bellevue, Washington US-11075: The annual salary range for this position is $132,000.00-$264,000.00
Sunnyvale, California US-04397: The annual salary range for this position is $143,000.00-$286,000.00
Additional compensation includes annual or quarterly performance bonuses.
Additional compensation for certain positions may also include:
- Stock
Minimum Qualifications... Outlined below are the required minimum qualifications for this position. If none are listed, there are no minimum qualifications.
Option 1: Bachelor's degree in computer science, computer engineering, computer information systems, software engineering, or related area and 4 years’ experience in software engineering or related area.
Option 2: 6 years’ experience in software engineering or related area.
Preferred Qualifications... Outlined below are the optional preferred qualifications for this position. If none are listed, there are no preferred qualifications.
680 West California Avenue, Sunnyvale, CA 94086-4834, United States of America