Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Walmart Staff Software Engineer Machine Learning Engineer Personalization Real Time 
United States, California, Sunnyvale 
116727693

23.09.2024

What you'll do...

  • which will be used for deploying deep learning models or traditional ML models and run inferences on-demand at low latency. This system should be capable of scaling horizontally and will be deployed onto kubernetes based cluster. This platform will be capable of supporting different ml frameworks like TensorRT, TensorFlow, PyTorch, ONNX and Custom . This Real Time Inference system plays an important role in providing recommendations based on current user context.
  • Develop capabilities to support inferencing of Gen AI models , and support other LLM techniques like RAG, model fine tuning, building semantic cache .
  • Develop and implement best practices in MLOps to streamline the development, deployment, and monitoring of ML models. This includes complete model lifecycle management, ensuring model reproducibility, and facilitating seamless model versioning, rollout, monitoring and updating.
  • Collaborate with product managers, data scientists, and other engineering teams to understand their needs and translate them into ML platform capabilities. This collaborative approach ensures that the ML infrastructure effectively supports product features and business objectives.
  • Lead architectural and design discussions to ensure our platform is modular, scalable, fault tolerant, and sustainably built
  • Mentor and grow team members while working in an agile environment and ensure best engineering practices are followed.

  • Bachelors degree in Computer Science, Information Technology , Engineering or related field and 7+ years' experience OR Masters degree in Computer Science, Information Technology, Engineering or related field and 5 + years' experience in an related field.
  • You’re experienced with
    building real time inference systems
    for deploying deep learning or traditional ML models.
  • Experience with developing Restful APIs using Python and Flask .
  • Experience working with Triton Inference Server .
  • Experience with deploying and building real time systems for LLMs
  • You have experience working on ranking, online advertising, recommender systems, ecommerce , or relevant areas
  • Working knowledge of setting up RAG
  • Experience working with
    model lifecycle management tools
    like ML Flow and KF Serving
  • Strong Expertise in Java, Object Oriented & Functional Programming, Design Patterns .
  • In-depth knowledge of No-SQL and RDBMS technologie s.
  • Proficient with event driven architecture using Kafka.
  • Advanced knowledge of complex software design, distributed system design, design patterns, data structures and algorithms.
  • Working knowledge in container and cloud native technologies – Kubernetes, Docker .
  • Proficient in building scalable Microservices using Spring framework.
  • Experience with CI/CD tools such as Jenkins and Maven.
  • Strong expertise with Cloud Technologies like Azure and GCP.
  • Experience in monitoring production systems and using different systems like Grafana, Prometheus .
  • You have strong written and oral communication skills .
  • Experience with all phases of the software development life cycle, best practices, and Agile Software Development .
You will also receive PTO and/or PPTO that can be used for vacation, sick leave, holidays, or other purposes. The amount you receive depends on your job classification and length of employment. It will meet or exceed the requirements of paid sick leave laws, where applicable.

For information about PTO, see .

Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to a specific plan or program terms.

For information about benefits and eligibility, see .

Hoboken, New Jersey US-10279/Bellevue, Washington US-11075:The annual salary range for this position is $132,000.00-$264,000.00 Sunnyvale, California US-04397:The annual salary range for this position is $143,000.00-$286,000.00 Additional compensation includes annual or quarterly performance bonuses. Additional compensation for certain positions may also include: - Stock Minimum Qualifications...

Outlined below are the required minimum qualifications for this position. If none are listed, there are no minimum qualifications.

Option 1: Bachelor's degree in computer science, computer engineering, computer information systems, software engineering, or related area and 4 years’ experience in software engineering or related area.Option 2: 6 years’ experience in software engineering or related area.
Preferred Qualifications...

Outlined below are the optional preferred qualifications for this position. If none are listed, there are no preferred qualifications.

680 West California Avenue, Sunnyvale, CA 94086-4834, United States of America