As a Staff ML Engineer, you will help lead the technical direction of a next-generation AI and ML platform, while enabling GenAI and LLM-powered applications at enterprise scale.
You will architect systems that power compliance intelligence while also influencing the broader AI platform strategy across eBay. The role is ideal for someone who thrives at the intersection of system architecture, ML infrastructure, and powerful AI innovation.
What You Will Accomplish
Architect & Scale: Design and build robust, scalable systems and low-latency APIs that support high-throughput machine learning applications, including LLM inference and GenAI pipelines.
Platform Ownership: Lead the development of a modern AI platform supporting data prep, model training, serving, monitoring, and observability—built for modular reuse and horizontal scale.
GenAI Enablement: Architect and deploy GenAI systems including retrieval-augmented generation (RAG), vector search, timely engineering frameworks, and GPU inference optimization.
Tooling & Infrastructure: Drive adoption of groundbreaking ML infrastructure—integrating orchestration tools (Kubeflow, Airflow), MLOps workflows (MLflow), and scalable serving layers (Triton, Ray.io, TensorRT, vLLM).
Multi-functional Influence: Align platform capabilities with product, ML science, and compliance engineering teams to deliver unified solutions across domains.
Technical Leadership: Own architectural vision, perform meticulous code reviews, and lead technical deep-dives and spike efforts into emerging tech.
Culture & Mentorship: Mentor engineers across teams, champion engineering excellence, and build a forward-leaning culture of experimentation and delivery velocity.
What You Will Bring
Prefer PhD or MS in Computer Science, Electrical Engineering or related field with 8+ years of experience in software engineering with a focus on large-scale, high-performance distributed systems.
Proven experience designing and scaling AI/ML platforms and services—especially those optimized for large model training and GenAI inference.
Strong programming skills in Python, Java or C, with experience in frameworks like TensorFlow, PyTorch.
Deep understanding of database systems: SQL, NoSQL, vector databases, graph databases. Strong foundation in building data architectures for ML applications.
Experience building and deploying GenAI/LLM systems, including RAG pipelines, custom embeddings, vector indexing, and GPU-accelerated inference.
Familiarity with tools and frameworks including Docker, Kubernetes, Spark, Hadoop, Kafka, MLflow, Airflow, Kubeflow, Nvidia Triton, Nvidia TensorRT, vLLM, Ray.io.
The base pay range for this position is expected in the range below:
$132,000 - $222,100משרות נוספות שיכולות לעניין אותך