Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Capital One Principal Associate Machine Learning Engineer GEN AI 
India, Karnataka, Bengaluru 
325084607

25.09.2024
Voyager (94001), India, Bangalore, Karnataka Principal Associate Machine Learning Engineer (GEN AI)

As a Capital One Machine Learning Engineer (MLE), you'll be part of a team focusing on generative AI observability. In this role, you will work on pioneering technologies that ensure the robustness, reliability, and performance of our generative AI models. Your contributions will directly impact the scalability and success of our AI-driven products.

What You’ll Do

  • Design and implement sophisticated observability frameworks to monitor the performance and health of generative AI models.

  • Build robust SDK, platform components to collect metadata, traces and parameters of models running at scale.

  • Work on cutting edge LLM frameworks and their instrumentation.

  • Analyze and optimize model performance, latency, and resource utilization to maintain high standards of efficiency, reliability and compliance

  • Collaborate as part of a cross-functional Agile team to create and enhance software that enables state of the art, next generation big data and machine learning applications.

  • Build smart products to enable actionable insights to improve model performance and scalability.

Basic Qualifications:

  • Bachelor’s Degree

  • At least 4 years of experience designing and building NLP, LLM and DL based solutions.

  • At least 4 years of experience programming with Python, Go, or Java

  • At least 2 years of on-the-job experience with an industry recognized ML framework such as scikit-learn, PyTorch, CUDA, Keras, Spark, or TensorFlow.

  • At least 2 years of experience in training and deploying large language or deep learning models at scale.

  • At least 1 year of experience productionizing, monitoring, and maintaining GEN AI/LLM models.

  • At least 1 year of experience in modern LLM orchestration and inference frameworks.

Preferred Qualifications:

  • 1+ years of experience building, scaling, and optimizing ML systems.

  • 2+ years of experience developing performant, resilient, and maintainable code.

  • Familiarity with cloud services (AWS, GCP, Azure) and container orchestration (Kubernetes, Docker).

  • Experience with model quantization or computational optimization is a plus.

If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation, please contact Capital One Recruiting at 1-800-304-9102 or via email at . All information you provide will be kept confidential and will be used only to the extent required to provide needed reasonable accommodations.