Expoint – all jobs in one place
Finding a high-tech job at the best companies has never been easier

JPMorgan Lead Infrastructure Engineer - AI/ML 
India, Telangana, Hyderabad 
915823684

Yesterday

Join our Infrastructure Engineering Enablement Team as a Lead Engineer, where you’ll drive innovation with AI/ML, automation, and cloud technologies.

As the Lead Infrastructure Engineer at JPMorgan Chase within the Infrastructure Engineering Enablement Team, you will play a crucial role in guiding application teams to effectively perform infrastructure deployment and management functions. You will leverage your expertise in public cloud technologies, AI/ML, and automation to enhance developer capabilities and promote robust cloud-based solutions. This position offers the opportunity to collaborate with product and engineering teams, manage platform issues, and implement best practices for public cloud processes, ensuring minimal downtime and optimal performance.

  • Create and train LLMs that give developers the knowledge they need to perform their tasks.
  • Develop automation scripts that make day-to-day work easier.
  • Collaborate with product and engineering teams to deliver robust cloud-based solutions that drive enhanced customer experiences.
  • Own platform issues and problem management end to end, help resolve platform production issues on the AWS Cloud, and ensure applications are available as expected.
  • Guide product teams on the standards and best practices for the Public Cloud process, and help them mitigate issues in the production cloud with minimal downtime.

Job responsibilities

  • Promote self-service and deliver on a strategy to build broad use of Amazon's utility computing web services (e.g., AWS EC2, AWS S3, AWS RDS, AWS CloudFront, AWS EFS, CloudWatch, EKS).
  • Utilize programming languages such as Java, Python, SQL, Node, Go, and Scala; open-source RDBMS and NoSQL databases; container orchestration services, including Docker and Kubernetes; and a variety of AWS tools and services.
  • Develop and enhance LLMs using AI/ML skills to enable self-service for developers and other teams that require infrastructure information.
  • Identify opportunities to improve the resiliency, availability, security, and performance of platforms in the Public Cloud using JPMC best practices.
  • Improve reliability and quality, and reduce the time to resolve production incidents on software applications.
  • Implement continuous process improvement, including but not limited to policy, procedures, and production monitoring, and reduce time to resolve.
  • Identify, coordinate, and implement initiatives/projects and activities that create efficiencies and optimize technical processing.
  • Provide primary operational support and engineering for the public cloud platform. Lead the response to any production issue, coordinate the relevant teams toward a fix, and ensure minimal customer impact.
  • Promote work streams to ensure applications meet strict operational readiness requirements for Public Cloud onboarding.
  • Monitor metrics and program health, anticipate and clear blockers, manage escalations.

Required qualifications

  • Formal training or certification on Infrastructure concepts and 5+ years applied experience
  • A strong understanding of business technology drivers and their impact on architecture design, performance, monitoring, and best practices
  • A dynamic individual with excellent communication skills, who can adapt verbiage and style to the audience at hand and deliver critical information in a clear and concise message.
  • The candidate must be a strong analytical thinker, with business acumen and the ability to assimilate information quickly, with a solution-based focus on incident and problem management.
  • 10+ years’ experience across the SDLC process – Design and/or Development and/or support
  • 5+ years’ experience/knowledge building or supporting environments on AWS using Terraform
  • Experience using DevOps tools in a cloud environment, such as Ansible, Artifactory, Docker, GitHub, Jenkins
  • Experience/Knowledge using monitoring solutions like CloudWatch, Prometheus, Datadog
  • Experience/Knowledge of writing Infrastructure-as-Code (IaC), using tools like CloudFormation or Terraform
  • Experience with one or more public cloud platforms like AWS, GCP, Azure

Preferred qualifications, capabilities and skills

  • Experience with one or more AI/ML languages such as Python or Scala, and experience developing and training LLMs.
  • Ability to leverage Splunk and Dynatrace to identify and troubleshoot issues.
  • Experience with high-volume, mission-critical applications, and with building on messaging and/or event-driven architectures.
  • Knowledge of container platforms such as Docker and Kubernetes.
  • Strong understanding of architecture, design, and business processes
  • Keen understanding of financial and budget management, control and optimization of Public Cloud expenses
  • Strong communication and collaboration skills