Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

JPMorgan Lead Site Reliability Engineer - Payments Technology 
India, Karnataka, Bengaluru 
703300635

15.04.2025

Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability.

Job responsibilities

  • Demonstrates and champions site reliability culture and practices and exerts technical influence throughout your team
  • Leads initiatives to improve the reliability and stability of your team’s applications and platforms using data-driven analytics to improve service levels
  • Collaborates with team members to identify comprehensive service level indicators and stakeholders to establish reasonable service level objectives and error budgets with customers
  • Demonstrates a high level of technical expertise within one or more technical domains and proactively identifies and solves technology-related bottlenecks in your areas of expertise
  • Acts as the main point of contact during major incidents for your application and demonstrates the skills to identify and solve issues quickly to avoid financial losses
  • Performs a role of an Application owner for a critical payments application with an ability to drive multiple Technology initiatives
  • Educates Product Owners, Dev Leads, and Developers on service level objectives, service level indicators, and error budget policies
  • Documents and shares knowledge within your organization via internal forums and communities of practice
  • Leads infrastructure planning, data center migrations and partner application engagements for cross impacting initiatives
  • Engages the application or platform stakeholders in establishing reasonable service level objectives and error budgets with your customers and continue to monitor/review them at regular frequency

Required qualifications, capabilities, and skills

  • Formal training or certification onsoftware engineering*concepts and 5+ years applied experience
  • Deep proficiency in reliability, scalability, performance, security, enterprise system architecture, toil reduction, and other site reliability best practices with the ability to implement these practices within an application or platform
  • Experience in the Application Support domain with excellent communication skills, with the ability to tailor communications for different audiences, ranging from senior business to junior technology staff.
  • Good understanding of file transfer solutions, SSL certificates, load balancers, job scheduling, application monitoring, backup and capacity management
  • Deep knowledge of software applications and technical processes with emerging depth in one or more technical disciplines
  • Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc.
  • Proficiency in software applications and technical processes within a technical discipline (e.g., cloud, artificial intelligence, machine learning, mobile, etc.)
  • Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.)
  • Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker, etc.)
  • Ability to identify and solve problems related to complex data structures and algorithms
Preferred qualifications, capabilities, and skills
  • Ability to code and demonstrate data fluency
  • Certified Kubernetes knowledge (e.g. CKAD)
  • Certified public cloud technology knowledge (e.g. AWS)