Expoint - all jobs in one place

Where the best experts and companies meet

JPMorgan Sr Lead Site Reliability Engineer 
United Kingdom, England 
403648839

24.04.2025

Job responsibilities

  • Partner with SRE LOB teams to create a centre of excellence for Performance Testing in CIB, developing an efficient approach and defining testing templates to drive compliance, quality, and standardization across the organization.
  • Utilize performance testing and monitoring tools such as Blazemeter, JMeter, Grafana, and Prometheus to build prescriptive guidance that highlights and resolves performance bottlenecks.
  • Assist with the implementation of performance monitoring solutions to provide real-time insights into system performance.
  • Master multi-platform performance tooling and lead the architecture and engineering of performance tooling to meet the firm’s needs.
  • Contribute to the engineering community as an advocate of firmwide frameworks, tools, and practices of the Software Development Life Cycle.
  • Create high-quality designs, roadmaps, and program charters that are delivered by you or the engineers under your guidance.
  • Provide advice and mentoring to other engineers and act as a key resource for technologists seeking advice on technical and business-related issues.
  • Collaborate with others to create and implement observability and reliability designs for complex systems that are robust, stable, and do not incur additional toil or technical debt.
  • Work toward becoming an expert on the applications and platforms in your remit while understanding their interdependencies and limitations.
  • Provide comprehensive and ongoing guidance, tools, and solutions to support the firm’s growth.
  • Make significant contributions to JPMorgan Chase’s site reliability community via internal forums, communities of practice, guilds, and conferences.

Required qualifications, capabilities, and skills

  • Formal training or certification in software engineering concepts, with proficient, advanced experience.
  • Hands-on experience identifying and resolving performance issues across systems, specifically building and executing load and stress tests for application health and capacity purposes.
  • Passion for creating best practices and influencing technology change, with understanding of how application architecture and performance impact business outcomes and continuity.
  • Hands-on practical experience in system design, application development, testing, and operational stability.
  • Advanced knowledge of site reliability culture and principles, with demonstrated ability to implement site reliability within an application or platform.
  • Advanced knowledge of and experience in observability, such as white-box and black-box monitoring, service level objectives, alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc.
  • Ability to communicate data-based solutions with complex reporting and visualization methods.
  • Recognized as an active contributor to the engineering community.
  • Continues to expand their network and leads evaluation sessions with vendors to see how their offerings can fit into the firm’s strategy.
  • Ability to anticipate, identify, and troubleshoot defects found during testing.
  • Strong communication skills, with the ability to mentor and educate others on site reliability principles and practices.

Preferred qualifications, capabilities, and skills

  • Practical cloud-native experience, primarily in AWS.
  • Knowledge of industry-wide technology trends and best practices.
  • Certifications in performance testing or cloud technologies.