Expoint - all jobs in one place

Wells Fargo Lead Site Reliability Engineer 
United States, North Carolina, Charlotte 
155022580

10.04.2025

About this role:

Site Reliability Engineer


In this role, you will:

  • Work alongside developers and business stakeholders to automate acceptance criteria
  • Maintain high reliability and availability for software applications
  • Automate mundane tasks to reduce human error
  • Define service level indicators (SLIs) and service level objectives (SLOs) in collaboration with product owners
  • Lead incident response efforts and post-mortem analysis to prevent future occurrences
  • Write incident root cause analyses that identify the underlying cause of each issue and how to prevent it from recurring
  • Document procedures, best practices, and troubleshooting FAQs
  • Debug the system and fix production-related issues
  • Escalate and follow up on permanent fixes for development-related issues
  • Handle complex operational tasks and recommend process and technology changes
  • Provide global support, including troubleshooting production-related issues and performing checkouts


Required Qualifications:

  • 5+ years of Technology Infrastructure Engineering and Solutions experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 5+ years of Site Reliability Engineering experience or related experience


Desired Qualifications:

  • Strong understanding of REST APIs
  • Strong working knowledge of troubleshooting tools such as Splunk, AppDynamics, and Elastic APM
  • Strong experience with API management tools such as Apigee
  • Working knowledge of databases such as MongoDB and Oracle
  • Strong foundation in reliability engineering principles and distributed systems behavior
  • Experience defining and implementing SLOs/SLIs and using them to drive system improvements
  • Demonstrated ability to design and implement observability solutions that provide actionable insights while minimizing alert fatigue
  • Understanding of modern observability practices and experience implementing and maintaining monitoring solutions such as Prometheus/Grafana, Splunk, New Relic, CloudWatch, and ELK in the cloud
  • Strong incident response skills with experience leading incident retrospectives and driving improvements
  • Excellent problem-solving abilities and experience debugging distributed systems
  • Track record of successfully automating operations and reducing toil
  • Strong communication skills with ability to explain complex technical concepts to diverse audiences
  • Ability to work both independently and collaboratively in an energetic and diverse team environment


Job Expectations:

  • Ability to work weekends
  • Participate in on-call rotations to ensure 24/7 system availability and support

24 Apr 2025


Wells Fargo Recruitment and Hiring Requirements:

b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.