Expoint - all jobs in one place

JPMorgan Lead Site Reliability Engineer 
United States, Ohio, Columbus 
571604933

07.09.2024

Take on a critical role in shaping the future of a globally recognized firm, with a direct and significant impact in a space designed for top achievers in site reliability.

Job responsibilities

  • Collaborate with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
  • Act as a key contributor with other software engineers and teams to design, develop, test, and implement availability, reliability, scalability, and self-healing solutions in their applications, ensuring minimal refactoring or changes
  • Implement infrastructure, configuration, and network as code for the applications and platforms in your remit
  • Collaborate with technical experts, key stakeholders, and team members to resolve complex problems
  • Understand service level indicators and use service level objectives to proactively resolve issues before they impact customers
  • Be part of the 24x7 support coverage, as needed, and lead coverage during incidents
  • Engage with the Technology Controls organization to ensure tooling and ecosystem meet the Firm’s rigorous cyber policies
  • Coach team members, encourage acquisition of new skills, and be directly accountable for specific software solution outcomes

Required qualifications, capabilities, and skills

  • Formal training or certification in software engineering concepts and 5+ years of applied experience
  • Minimum of 8 years of hands-on experience in large-scale software development
  • Experience in an SRE/DevOps role supporting highly available production systems in AWM cloud/private cloud
  • Experience defining SRE standards and supporting their implementation and adoption across alerting/monitoring setup, observability, service level objectives, incident management, and problem management
  • Proficiency in modern development processes and automation tools, and in one or more general-purpose programming languages, including Java, C#, Python, C/C++, or Node.js; web development (HTML5, JavaScript, CSS); API web services; SQL
  • Hands-on experience with Git, Bitbucket, Jenkins, Sonar, Splunk, Maven, and continuous integration/deployment (CI/CD) tools; cloud and containerization (AWS, Kubernetes); Unix (Linux and Solaris); relational SQL and NoSQL databases; and messaging technologies such as Kafka and MSK
  • Knowledge of networking concepts such as load balancing, IP, and DNS
  • Excellent communication, debugging, and troubleshooting skills
  • Ability to collaborate with high-performing teams and individuals throughout the firm to accomplish common goals and effectively prioritize tasks in a highly dynamic work environment that includes globally positioned resources

Preferred qualifications, capabilities, and skills

  • Experience with infrastructure components used in data warehousing or big data environments
  • Banking experience preferred