Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

JPMorgan Site Reliability Engineer III 
India, Telangana, Hyderabad 
609769420

31.07.2024

As a Site Reliability Engineer III at JPMorgan Chase within the CCB , you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.

Job responsibilities

  • Executes small to medium projects independently with initial direction and eventually graduates to designing and delivering projects by yourself
  • Leverages technology to solve business problems by writing high quality, maintainable, and robust code following best practices in software engineering
  • Participates in triaging, examining, diagnosing, and resolving incidents and work with others to solve problems at their root
  • Recognizes the toil within your role and proactively works towards eliminating it through either systems engineering or updating application code
  • Understands observability patterns and strives to implement and improve service level indicators, objectives monitoring, and alerting solutions for optimal transparency and analysis
  • Define and implement SLOs, SLIs and error budgets to ensure stability and performance
  • Define, create and track application analytics in support of SLOs
  • Design, code, test and deliver software solutions to improve our existing systems
  • Participate feature delivery ensuring reliability, scalability, resiliency, and operational stability from the outset

Required qualifications, capabilities, and skills

  • Formal training, or certification on software engineering concepts and 3+ years applied experience
  • Experience of defining and implementing Service level objectives and indicators
  • Experience maintaining a Cloud-base infrastructure
  • Familiar with site reliability concepts, principles, and practices
  • Familiar with observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Splunk, and others
  • Emerging knowledge of software, applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.)
  • Emerging knowledge of continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform
  • Emerging knowledge of common networking technologies
  • Ability to work in a large, collaborative team and demonstrates the willingness to vocalize ideas with peers and managers
  • Understanding of how to prioritize and adjust work plans to adapt to changes in assigned responsibilities and projects
  • Eagerness to participate in learning opportunities to enhance one’s effectiveness in executing day-to-day project activities
  • Ability to demonstrate and apply existing and new system processes, methodologies, and skills to contribute to the development of systems

Preferred qualifications, capabilities, and skills

  • Cloud computing (Cloud Foundry, AWS)
  • Ability to code in at least one scripting-programming language, PowerShell, Python is favorable.
  • SRE bar raiser certification is preferred.