Drive, support, and deliver on a strategy to operate on a build broad use of Amazon's utility computing web services (e.g., AWS EC2, AWS S3, AWS RDS, AWS CloudFront, AWS EFS, CloudWatch, EKS)
Analyze upcoming platform level changes into production ensure communication of relevant impact.
Identify opportunities to improve resiliency, availability, secure, high performing platforms in Public Cloud using JPMC best practices
Improve reliability, quality, and reduce to time to resolve issues in production incidents on software applications in prod
Implement continuous process improvement, including but not limited to policy, procedures, and production monitoring
Identify, coordinate, and implement initiatives/projects and activities that create efficiencies and optimize technical processing
Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
Provide primary operational support and engineering for the public cloud platform. Show leadership for any production issue and manage all the corresponding team in working towards fix and also should ensure minimal customer impact
Debug and optimize systems and automate routine tasks.
Collaborate with a cross-functional team to identify potential risks in production and opportunities to improve user experiences at every interaction.
Drive work streams to ensure Applications meet strict non-functional requirements for Public Cloud On-boarding
Required qualifications, capabilities, and skills
Formal training or certification on software engineering concepts and 5+ years applied experience
Very good understanding of business technology drivers and their impact on architecture design, performance and monitoring, best practices
A dynamic individual with excellent communication skills, who can adapt verbiage and style to the audience at hand and deliver critical information in a clear and concise message.
The candidate must be an analytical thinker, with business acumen and the ability to assimilate information quickly, with a solution based focus on incident and problem management.
Good exposure to SDLC process
Proven experience building or supporting environments on AWS, which includes working with services like EC2, ELB, RDS, and S3
Experience using DevOps tools in a cloud environment, such as Ansible, Artifactory, Docker, GitHub, Jenkins, Kubernetes, Maven, and Sonar Qube
Experience using monitoring solutions like CloudWatch, Prometheus, Datadog
Experience of writing Infrastructure-as-Code (IaC), using tools like CloudFormation or Terraform
Experience with one or more public cloud platforms like AWS, GCP, Azure
Experience with high volume, mission critical applications and their interdependencies with other applications and databases
Preferred qualifications, capabilities, and skills
Ability to leverage Splunk and Dynatrace to identify and troubleshoot issues.
Experience of ITIL process such as incident, problem, and life cycle management
Knowledge of container platforms such as Docker and Kubernetes.
Strong understanding of architecture, design, and business processes
Experience in working in in large, collaborative teams to achieve organizational goals
Passionate about building an innovative culture
Experience with production/non-production support of highly available applications
Experience with system performance monitoring and operational capacity management