Job responsibilities
- Design and document fault injection experiments using AWS Fault Injection Service (AWS FIS).
- Develop Terraform modules for various AWS services, including AWS FIS.
- Drive, support, and deliver on a strategy to build broad use of Amazon's utility computing web services (e.g., AWS EC2, AWS S3, AWS RDS, AWS CloudFront, AWS EFS, CloudWatch, EKS).
- Identify opportunities to improve the resiliency, availability, security, and performance of platforms in the public cloud using JPMC best practices.
- Execute software solutions, design, development, and technical troubleshooting, with the ability to think beyond routine or conventional approaches to build solutions or break down technical problems.
- Produce architecture and design artifacts for complex applications while being accountable for ensuring that design constraints are met by software code development.
- Proactively identify hidden problems and patterns in data and use these insights to drive improvements to coding hygiene and system architecture.
Required qualifications, capabilities, and skills
- Formal training or certification in AWS applications and in resiliency, scalability, observability, monitoring, etc., with at least 5 years of experience.
- Experience in provisioning AWS infrastructure through Terraform.
- Ability to develop and run CI/CD pipelines.
- Experience as an SRE on complex, mission-critical applications involving a multitude of components of varying technical generations.
- Advanced knowledge and experience in observability, monitoring, alerting, and telemetry collection using tools such as CloudWatch, Grafana, Prometheus, Splunk, etc.
- Ability to proactively identify hidden problems and patterns in data and use these insights to drive improvements to system architecture.
- Fluency in at least one programming language (e.g., Python, shell scripting, Windows PowerShell).
Preferred qualifications, capabilities, and skills
- Familiarity with modern front-end technologies
- Exposure to cloud technologies