Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

JPMorgan Site Reliability Engineer 
United States, California, Palo Alto 
191771376

Yesterday

DESCRIPTION:

Duties: Design, build and operate large-scale production systems. Debug complex problems across the whole stack. Develop tools for application engineering teams based on operations requirements for micro services. Improve alerting and monitoring for the existing services. Assist with onboarding and mentoring new engineers. Collaborate with the application development team on best practices to define Service level Agreement and Service Level Objective. Work on Google cloud Infrastructure and support cloud services and application. Deployment, automation, and management of AWS production systems. System troubleshooting and problem resolution across multiple production applications. Provide recommendations for architecture and process improvements. Configure SLO (service level objective)/SLA (Service level Agreement) for the applications running on AWS. Perform Devops on cloud platform. Reduce the toil and incident in production for wepay site. Debug and troubleshoot production incidents. Automation of repetitive work to avoid manual overhead. Setup SLA for micro services to detect issues faster.

QUALIFICATIONS:

Minimum education and experience required: Master’s degree in Computer Science, Computer Engineering, Electrical Engineering or related field of study plus 4 years of experience in the job offered or as Site Reliability Engineer, Software Engineer, Infrastructure Engineer, or related occupation. The employer will alternatively accept a Bachelor’s degree in Computer Science, Computer Engineering, Electrical Engineering or related field of study plus 7 years of experience in the job offered or as Site Reliability Engineer, Software Engineer, Infrastructure Engineer, or related occupation.

Skills Required: Requires experience in the following: Linux administration; Terraform; Amazon Web Services; Python; Bash; Apache airflow; Setting up disaster recovery plan; Multi-tier architectures including load balancers, caching, webservers and application servers; and CI/CD pipelines.

Full-Time. Salary: $235,000 - $235,000 per year.