– Application Build and Deployment Processes (git*, automation pipelines, Infrastructure as code, etc.)
– Automated Application Delivery (load balancers, container orchestration, service mesh, High Availability architectures, Frontend and Backend technologies including databases, etc.)
– Service Operation (define, instrument, measure, and manage service level objectives; experience with observability tooling including logging infrastructure, time series metrics databases, tracing systems, alert definitions, etc.)
– Incident management (service restoration, root cause analysis, postmortem authorship, defining roles and responsibilities, etc.)
– Security awareness and competencies, including security as code.
– Configuration management

OBSERVABILITY
– Explores beyond the obvious to ensure Service Level Objectives (SLOs) are met.
– Understands and measures system behaviors to quickly and efficiently diagnose, identify, and address needs.
– Proactively tests, automates, monitors outputs, and leverages signals to infer services and needs.
– Data management to explore properties, patterns, and distributed tracing

SOLUTIONIST
– Constantly seeking ways to improve systems, making them more efficient and reducing toil.
– Understands the difference between short-term strategic and long-term fixes
– Simplifies decisions and judgments by recognizing what to pay attention to and what to ignore; a proficient problem solver. Tenacious and resourceful with an inherent predisposition toward action; unafraid to try something new in the name of innovation.
Responsibilities
1. Serve as SRE Lead for one or more projects from conception to implementation, defining needs, establishing strategy, and managing development.
2. Coordinate with development and platform teams to design and implement zero-downtime deployment approaches; real-time logging, alerting, and monitoring solutions; and code instrumentation. Work with solution architects to design highly available solutions that meet objectives, automating solutions whenever possible.
3. Develop solutions to identify reliability risks, monitor application growth, and triage production issues.
4. Actively engage with internal engineering teams to develop tooling and frameworks that drive full observability and automation of the environment.
5. Provide leadership on the services and tools that form the foundation of an intuitive and engaging developer experience, increasing developer productivity, product quality, and overall system performance.
The Job Description is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change.
The annual U.S. base pay range for this position is: $151,477.00 - $227,215.00