Job responsibilities
- Applies technical knowledge and problem-solving methodologies to projects of moderate scope, with a focus on improving the data and systems running at scale, and ensures end to end monitoring of applications
- Resolves most nuances and determines appropriate escalation path
- Executes conventional approaches to build or break down technical problems
- Partners with application and infrastructure teams to identify potential risks and govern remediation statuses
- Considers upstream/downstream data and systems or technical implications
- Adds to team culture of diversity, equity, inclusion, and respect
Required qualifications, capabilities, and skills
- Formal training or certification on software engineering concepts and 5+ years applied experience
- A deep understanding of business technology drivers and their impact on architecture design, performance and monitoring, best practices
- experience/knowledge building or supporting web environments on AWS, which includes working with services like EC2, ELB, RDS, and S3
- Experience using DevOps tools in a cloud environment, such as Ansible, Artifactory, Docker, GitHub, Jenkins, Kubernetes, Maven, and Sonar Qube
- experience across the SDLC process – Design and/or Development and/or support
- Experience/Knowledge using monitoring solutions like CloudWatch, Prometheus, Datadog
- Experience/Knowledge of writing Infrastructure-as-Code (IaC), using tools like CloudFormation or Terraform
- Experience with one or more public cloud platforms like AWS, GCP, Azure
- Experience with one or more automation tools like Terraform, Puppet, Ansible
- Strong knowledge of one or more infrastructure disciplines such as hardware, networking terminology, databases, storage engineering, deployment practices, integration, automation, scaling, resilience, and performance assessments
- Strong knowledge of one or more scripting languages (e.g., Scripting, Python, etc.)
Preferred qualifications, capabilities, and skills
- A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
- SRE mindset Culture/Approaches: To run better production systems by creating engineering solutions to operational problems.