Job responsibilities:
- Applies technical knowledge and problem-solving methodologies to projects of moderate scope, with a focus on improving the data and systems running at scale, and ensures end-to-end monitoring of applications
- Resolves most nuances and determines appropriate escalation path
- Executes conventional approaches to build or break down technical problems
- Drives the daily activities supporting the standard capacity process applications
- Partners with application and infrastructure teams to identify potential capacity risks and govern remediation statuses
- Considers upstream/downstream data and systems or technical implications
- Makes significant decisions for a project consisting of multiple technologies and applications
- Adds to team culture of diversity, equity, inclusion, and respect
Required qualifications, capabilities, and skills:
- Formal training or certification on infrastructure engineering concepts and 5+ years applied experience
- Demonstrated experience managing or operating applications in public cloud at scale, with a strong emphasis on AWS, and proven experience with Unix, Linux, or Windows
- Strong knowledge of one or more infrastructure disciplines such as hardware, networking terminology, databases, storage engineering, deployment practices, integration, automation, scaling, resilience, and performance assessments
- Experience with monitoring and observability tooling such as CloudWatch, Dynatrace, and Datadog
- Experience with chaos engineering concepts and tooling
- Strong knowledge of one or more scripting languages (e.g., Python)
- Experience with multiple cloud technologies with the ability to operate in and migrate across public and private clouds
- Ability to leverage infrastructure engineering knowledge of additional domains, data fluency, and automation knowledge, including experience with automation tools such as Terraform
Preferred qualifications, capabilities, and skills:
- AWS Associate level certification in Developer, Solutions Architect or DevOps
- Expert in one or more programming languages, preferably Java
- Experience managing and operating large-scale, mission-critical applications
- Experience with logging systems including Splunk or ELK
- Experience developing processes, tooling, and methods to help improve operational maturity
- Experience working in an agile environment using tools such as Jira and Confluence