Job responsibilities
- Utilize technical expertise and problem-solving skills to facilitate and accelerate the adoption of strategic technical programs.
- Lead projects or workstreams focused on infrastructure engineering practices, including automation, capacity planning, resiliency, performance optimization, and security enhancements.
- Collaborate with internal service providers to develop solutions that reduce manual effort and support modern technology stacks.
- Develop innovative solutions for the design, development, and troubleshooting of moderately complex technical issues.
- Consider the impact on upstream and downstream data and systems, providing guidance on mitigation strategies.
- Contribute to a team culture that values diversity, equity, inclusion, and respect.
Required qualifications, capabilities, and skills
- Formal training or certification on infrastructure engineering concepts and 3+ years applied experience
- Extensive expertise in one or more areas of infrastructure engineering, such as hardware, networking, databases, storage, deployment practices, integration, automation, scaling, resilience, performance, or security assessments.
- In-depth knowledge of a specific infrastructure technology and proficiency in scripting languages (e.g., Python, Ansible, Terraform).
- Commitment to expanding technical and cross-functional knowledge beyond the product scope.
- Strong understanding of cloud infrastructure and multiple cloud technologies, with the ability to operate in and migrate across both public and private clouds.
- Proficient in multiple infrastructure technologies with the operational skills to identify and resolve complex technical issues. Capable of clearly articulating solutions to a technical audience. Experience in capacity management, resiliency, and business continuity planning and execution.
- Solid knowledge of tools such as Splunk, Prometheus, and Grafana.
Preferred qualifications, capabilities, and skills
- Experience working in financial institutions, with a background in DevOps and Site Reliability Engineering.
- Proven ability to manage multiple service improvement programs and effectively prioritize workloads.
- Familiarity with multi-tiered application architecture and a successful track record in developing and implementing IT strategies and plans.