Job responsibilities
- Applies technical expertise and problem-solving methodologies to projects of moderate scope
- Oversee IT resource utilization, conducting regular analyses to identify and address any underutilization of resources, ensuring optimal efficiency.
- Govern capacity plans and predict future requirements using data and growth projections to ensure scalability and reliability.
- Generate regular capacity status reports for the leadership team, evaluating the impact of changes on capacity and performance.
- Ensure capacity management practices comply with control procedures and standards, collaborating with application teams to optimize resources effectively.
- Assess and enhance capacity management processes and tools, staying updated with the latest trends and best practices in the field.
- Drives a workstream or project consisting of one or more infrastructure engineering technologies
- Adds to team culture of diversity, equity, inclusion, and respect
Required qualifications, capabilities, and skills
- Formal training or certification on infrastructure engineering concepts and 5+ years applied experience.
- Deep knowledge of one or more areas of infrastructure engineering such as hardware, networking terminology, databases, storage engineering, deployment practices, integration, automation, scaling, resilience, or performance assessments
- Deep knowledge of one specific infrastructure technology and scripting languages (e.g., Scripting, Python, etc.)
- Excellent interpersonal skills with the ability to maintain strong engagement with internal collaborators and stakeholders.
- Demonstrated proficiency in analyzing performance data to make informed decisions, with the ability to interpret and apply information, data, and trends effectively.
- Capability to assess and mitigate capacity-related risks with keen commercial and financial awareness, understanding the impact on business and internal stakeholders.
- Proven experience in IT capacity management, particularly with critical applications and on-premises infrastructure.
- Experience with Hypervisor, Kubernetes, Databases, and BMC TrueSight Capacity Optimization, along with a solid grasp of compute, memory, and storage concepts.
- Good understanding of application and infrastructure KPIs
- Manage the development and maintenance of capacity management reports and dashboards that provide insights into capacity utilization and performance.
Preferred qualifications, capabilities, and skills
- Experience with On-prem and AWS capacity management.
- Have worked with Dynatrace, APP D, Datadog, or similar APM tool along experience in Grafana.
- Experience in building capacity/forecasting models and plans, undertaking complex analysis to generate actionable insights.
- Robust IT systems knowledge and skills, including advanced Excel capabilities, with a quick aptitude for learning new software.
- Understanding or working experience with APIs is a plus, showcasing your adaptability and technical versatility and can code using Python.