Lead and mentor a proficient team of Data Center, DevOps, system, and network engineers, serving as their HR manager , Tech Lead and Squad Lead.
Lead Team agile planning and drive the team for execution
Execute the infrastructure roadmap both in CP managed DCs and cloud-based ones, ensuring alignment with business objectives and strategies.
Oversee the design, deployment, and maintenance of critical infrastructure components, including networking, virtualization, storage, and servers in Kubernetes / other containerized solutions.
Lead the implementation and maintenance of a robust security measures to safeguard Data Center assets and ensure compliance with industry standards.
Conduct routine maintenance, inspections, and repairs of Data Center equipment to uphold operational efficiency.
Make sure we monitor system performance, identify bottlenecks, and proactively implement solutions to enhance system reliability and integrity.
Foster a positive work environment that promotes growth, collaboration, and innovation among team members.
Qualifications
Minimum of 5 years of proven experience as a Data Center/DevOps Engineer or in a similar role.
Minimum of 3 years of demonstrated technical leadership or team leader experience in infrastructure (cloud & physical) focused roles.
Exceptional interpersonal skills and proficiency in communicating effectively with diverse stakeholders.
Hands-on experience with Agile methodologies, serving in roles such as Scrum lead, product owner, or other agile capacities.
Profound experience with virtualization and containerization technologies, including Proxmox, VMWare, and Kubernetes.
Robust experience in network and system infrastructure management, with a focus on Linux system administration.
Extensive familiarity with networking protocols such as TCP/IP, and VLANs, as well as proficiency in VPN and firewall configuration.
Strong understanding of security best practices and adeptness in implementing system resilience techniques.
Mastery of high-availability concepts and implementation strategies, encompassing redundancy, failover, and load-balancing technologies.
Minimum of 3 years of hands-on experience with Bash, Python, or similar languages for task automation.
Experience with observability and monitoring systems such as SNMP, DataDog, Prometheus, and Grafana.