What you’ll be doing:
Lead and coordinate the planning and build of complex clusters and supercomputers across multiple data centers and labs
Manage for rack-and-stack, cabling, and space optimization efforts to ensure efficiency, maintainability, and standard processes
Lead all aspects of power and cooling efficiency strategies while ensuring optimal rack space utilization
Coordinate daily functions and maintenance of data facilities and test environments, ensuring seamless operations and timely problem resolution
Installation and integration of diverse infrastructure and solutions including Cloud, VMs, Storage, Network, HPC, and AI
Manage debugging activities — network, optical cabling, bare metal, and operating systems
Collaborate closely with Research & Development teams to support evolving project needs and experimental setups
Mentor and develop team members, ensuring knowledge sharing, standard methodologies, and professional growth
MCSE or MCITP / CCNA certification
3+ years of experience as a team lead in large and complex data center environments, overall experience of 8+ years
Demonstrated practical experience in operating systems with strong problem identification and resolution skills
In-depth knowledge of Linux & Windows Core Services: DHCP, DNS, NIS, AD, etc.
Strong leadership skills with ability to organize, prioritize, and guide a team
Passionate about delivering excellent service with strong collaboration and interpersonal skills
Hands-on with Python and configuration management tools (e.g., Ansible, Puppet)
Experience with CI tools and job schedulers (e.g., Jenkins, SLURM)
Knowledge of virtualization technologies: KVM, VMware, Hyper-V
Experience with storage solutions like Netapp, Lustre, GPFS, ZFS
Skilled in L2 & L3 network protocols and resolving technical issues
משרות נוספות שיכולות לעניין אותך