Write, modify, run terraform from an existing codebase to deploy and maintain infrastructure across multiple cloud service providers. Be able to debug errors when deploying terraform
Run ansible playbooks to manage customer infrastructure. Be able to modify and troubleshoot ansible as needed as errors occur
Use GitLab with multiple repositories to maintain customer infrastructure and create merge requests for changes to customer infrastructure.
Configure, build, and deploy containerized services using Docker and/or Kubernetes
Access traffic flow data between customer and hosted environments to troubleshoot connectivity issues
Produce and maintain technical documentation in regard to network and system design and governance.
Develop standard operating procedures, knowledge base articles, technical bulletins, and other documents in support of the infrastructure.
Operate in a security-first mindset, performing all other responsibilities with security in mind
Implement monitoring, config management, and logging capabilities to manage a multiple tenant cloud infrastructure across multiple cloud service providers.
Use generative AI elements to increase efficiency and speed, improve accuracy and consistency, enhance security, and better manage resources where practical and within security boundary guidelines
KNOWLEDGE AND SKILLS
Knowledge of AWS foundational technologies (EC2, S3, IAM, Route53, VPC)
Experience with Unix / Linux operating system internals and administration (e.g., filesystems, inodes, system calls, hardening) and networking (e.g., TCP / IP, routing, DNS, network topologies, SDN).
Preferred qualifications:
Expertise in designing, analyzing and troubleshooting large-scale distributed systems.
Ability to debug and optimize code and automate routine tasks.
Systematic problem-solving approach coupled with strong communication skills and a sense of ownership and drive.