Your Role and Responsibilities
- As a DevOps engineer, work on setting up company-wide infrastructure and ensuring its availability, reliability, security, and compliance.
- Manage the full lifecycle of code changes from development to deployment, including code reviews, testing and operations.
- Use various open-source tools to instrument, monitor, alert on, and debug production services.
- Participate in on-call rotations along with the other members of the team to maintain the uptime of our services out of hours.
- Participate in architectural discussions and help drive the future of our services as we continually strive to provide best-in-class experiences for IBM development teams.
- Stay abreast of the latest technologies and methodologies, with a strong focus on cloud infrastructure (VPC, Kubernetes/OpenShift, Istio, Akamai, etc.).
- Provide exceptional end-user support, collaborating with development teams to understand their needs and optimize their build processes.
Required Technical and Professional Expertise
- Minimum of 3 years of experience in a DevOps role in a cloud environment.
- Familiarity with continuous integration & continuous deployment tools and strategies.
- Experience with Infrastructure as Code (Terraform, Ansible).
- Demonstrated experience working with Cloud infrastructure (VPC, Kubernetes/OpenShift, Istio, Akamai).
- Proficiency in at least one high-level programming language, such as Python, JavaScript, Java, or Go.
- Desire to design, build, and maintain highly efficient, secure and scalable software build systems.
- Excellent analytical and problem-solving abilities, with a keen attention to detail.
- Demonstrated analytical skills and data structures/algorithms fundamentals.
- Demonstrated verbal and written communication skills.
- Demonstrated skill in troubleshooting, debugging, maintaining, and improving existing software.
- Experience working with any version control system (Git preferred).
Preferred Technical and Professional Expertise
- Knowledge of IBM Cloud services and management, with an emphasis on deploying and scaling applications in a cloud environment
- Experience using cloud-native systems for monitoring and alerting (e.g. Prometheus, Grafana, Elasticsearch)
- Solid foundation in Linux internals, including system calls, file systems, and kernel debugging
- Understanding of endpoint security best practices (TLS, HTTPS, TLS Certificates, WAF, Network Traffic Analysis)
- Understanding of networking principles and protocols
- Knowledge of security testing techniques to identify vulnerabilities in software and infrastructure.
- Genuine interest in building efficient and highly performant software
- Ability to quickly learn new technologies and adapt to changing technical environments