IBM Site Reliability Engineering Professional 
United States, California, San Jose 
612367693

08.05.2024

Your Role and Responsibilities

Ideally, you’ll bring experience with:

  • Configuration management and infrastructure as code (Terraform and Ansible preferred)
  • Collaborating with product development engineers to identify, implement and report on service level indicators and objectives
  • Software development and scripting (Go preferred)
  • Deploying and troubleshooting complex, global production systems
  • Multiple hosting models (managed, colocation, and AWS/multi-cloud preferred)
  • Admin-level Linux skills


Required Technical and Professional Expertise

  • 3+ years of hands-on experience creating SaaS applications, working in production operations at a company whose primary products are SaaS applications
  • A minimum of 5 to 7 years of hands-on experience in global production system deployment, administration and troubleshooting
  • Proven experience in systems performance analysis and debugging in a Linux environment
  • Experience in software development and scripting: Bash and Python are required (Go preferred)
  • Experience in automation is required
  • 2+ years of experience with provisioning and configuration management systems (Terraform, Ansible) across multiple cloud providers
  • 2+ years of experience with observability and alerting systems such as Splunk, ELK, OpenTelemetry or similar
  • 2+ years of experience working with different cloud providers such as IBM Cloud, AWS, Azure and GCP
  • 3+ years of experience with operating systems running on Kubernetes / OpenShift platforms
  • Experience with PostgreSQL database administration and Kafka (or similar)
  • Collaborating with product development engineers to identify, implement and report on service level indicators and objectives
  • Willingness to participate in an on-call rotation


Preferred Technical and Professional Expertise

Experience with the following would be an asset:

  • Working with continuous integration and delivery systems such as Jenkins
  • Containerized applications
  • Remote bare-metal hardware provisioning (PXE boot, working with remote hands)