What You’ll DoDrive improvements related to the Monitoring, Alerting, and overall support for HX including a high-level understanding of the Calling DI applications and services that run on the HX.
Work cross-functionally with Calling DI Tier 1/2/4 teams related to projects, initiatives, on-call, and support requirements that need Tier 3
Explore possible automation areas to improve monitoring, simplify troubleshooting and aid with accelerated remediation related to Tier 3
Who You’ll Work With
You will be a member of the Webex Infrastructure Engineering (WxIE) Systems Operations (SysOps) team responsible for Tier 3 support of the commercial Calling DI HX infrastructure.
Who You AreThis is a technical role that requires a strong background in Cisco UCS server operations, capability to address complex problems concerning server operations, and the ability to work with multiple teams located in different time zones.
You will contribute to the team in the following ways:
- Work closely with other Tier 3 team members, and other Calling DI teams located around the world.
- Apply SRE/DevOps principles to effectively operate Calling DI’s globally scaled, highly available and stable cloud service
- Follow and enhance standard processes and procedures based on ITIL principles.
- You will work closely with technical leads for day-to-day service operation, planning, monitoring, upgrades, incident management and team and/or customer escalations, etc.
- Use your analytical skills to identify issues, propose solutions and deliver successful resolutions
- Participate in on-call and pager duty rotation to provide excellent service to our Calling DI team and customers.
- Review logging, metrics, and alerts (LMA) for errors, identify and file Cisco TAC tickets, propose design tweaks, and much more!
Experience required:- A minimum of 3 - 5 years working with any following technologies: VMWare, Cisco Unified Computing System (UCS), Cisco Hyperflex or other Hyperconverged platforms (like Nutanix)
- Experience with IP networking, TCP, UDP, TLS, X.509 Certificates, etc
- Experience with Linux to include shell commands, iptables, networking, performance tuning, etc
- Experience with SAN and/or Hyperconverged Infrastructure storage systems. Does not need to be as a Storage Admin, but you should have exposure to and understand the differences between these storage constructs.
Desirable skills:- At least 1 year of Python exposure (not necessarily as a Python Developer, scripting is good enough)
- 3+ years of Linux experience, including proficient Bash scripting skills
- OpenStack and KVM experience
- You're an Engineer at heart, passionate about solving puzzles and mindful of keeping our internal teams and customers happy.
But “Digital Transformation” is an empty buzz phrase without a culture that allows for innovation, creativity, and yes, even failure (if you learn from it.)