Expoint - all jobs in one place

Cisco Site Reliability Engineer 
China, Liaoning, Dalian 
120655335

27.01.2025

Position Overview:

As a Senior Site Reliability Engineer (SRE), you will assume a leadership role in ensuring the reliability, scalability, and performance of our company's software systems and infrastructure. You will be responsible for driving the evolution of SRE practices and collaborating closely with engineering teams to architect and implement highly available and resilient systems. The role requires a deep understanding of software development, system design, and operations, as well as the ability to mentor and guide junior SRE team members.

Responsibilities:

Monitoring and Alerting: Oversee the implementation and maintenance of robust monitoring and alerting systems. Ensure timely response to alerts and lead ongoing efforts to improve the monitoring framework.

Continuous Integration and Continuous Deployment (CI/CD): Enhance the CI/CD pipeline to enable seamless and reliable deployments. Foster a culture of continuous improvement in the deployment process.

Requirements:

  • Bachelor's degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience).
  • Substantial experience as a Site Reliability Engineer or in a similar role, with demonstrated progression in responsibility and leadership.
  • Expertise in software development and proficiency in multiple programming languages (e.g., Python, Go, Java).
  • In-depth knowledge of cloud platforms (e.g., AWS, Google Cloud, Azure) and container and orchestration technologies (e.g., Docker, Kubernetes).
  • Strong understanding of system architecture, distributed systems, and networking principles.
  • Experience with monitoring and logging tools such as Prometheus, Grafana, Datadog, and ThousandEyes.
  • Proven track record of driving automation initiatives and using infrastructure-as-code tools (e.g., Terraform, Ansible).
  • Excellent problem-solving and critical-thinking skills, with a focus on root cause analysis.
  • Ability to lead and mentor technical teams, fostering a collaborative and innovative environment.