Expoint - all jobs in one place

BMC Lead Monitoring Systems Engineer 
Sweden 
247713870

27.06.2024
Description and Requirements

Key Responsibilities:

  • Monitoring Strategy Development:
    • Develop and implement comprehensive monitoring strategies for our infrastructure and applications.
    • Lead the design, deployment, and maintenance of monitoring solutions using TrueSight, BMC Helix Operations Management, and Prometheus.
  • System Administration:
    • Administer and maintain monitoring tools to ensure optimal performance and availability.
    • Ensure coverage of all critical systems and applications with appropriate alerting thresholds.
  • Performance Analysis and Optimization:
    • Enhance AIOps capabilities across the monitoring ecosystem to improve observability.
    • Optimize monitoring configurations to reduce false positives and improve the accuracy of alerts.
  • Leadership and Collaboration:
    • Lead a team of monitoring administrators, providing guidance, mentorship, and training.
    • Collaborate with cross-functional teams to integrate monitoring solutions with existing IT and development workflows.
  • Documentation and Reporting:
    • Develop and maintain comprehensive documentation for monitoring configurations, processes, and procedures.
    • Generate and present regular reports on system performance, incidents, and resolutions to senior management.

Required Skills and Qualifications:

  • Technical Expertise:
    • Extensive experience with TrueSight, BMC Helix Operations Management, and Prometheus.
    • Strong knowledge of monitoring principles, practices, and tools.
    • Proficiency in scripting languages such as Python.
    • Working knowledge of at least one relational database.
  • Experience:
    • Minimum of 8 years of experience in infrastructure monitoring and administration.
    • Proven experience in leading and managing monitoring teams.
    • Familiarity with log monitoring and log analytics solutions such as ClickHouse and Kibana.
    • Experience implementing network monitoring with network performance monitoring (NPM) tools.
  • Analytical Skills:
    • Strong analytical and problem-solving skills with the ability to diagnose and resolve complex technical issues.
    • Ability to interpret and analyse system performance data to drive continuous improvement.
  • Communication Skills:
    • Excellent verbal and written communication skills.
    • Ability to convey complex technical information to both technical and non-technical stakeholders.

Nice to Have:

  • Hands-on experience with cloud environments (AWS, OCI, GCP) and containerization technologies (Docker, Kubernetes).
  • Relevant certifications such as BMC Certified Professional, Certified Kubernetes Administrator (CKA), or similar.
  • Experience with PromQL.
