Expoint - all jobs in one place

PNC Observability Support Specialist - Tempus 
Remote, Remote 
52749623

Yesterday
The IT Observability and Support Specialist is responsible for ensuring the reliability, availability, and performance of IT systems through proactive monitoring, incident response, and continuous improvement of observability practices. This role will design, implement, and maintain monitoring solutions, troubleshoot production issues, and work closely with IT, DevOps, and engineering teams to optimize system performance and incident resolution.

Job responsibilities may include the following:
· Observability & Monitoring
o Develop and maintain monitoring, logging, and alerting solutions using tools like Elastic Stack, Datadog, Splunk, Prometheus, or similar platforms.
o Create and refine dashboards, alerts, and metrics to enhance visibility into system health and performance.
o Ensure end-to-end observability of infrastructure, applications, and network components.
· Incident Response & Troubleshooting
o Monitor, triage, and manage production incidents, ensuring that issues are addressed quickly and efficiently.
o Communicate incident status, resolution steps, and timelines to both technical and non-technical stakeholders.
o Participate in on-call rotations to ensure 24/7 incident response coverage.
o Assist in developing runbooks and standard operating procedures (SOPs) for incident response.
· Continuous Improvement & Automation
o Identify gaps in monitoring and implement improvements to reduce false positives and enhance actionable alerts.
o Automate repetitive monitoring and troubleshooting tasks to improve efficiency.
o Contribute to the ongoing maturity of the observability strategy, ensuring alignment with industry best practices.
· Collaboration & Documentation
o Maintain accurate and up-to-date documentation of monitoring configurations, incident response procedures, and troubleshooting guides.
Key Relationships:
· IT Tempus Production Systems Team
· IT Site Reliability Team
· Observability Engineer
Qualifications:
· Hands-on experience with observability tools such as Elastic Stack, Datadog, Prometheus, Splunk, Grafana, or similar.
· Strong understanding of incident management, troubleshooting methodologies, and root cause analysis.
· Experience working with Linux and Windows environments.
· Proficiency in scripting languages (Python, PowerShell, Bash) for automation.
· Knowledge of cloud environments (AWS, Azure, or GCP) and containerized applications (Docker, Kubernetes).
· Excellent analytical, problem-solving, and communication skills.
Preferred Qualifications:
· Certifications in ITIL, Observability, or Site Reliability Engineering (SRE).
· Experience with Infrastructure-as-Code (IaC) tools like Terraform or Ansible.
· Familiarity with DevOps and CI/CD pipelines.
· Understanding of networking principles and security best practices.
Job Description
  • Monitors, maintains, and supports systems and software to ensure stability and compliance with technology standards.
  • Performs maintenance, including installation of patches and upgrades, with an understanding of the production environment and the goal of continuous availability.
  • Detects, resolves, and/or escalates incidents. Conducts root cause analysis and correction to prevent recurrence.
  • Monitors systems availability, performance, and capacity against baseline metrics and reports trends. Reports information to support stabilization.
  • Develops in-depth knowledge of supported systems and applications and transfers knowledge to more junior staff.
  • Leads application governance, such as required attestations, continuity testing, impact analysis, responding to audit/regulatory inquiries and complying with technology standards.

PNC employees take pride in our reputation, and to continue building upon it we expect our employees to be:

  • Customer Focused - Knowledgeable of the values and practices that align customer needs and satisfaction as primary considerations in all business decisions and able to leverage that information in creating customized customer solutions.
  • Managing Risk - Assessing and effectively managing all of the risks associated with their business objectives and activities to ensure they adhere to and support PNC's Enterprise Risk Management Framework.
Qualifications

Successful candidates must demonstrate appropriate knowledge, skills, and abilities for a role. Listed below are skills, competencies, work experience, education, and required certifications or licenses needed to be successful in this position.

Analytical Thinking, Application Maintenance, Application Programming Interfaces (APIs), Decision Making and Critical Thinking, IT Environment, IT Incident Management, IT Service Management (ITSM), IT Standards, Procedures & Policies, Packaged Application Integration, Software Reliability Management, System Development Life Cycle, Technical Troubleshooting

Roles at this level typically require a university / college degree, with 3+ years of relevant / direct industry experience. Certifications are often desired. In lieu of a degree, a comparable combination of education, job-specific certification(s), and experience (including military service) may be considered.

No Required Certification(s)

No Required License(s)

This position is subject to the requirements of Section 19 of the Federal Deposit Insurance Act (FDIA) and, for any registered role, the Secure and Fair Enforcement for Mortgage Licensing Act of 2008 (SAFE Act) and/or the Financial Industry Regulatory Authority (FINRA), which prohibit the hiring of individuals with certain criminal history.

California Residents

Refer to the