As a Technology Support III team member in Identity & Access Management and Privilege Access Management, you will ensure the operational stability, availability, and performance of our production application flows. Encourage a culture of continuous improvement as you troubleshoot, maintain, identify, escalate, and resolve production service interruptions for all internally and externally developed systems, leading to a seamless user experience.
Job responsibilities
- Provides end-to-end application or infrastructure service delivery to enable successful business operations of the firm.
- Supports the day-to-day maintenance of the firm’s systems to ensure operational stability and availability.
- Assists in the monitoring of production environments for anomalies and address issues utilizing standard observability tools.
- Identify issues for escalation and communication, and provide solutions to the business and technology stakeholders.
- Analyze complex situations and trends to anticipate and solve incident, problem, and change management in support of full stack technology systems, applications, or infrastructure.
- Applies technical knowledge and problem-solving methodologies to projects of moderate scope, with a focus on improving the data and systems running at scale, and ensures end to end monitoring of applications.
- Work as an Command Center individual in a highly customer-focused Payments Affinity Team, which provides 7×24 support for critical systems.
- Resolves most nuances and determines appropriate escalation path.
- Demonstrates ability to monitoring multiple systems simultaneously and responding to incidents in real-time.
- Executes conventional approaches to build or break down technical problems.
Required qualifications, capabilities, and skills
- 3+ years of experience or equivalent expertise troubleshooting, resolving, and maintaining information technology services
- Bachelor Degree in Computer Science/Information Systems/Engineering or related disciplines
- Demonstrated knowledge of applications or infrastructure in a large-scale technology environment both on premises and public cloud
- Experience in observability and monitoring tools and techniques
- Comprehensive understanding of networking protocols such as TCP/IP, DNS, DHCP, VPNs, and firewalls, with basic troubleshooting capabilities.
- Exposure to processes in scope of the Information Technology Infrastructure Library (ITIL) framework and proficient in using real-time monitoring tools like Splunk and Grafana.
- Basic proficiency in scripting languages such as PowerShell, Bash, and Python for automating tasks and analyzing logs.
- Experienced with platforms like ServiceNow and Jira for effective incident management.
- Knowledgeable in common enterprise applications and adept at troubleshooting related issues.
- Capable of reading system logs, conducting health checks, and identifying potential issues before they escalate.
- Actively participates in Root Cause Analysis (RCA) calls for P1/P2 issues to identify underlying problems and solutions.
Preferred qualifications, capabilities, and skills
- Experience with one or more general purpose programming languages and/or automation scripting
- Working understanding of public cloud