As a Technology Support III team member in Corporate Technology, you will ensure the operational stability, availability, and performance of our production application flows. Encourage a culture of continuous improvement as you troubleshoot, maintain, identify, escalate, and resolve production service interruptions for all internally and externally developed systems, leading to a seamless user experience.
Job responsibilities
- Provides end-to-end application or infrastructure service delivery to enable successful business operations of the firm
- Supports the day-to-day maintenance of the firm’s systems to ensure operational stability and availability
- Assist in the monitoring of production environments for anomalies and address issues utilizing standard observability tools
- Identify issues for escalation and communication, and provide solutions to the business and technology stakeholders
- Analyze complex situations and trends to anticipate and solve incident, problem, and change management in support of full stack technology systems, applications, or infrastructure
Required qualifications, capabilities, and skills
- Formal training or certification on software engineering concepts and 3+ years applied experience
- Troubleshoots incidents, conducts blameless post-mortems, and ensures the permanent closure of incidents. Possesses excellent debugging and troubleshooting skills.
- Engages with the development team throughout the lifecycle to help develop software for reliability.
- Applies analytics on historical data, such as incidents and usage patterns, to predict issues and take proactive action.
- Drives the adoption of self-healing and resiliency patterns such as circuit breakers and bulkheads.
- Understands Maven, CI/CD/automation testing, and chaos engineering pipelines.
- Has excellent knowledge of standard APM tools like AppDynamics, Dynatrace, Perfmon, NMON, Apica, Grafana, etc.
- Is well-versed in one of the cloud platforms, Kubernetes, and Jenkins.
- Has a good understanding of the change management process and the ServiceNow tool.
Preferred qualifications, capabilities, and skills
- Experienced in Agile development with SCRUM or similar methodologies, and possesses strong analytical and problem-solving skills.
- Quickly learns and applies new technologies, and actively participates in diagnosing and resolving incidents, addressing root causes.
- Identifies and eliminates toil through systems engineering or code updates, and enhances observability with improved service level indicators and monitoring.
- Effective team player who collaborates well with larger groups