Job responsibilities
- Analyze and troubleshoot production application flows to ensure end-to-end application or infrastructure service delivery supporting the business operations of the firm
- Improve operational stability and availability through participation in problem management
- Monitor production environments for anomalies and address issues using standard observability tools
- Assist in escalating and communicating issues and solutions to business and technology stakeholders
- Identify trends and assist in the management of incidents, problems, and changes in support of full stack technology systems, applications, or infrastructure
- Produce architecture and design artifacts for complex applications while remaining accountable for ensuring that software code development meets design constraints
- Proactively identify hidden problems and patterns in data, and use these insights to drive improvements to coding hygiene and system architecture
- Collaborate to ensure that all production changes are made in accordance with lifecycle methodology and risk guidelines
- Understand the application and platform details, and take responsibility for their stability and continuous operation
- Contribute to a team culture of diversity, equity, inclusion, and respect
Required qualifications, capabilities, and skills
- Formal training or certification in software engineering concepts, plus proficient applied experience
- Strong incident, problem, and change management skills, plus the communication skills to work with development teams and the user base
- Familiarity with monitoring tools (Geneos, Dynatrace, Splunk) and their configuration
- Hands-on practical experience in system design, application development, testing, and operational stability
- Working experience in one or more general-purpose programming languages (Java, Python, Bash) and an understanding of frameworks (Spring Boot)
- Experience in developing, debugging, and maintaining code in a large corporate environment with one or more modern programming languages and database querying languages
- Exposure to observability and monitoring tools and techniques
- Solid understanding of agile methodologies and related practices such as CI/CD, application resiliency, and security
Preferred qualifications, capabilities, and skills
- AWS exposure (understanding of and working experience with AWS applications, plus an understanding of resiliency, scalability, observability, monitoring, etc.)
- Ability to identify and solve problems related to complex data structures and algorithms
- Drive to self-educate and evaluate new technology