Job responsibilities
- Automate identified manual processes to reduce toil and build tools for enhanced monitoring and observability.
- Execute small to medium projects independently, progressing to designing and delivering projects autonomously.
- Leverage technology to solve business problems by writing high-quality, maintainable, and robust code following software engineering best practices.
- Participate in triaging, diagnosing, and resolving incidents, collaborating with others to address root causes.
- Recognize toil in your role and proactively work to eliminate it through systems engineering or application code updates.
- Understand observability patterns and strive to implement and improve service level indicators, objectives, monitoring, and alerting solutions for optimal transparency and analysis.
Required qualifications, capabilities, and skills
- Possess formal training or certification in software engineering concepts and 2+ years of applied experience
- Demonstrate coding ability in at least one programming language and experience in maintaining cloud-based infrastructure.
- Familiar with site reliability concepts and observability practices, including monitoring and alerting using tools like Grafana, Dynatrace, Prometheus, Datadog, and Splunk.
- Understand and work with containers or common server operating systems such as Linux and Windows.
- Develop emerging knowledge in software, applications, and technical processes, including CI/CD tools like Jenkins, GitLab, or Terraform, and common networking technologies.
- Collaborate effectively in a large team, vocalizing ideas with peers and managers, and adapt work plans to changing responsibilities and projects.
- Eager to participate in learning opportunities to enhance effectiveness in project activities and apply system processes and methodologies to system development.
Preferred qualifications, capabilities, and skills
- General knowledge of financial services industry