Job responsibilities
- Executes small to medium projects independently with initial direction and eventually graduates to designing and delivering projects by yourself
- Leverages technology to solve business problems by writing high quality, maintainable, and robust code following best practices in software engineering
- Participates in triaging, examining, diagnosing, and resolving incidents and work with others to solve problems at their root
- Recognizes the toil within your role and proactively works towards eliminating it through either systems engineering or updating application code
- Understands observability patterns and strives to implement and improve service level indicators, objectives monitoring, and alerting solutions for optimal transparency and analysis
- Define and implement SLOs, SLIs and error budgets to ensure stability and performance
- Define, create and track application analytics in support of SLOs
- Design, code, test and deliver software solutions to improve our existing systems
- Participate feature delivery ensuring reliability, scalability, resiliency, and operational stability from the outset
Required qualifications, capabilities, and skills
- Formal training or certification on software engineering concepts and 3+ years applied experience
- Experience of defining and implementing Service level objectives and indicators
- Experience maintaining a Cloud-base infrastructure
- Familiar with site reliability concepts, principles, and practices
- Familiar with observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Splunk, and others
- Emerging knowledge of software, applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.)
- Emerging knowledge of continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform
- Emerging knowledge of common networking technologies
- Ability to work in a large, collaborative team and demonstrates the willingness to vocalize ideas with peers and managers
- Understanding of how to prioritize and adjust work plans to adapt to changes in assigned responsibilities and projects
- Eagerness to participate in learning opportunities to enhance one’s effectiveness in executing day-to-day project activities
- Ability to demonstrate and apply existing and new system processes, methodologies, and skills to contribute to the development of systems
Preferred qualifications, capabilities, and skills
- Cloud computing (Cloud Foundry, AWS)
- Ability to code in at least one scripting-programming language, PowerShell, Python is favorable.
- SRE bar raiser certification is preferred.