Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability.
Job responsibilities
- Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
- Act as a key contributor with other software engineers and teams to design, develop, test, and implement availability, reliability, scalability, self-healing, and solutions in their applications ensuring minimal refactoring or changes
- Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
- Collaborates with technical experts, key stakeholders, and team members to resolve complex problems
- Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
- Be part of the 24x7 support coverage, as needed, and lead coverage during incidents
- Engage with Technology Controls organization to ensure tooling and ecosystem meets the Firm’s rigorous cyber policies
- Coach team members, encourage acquisition of new skills, and be directly accountable for specific software solution outcomes
Required qualifications, capabilities, and skills
- Formal training or certification on software engineering concepts and 5+ years applied experience
- Minimum of 8+ years of hands on experience using large scale software development.
- Experience in SRE/DevOps role for supporting highly available production systems in AWM cloud/Private cloud.
- Experience with defining SRE standards and supporting implementation and adoption of these standards on alert/monitoring setup, Observability, Service level objectives, Incident management, Problem management,
- Proficiency in modern development process and automation tools and one/more general purpose programming languages including Java, C#, Python, C/C++ or Node.js; Web Development – HTML5 ; JavaScript ; CSS; API web services; SQL
- Hands on experience of GIT, BitBucket, Jenkins, SONAR, SPLUNK, Maven, Continuous Integration/Deployment (CI/CD) tools, cloud and containerization: AWS, K8,Unix: Linux and Solaris, relational SQL and non-SQL DB, messaging technologies: eg Kafka, MSK, etc
- Knowledge of networking concepts as Load balancing, IP, DNS
- Excellent communication skill with debugging and trouble shooting skills
- Ability to collaborate with high-performing teams and individuals throughout the firm to accomplish common goals and effectively prioritize the task in a highly dynamic work environment that includes globally positioned resources
Preferred qualifications, capabilities, and skills
- Experience with infrastructure components utilized in data warehousing or big data environments.
- Banking experience preferable