• Bachelor's Degree in Computer Science or related technical field AND strong technical engineering experience with coding in languages including, but not limited to, Golang, C++, C#, Java, Rust, or Python
OR equivalent experience
Experience in one or more of the following areas:
Distributed Systems
Designing and running large-scale fault-tolerant infrastructure services
Networking (TCP/IP, TLS/SSL, HTTP/HTTPS
API design and RESTful Services
Other Requirements:
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Preferred Qualifications:
Bachelor's Degree in Computer Science OR related technical field AND significant technical engineering experience with coding in languages including, but not limited to, Golang, C++, C#, Java, Rust or Python
OR Master's Degree in Computer Science or related technical field AND significant years technical engineering experience with coding in languages including, but not limited to, Golang, C++, C#, Java, Rust, or Python
OR equivalent experience
Experience with Kubernetes and wider Cloud native / Container ecosystem
Experience with L4-L7 proxies (Nginx, Envoy, HAProxy)
Contribution to open source software projects
Responsibilities
Works with appropriate stakeholders to determine user requirements for a set of features.
Contributes to the identification of dependencies, and the development of design documents for a product area with little oversight.
Owns and drives product features end to end right from scoping, architecture, design to implementation and production support.
Balance pragmatism with vision and creativity; deliver continuous improvements to the team’s process and codebase.
Acts as a Designated Responsible Individual (DRI) working on-call to monitor system/product feature/service for degradation, downtime, or interruptions and gains approval to restore system/product/service for simple problems.
Remains current in skills by investing time and effort into staying abreast of current developments that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale.