About the Role:Our platform makes informed decisions with a high degree of precision and accuracy, to detect possible incidents and regressions early and automatically, thereby minimizing the chance of outages. We offer automatic rollbacks of bad deployments, emergency deploy policies, lockdown of systems during an active incident, incident mitigation capabilities, and other critical capabilities that help Uber innovate at remarkable speed.
What You'll Do:- Build elegant backend platform components for high-throughput, low-latency distributed systems.
- Tackle challenging problems with innovative design and algorithms.
- Collaborate with team members to maintain system health, ensure code hygiene, refactor, and enhance system robustness and resilience.
- Promote best practices in code health, testing, and maintainability.
- Analyze and decompose complex software systems, working with cross-functional teams to design for scalability and testability.
- Work with motivated and curious teammates in a fast-paced, collaborative environment.
Basic Qualifications:- Experience : 5+ years as a software engineer focused on building large-scale distributed systems with proven impact.
- Architecture : Solid understanding of distributed systems architecture, including consensus, convergence, data consistency, and performance/efficiency constructs. Ability to identify and resolve performance and scalability issues.
- Coding Skills : Proficient in writing clean, elegant, bug-free code.
- Leadership : Proven experience leading engineering teams to deliver large, multi-functional projects autonomously. Ability to own part of the team’s charter and help set the long-term direction.
- Project Management : Ability to manage projects from scoping requirements to launch.
- Continuous Learning : Strong desire to learn, grow, and build best-in-class systems.
Key Skills:- Solution Design : Design reliable solutions for business use cases in highly distributed systems. Prior work experience designing reliable, scalable solutions is a must.
- Problem Solving : Ability to identify and resolve performance and scalability issues.
- Implementation : Write clean, readable code with unit and integration test coverage, and running it in production.
- Cross-Team Communication : Effective communication and collaboration skills, with the ability to push forward and step back as needed. Collaborate effectively with cross-functional teams.
- Engineering Best Practices : Commitment to pushing the bar on engineering best practices and helping build a strong engineering culture. Contribute to the team’s engineering culture in terms of quality, monitoring, and on-call practices.
Nice to have:- Experience with building platforms, platform engineering, and deployment systems is a plus
- Experience with large scale, data intensive applications is a plus.
- Experience working with k8s, operators is a plus.
- Go is our primary coding language. We think Java/Rust/C++ is acceptable and transferable.
* Accommodations may be available based on religious and/or medical conditions, or as required by applicable law. To request an accommodation, please reach out to .