Expoint - all jobs in one place

Finding the best job has never been easier

Limitless High-tech career opportunities - Expoint

Airbnb Staff Operations Engineer Observability 
United States 
105981847

18.05.2024

The Community You Will Join:

  • Airbnb is a company with a mission to create a world where anyone can belong anywhere, achieved through a unified team adhering to core values. The BizTech department plays a crucial role in this mission by providing reliable internal systems, innovative products, and technical support, fostering an empowered and inclusive progress. They also create technical breakthroughs and strategies that redefine the concept of belonging anywhere, delivering value to both the business and its people.
  • The Global Operations arm of BizTech manages production services in the corporate environment. A Staff Operations Engineer within this team focuses on observability architecture, operational efficacy, and automation. They work closely with other Operations team members and BizTech engineering teams to develop solutions, anticipate, and resolve issues. Their role requires experience in Site Reliability Engineering and Observability development.

The Difference You Will Make:

  • You're tasked with identifying and rectifying persistent issues through scalable automated solutions, thereby enhancing operational performance and productivity.
  • You're also responsible for spearheading the development and upkeep of testing and monitoring tools, ensuring the continuous operation of all automation platforms.
  • You're in charge of the quality and reliability of BizTech services, which includes verifying post mortems, conducting root-cause analysis, and implementing corrective actions.
  • You'll collaborate with various BizTech engineering teams to establish and maintain service level objectives and indicators, contributing to the overall efficiency and security of our services.
  • Lastly, you'll lead the planning of operations architecture for the next 1-3 years, connecting disparate systems in production to improve compatibility and stability.

A Typical Day:

  • Dedicate a portion of the day to core Operations tasks, which involve addressing requests and issues that our users have identified and reported via tickets. Strive to comprehend the requests at hand, identify patterns, and resolve them with solutions that can make handling these types of problems more efficient.
  • Being part of an on-call rotation could mean that you are called upon to address and lead resolution of high-severity incidents related to production services, taking on a double role as an incident commander and operations engineer.
  • Participate, facilitate, lead team and project meetings, collaborating with Operations peers and cross-functional BizTech peers.
  • Work as a team, stay on top of tasks, engagements, and interactions with colleagues. Active participation and collaboration is a recipe for success.
  • Work on sprints, project tasks that involve coding, testing, designing, documenting, and reviewing operational readiness.

Your Expertise:

  • 10+ years combination of IT Operations, Site Reliability, observability and architecture
  • Strong Python coding abilities, with a focus on API, integrations, and event driven architecture (AWS Lambda/SQS architecture).
  • Proven experience with Software Development Lifecycles including infrastructure as code, configuration management, distributed version control system, and continuous delivery technology processes
  • Strong experiences with complex corporate environments, including automation, observability (e.g. open source tools grafana, kibana, vector, opensearch, cloudwatch), network (e.g. Cisco, Palo Alto), systems (e.g. Chef, Terraform, Jenkins, Kubernetes), applications, SaaS, Cloud technologies (e.g. AWS, GCP)
  • Experience designing logging pipelines, monitoring and alerting frameworks, opentelemetry, as well as tracing tools and CI/CD pipelines
  • Exceptional communication skills and ability to clearly communicate ideas into business requirements

How We'll Take Care of You:

Pay Range
$232,000 USD