Finding the best job has never been easier
Share
As a member of the Platform Observability Engineering team within Ford's Data Platforms and Engineering (DP&E) organization, you will contribute to building and maintaining a top-tier platform for monitoring and observability. This platform will focus on the four golden signals—latency, traffic, errors, and saturation—providing essential data to support operations, root cause analysis, continuous improvement, and cost optimization. Collaborating with platform architects, you will help design, develop, and maintain a scalable and reliable platform, ensuring smooth integration with systems used across various teams. Your contributions will be key in improving MTTR and MTTX through increased visibility into system performance, working with stakeholders to integrate observability data into their workflows, developing insightful dashboards and reports, continuously improving platform performance and reliability, optimizing costs, and staying updated with industry best practices and technologies. Ideally, you'll have experience with large-scale, high-availability systems, a solid understanding of the four golden signals, familiarity with monitoring tools like Prometheus, Grafana, and Jaeger, and experience with cloud platforms like AWS, Azure, or GCP. This role focuses on building and maintaining a robust platform, rather than developing individual monitoring tools, by creating a centralized, reliable source of observability data that empowers data-driven decisions and accelerates incident response across the organization.
These jobs might be a good fit