• From Monitoring to Observability: Our Ultra-Marathon to a Cloud-Native Platform 6 January / GlobalIntroduction Managing a global corporate network at Uber’s scale can feel a bit like running an ultra-marathon. • There are long stretches of smooth sailing, but you’re always preparing for the unexpected mountain pass or sudden change in weather. • For years, our engineering teams have navigated this terrain with a traditional, monolithic monitoring system. • Frankly, it felt like running in heavy hiking boots-sturdy, but slow, inflexible, and exhausting to scale up any hill. • We knew we needed to switch to a modern pair of carbon-fiber running shoes. • This meant a complete overhaul: a journey to replace our legacy system with a cloud-native observability platform built for speed, flexibility, and endurance on an open-source stack.
Article Summaries:
- Uber has upgraded its internal corporate network monitoring from a legacy monolithic system to a cloud‑native observability platform. The new solution focuses on Uber’s CorpNet-offices, data centers, cloud environments, and internal services-tracking device health, connectivity, latency, and operational data flows. Built on Kubernetes, the architecture deploys modular, containerized services worldwide (US, EMEA, APAC) for low‑latency, high‑availability probes. Uber uses open‑source tools: Telegraf for metrics collection, Prometheus and Thanos for real‑time and long‑term storage, Grafana and Kibana for visualization, and Elasticsearch for metadata search. The goal is to deliver scalable, actionable insights that match the reliability of the systems Uber supports.
Sources: