Hardening eBPF for runtime security: Lessons from Datadog Workload Protection

Hardening eBPF for runtime security: Lessons from Datadog Workload Protection

• Guillaume Fournier eBPF has opened up new capabilities for observability, networking, and security. • But when you run it in production across thousands of environments and kerne

Breaking up a monolith: How we're unwinding a shared database at scale

Breaking up a monolith: How we're unwinding a shared database at scale

• Shared relational DBs simplify early growth but create coordination headaches as teams scale. • Schema changes become risky because the database is both data store and API contra

How we scaled fast, reliable configuration distribution to thousands of workload containers

How we scaled fast, reliable configuration distribution to thousands of workload containers

• Datadog handles millions of logs per second, requiring instant config updates across thousands of containers. • User‑defined log parsing rules are applied immediately, demanding

Detecting faulty deployments: Our journey from unlabeled data to supervised learning

Detecting faulty deployments: Our journey from unlabeled data to supervised learning

• Deployments drive 70% of incidents, making rapid fault detection critical for modern DevOps. • Datadog’s Automatic Faulty Deployment Detection uses APM telemetry to spot problema

Squeezing every millisecond: How we rebuilt the Datadog Lambda Extension in Rust

Squeezing every millisecond: How we rebuilt the Datadog Lambda Extension in Rust

• Rebuilt Datadog Lambda Extension in Rust, slashing cold start latency by 82%. • Reduced memory footprint by 40%, cutting usage from 55 MB to just 7 MB. • Shifted from heavyweig

Husky: Efficient compaction at Datadog scale

Husky: Efficient compaction at Datadog scale

• Husky layers over object storage, using fragments to batch trillions of daily events. • Metadata per fragment lets queries locate relevant data without scanning entire store. • D