Inside Husky’s query engine: Real-time access to 100 trillion events
A technical deep-dive into Datadog’s Husky event-store query engine designed for interactive querying over 100+ trillion events. The post explains Husky’s multi-service query path (planner, orchestrator, metadata service, reader), storage layout (fragments, row groups, text-search segments), execution model (iterator-based, lazy decoding), extensive pruning and multi-layer caches (result, blob-range, predicate), routing via shuffle sharding for affinity/isolation, and streaming partial results to mitigate latency tails. It includes metrics for cache hit rates and pruning effectiveness and notes future moves toward Arrow/Parquet/Substrait interoperability.