GLYPH Segmented 2-Shard Benchmark

Benchmark#

Tool:

benchmarks/segmented_fanout_v1.py

Mode:

sequential_fanout_over_persistent_cpp_backends

Corpus:

HDFS 1GB split into 2 shards

Shard layout:

shard0: first 512 MiB
shard1: second 512 MiB

Query set:

bench_1gb/queries.txt

Query count:

100

Warm runs:

3

Warm measurements:

300

Single-index baseline#

Reference benchmark:

benchmarks/HDFS_1GB_PERSISTENT_PXX.md

Single 1GB persistent backend:

warm p50:  ~0.0098 ms
warm p95:  ~0.0104 ms
warm p99:  ~0.0105 ms

2-shard segmented result#

Two persistent backends queried sequentially.

Startup:

p50: ~1910.8 ms per shard

Cold queries:

p50: ~0.0270 ms
p95: ~0.0376 ms
p99: ~0.0819 ms

Warm queries:

min:  ~0.0192 ms
p50:  ~0.0207 ms
p95:  ~0.0228 ms
p99:  ~0.0239 ms
max:  ~0.0291 ms
mean: ~0.0210 ms

Sample fan-out response#

Query:

blk_-100000266894974466

Responses:

shard0:
    count = 31

shard1:
    count = 0

Total:

31

Interpretation#

This benchmark measures sequential fan-out across two persistent FM backends.

Observed behavior:

single backend p50:
    ~0.010 ms

two-shard fan-out p50:
    ~0.021 ms

Observed scaling:

~2.1× latency increase

This is consistent with sequential querying over two resident indexes.

Important:

No explosive tail-latency behavior was observed.

Tail stability:

p99 / p50 ≈ 1.15

This indicates stable warm-query behavior even under segmented fan-out.


What this benchmark does NOT measure#

  • parallel shard querying
  • shard overlap handling
  • cross-shard locate merge cost
  • HTTP/network overhead
  • distributed multi-machine fan-out
  • shard balancing strategies

Machine#

See:

benchmarks/MACHINE_SPEC.md