SIMD OCC PLAN

Goal:

Accelerate FM rank/Occ layer.

Motivation:

Correctness and contracts are stabilized.

Next leverage point:

Occ()

Current:

scalar implementation

Future:

scalar ↓

AVX2 ↓

AVX512

Benchmark isolation requirements:

measure:

  • Occ only
  • rank only

exclude:

  • mmap startup
  • HTTP
  • IPC
  • backend startup
  • shard orchestration

Benchmark outputs:

p50 p95 p99

Machine profile mandatory:

  • CPU
  • compiler
  • SIMD capability
  • corpus size

Success metric:

lower rank latency

Future dispatch layer:

capability probe ↓

runtime dispatch ↓

scalar / avx2 / avx512

Principle:

benchmark = contract

performance follows correctness