unified_bench (command) v0.3.0

Published: Feb 7, 2026 License: MIT Imports: 54 Imported by: 0
unified_bench

Side-by-side benchmarks for HashDB, BTreeOnHashDB, TreeDB (cached), Badger, and LevelDB.

Run

  • Build: make unified-bench (writes bin/unified-bench)
  • Run: ./bin/unified-bench
  • Or: go run ./cmd/unified_bench

Reproducibility

  • Randomized tests use a per-test PRNG derived from -seed so every DB sees the same random key/query sequence.
  • The chosen seed is printed to stderr at startup.
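
The derivation isn't specified here, but the idea is that mixing the base -seed with the test name gives each test a stable, replayable stream. A minimal sketch (the FNV-1a mixing scheme below is an assumption for illustration, not the tool's actual derivation):

```go
package main

import (
	"fmt"
	"hash/fnv"
	"math/rand"
)

// perTestRNG returns a PRNG whose stream depends only on the base seed
// and the test name, so every DB under a given test replays the same
// key/query sequence. The mixing (FNV-1a over the test name XORed into
// the seed) is illustrative only.
func perTestRNG(baseSeed int64, testName string) *rand.Rand {
	h := fnv.New64a()
	h.Write([]byte(testName))
	return rand.New(rand.NewSource(baseSeed ^ int64(h.Sum64())))
}

func main() {
	a := perTestRNG(1, "read_rand")
	b := perTestRNG(1, "read_rand")
	fmt.Println(a.Int63() == b.Int63()) // same seed + test => same stream
}
```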

Tests

  • write_seq — Sequential Write
  • write_rand — Random Write
  • batch_write — Batch Write
  • batch_random — Batch Random
  • batch_delete — Batch Delete
  • delete_rand — Random Delete
  • read_rand — Random Read
  • full_scan — Full Scan (iterate the full keyspace)
  • prefix_scan — Prefix Scan (range scans over [start,end))
    • Aliases: scan → full_scan, range_scan → prefix_scan
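
The aliases above suggest a simple canonicalization step before dispatch. A hypothetical sketch of that mapping (the helper name and resolution logic are assumptions, not the tool's internals):

```go
package main

import "fmt"

// testAliases maps legacy test names to their canonical names, per the
// aliases listed above. Hypothetical helper for illustration.
var testAliases = map[string]string{
	"scan":       "full_scan",
	"range_scan": "prefix_scan",
}

// canonicalTest resolves an alias to its canonical test name, passing
// unknown names through unchanged.
func canonicalTest(name string) string {
	if c, ok := testAliases[name]; ok {
		return c
	}
	return name
}

func main() {
	fmt.Println(canonicalTest("range_scan")) // prefix_scan
	fmt.Println(canonicalTest("read_rand"))  // read_rand
}
```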

Common flags

  • -profile benchmark profile preset (see cmd/unified_bench/profiles.go):
    • balanced (default)
    • durable (strict durability)
    • fast (max throughput; TreeDB WAL off; unsafe)
    • wal_on_fast (TreeDB WAL on + relaxed durability; unsafe)
  • -dbs (all or CSV): hashdb,btree,treedb,badger,leveldb
  • -test (all or CSV): see list above
  • -keys number of keys (default 100000)
  • -keycounts comma-separated key counts to sweep over (overrides -keys)
  • -keyscale generate keycounts by scale: log10 or doubling (uses -keys-min / -keys-max)
  • -valsize value size in bytes (default 128)
  • -val-pattern value pattern for non-dataset write tests (zero|repeat|repeat_tail64|ultra_compressible_repeat|highly_compressible_notail|half_repeat_half_random|medium_compressible_sparse|random)
  • -val-pool-size number of distinct values to cycle through for -val-pattern (0 = auto)
  • -dataset-val-pattern dataset value pattern for dataset_write_* (random|zero|repeat|repeat_tail64|half_repeat_half_random)
  • -batchsize batch size (default 1000)
  • -range-queries number of prefix/range queries (default 200)
  • -range-span number of keys per range (default 100)
  • -leveldb-block-compression LevelDB: block compression mode (default|on|off|both)
  • -leveldb-block-size LevelDB: table block size in bytes (default 4096)
  • -treedb-flush-threshold TreeDB (cached) flush threshold in bytes (default 64MB)
  • -treedb-max-queued-memtables TreeDB (cached) max queued immutable memtables before applying backpressure flush (0=default, <0=disable)
  • -treedb-slowdown-backlog-seconds TreeDB (cached) start backpressure when queued backlog exceeds this many seconds of flush work
  • -treedb-stop-backlog-seconds TreeDB (cached) block writers when queued backlog exceeds this many seconds of flush work
  • -treedb-max-backlog-bytes TreeDB (cached) absolute cap on queued backlog bytes
  • -treedb-writer-flush-max-memtables TreeDB (cached) max memtables a writer will help flush per op
  • -treedb-writer-flush-max-ms TreeDB (cached) max time (ms) a writer will help flush per op
  • -treedb-iter-debug print prefix scan iterator timing + debug stats
  • -treedb-iter-debug-limit max per-query debug lines to print (default 20)
  • -treedb-maintenance-ops-per-coalesce TreeDB: ops-per-coalesce maintenance budget (0=default, <0=disable budget)
  • -treedb-bg-vacuum-interval TreeDB: background index vacuum interval (0=disabled)
  • -treedb-bg-vacuum-span-ppm TreeDB: background index vacuum span ratio threshold (ppm), 0=default
  • -treedb-allow-unsafe TreeDB: allow unsafe durability/integrity options (required for unsafe toggles)
  • -treedb-vlog-dict TreeDB: value-log dict compression mode (default|on|off|both)
  • -treedb-vlog-dict-frame-encode-level TreeDB: dict frame zstd encoder level (engine|fastest|default|better|best|all|<int>)
  • -treedb-vlog-dict-frame-entropy TreeDB: dict frame entropy mode (engine|on|off|both)
  • -seed PRNG seed for randomized tests (default 1; 0 = time-based)
  • -keep keep temp DB directories after run
  • -settle-before-scans close+reopen DBs before full_scan/prefix_scan to measure scan performance on a “settled” (fully flushed) state
  • -progress live table updates to stderr (default true)
  • -format output format: table or markdown
  • -cpuprofile write per-test CPU profiles to <prefix>_<test>_<db>.pprof
  • -cpuprofile-tests restrict CPU profiling to a CSV list of tests (e.g. random_read,batch_random)
  • -checkpoint-cpuprofile write per-checkpoint CPU profiles to <prefix>_checkpoint_<test>_<db>.pprof
  • -checkpoint-cpuprofile-tests restrict checkpoint CPU profiling to a CSV list of tests
  • -treedb-cache-stats-before-reads print select treedb.cache.* stats before read/scan tests (treedb only)
  • -blockprofile, -mutexprofile, -trace write profiling artifacts to files
  • -max-wall abort the run if wall time exceeds this duration (guardrail; 0 = disabled)
  • -max-rss-mb abort the run if RSS exceeds this many MiB (guardrail; 0 = disabled; Linux-only)
  • -checkpoint-between-tests force a best-effort durability checkpoint between tests (DBs that support Checkpoint())
  • -vacuum-between-tests vacuum supported DBs between tests (implies -checkpoint-between-tests; TreeDB uses VacuumIndexOnline)
  • -checkpoint-every-ops force a best-effort durability checkpoint every N ops during write-heavy tests (DBs that support Checkpoint())
  • -checkpoint-every-bytes force a best-effort durability checkpoint every N approx bytes during write-heavy tests (DBs that support Checkpoint())
  • -suite named suite:
    • readme — generates the README graphs + sweep tables
    • churn — churn + settled scans (treedb,leveldb)
    • churnvacuum — churn + settled scans, then index compaction and scan again
    • flushdrain — write burst → checkpoint boundary → read; prints checkpoint timing (TreeDB-focused). Use -flushdrain-checkpoint-max=<duration> to fail the suite if the checkpoint before random_read exceeds your latency target.
    • flushthrash — forces a small TreeDB flush threshold; catches flush thrash / runaway backlog regressions
    • bigkeys_guard — small TreeDB flush threshold + large keycount, with wall/RSS caps for CI guardrails
    • longmix — long-ish mixed workload + settle boundary with fragmentation reports
    • sload_readheavy — settled point reads with value-log pointers + forkchoice-style batch commits
    • maintenance_budget — sweep TreeDB maintenance K values; reports checkpoint time vs index size, recommends K
  • -outdir output directory for suite artifacts (plots/images; used by -suite readme)
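
As a rough mental model of how -keyscale presumably expands -keys-min / -keys-max into a sweep (exact rounding and bounds handling in the tool may differ; this is an illustrative sketch):

```go
package main

import "fmt"

// sweepKeyCounts generates a keycount sweep from min to max, stepping
// by powers of ten ("log10") or by doubling. Mirrors what -keyscale
// with -keys-min/-keys-max plausibly does; not the tool's exact logic.
func sweepKeyCounts(scale string, min, max int) []int {
	var out []int
	for n := min; n <= max; {
		out = append(out, n)
		if scale == "log10" {
			n *= 10
		} else { // "doubling"
			n *= 2
		}
	}
	return out
}

func main() {
	fmt.Println(sweepKeyCounts("log10", 1000, 1000000))
	fmt.Println(sweepKeyCounts("doubling", 100000, 800000))
}
```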

Notes

TreeDB is a cached engine (memtable + background flush). If you run long write-heavy phases and then measure random_read/scans immediately, the results can be dominated by background flush work (“flush debt”).

Recommended:

  • For settled read/scan performance: use -checkpoint-between-tests or -settle-before-scans.
  • For mixed workload under flush debt: keep defaults and optionally enable -treedb-cache-stats-before-reads to see queue/backlog stats.

Repro: mixed vs settled reads (TreeDB)

Mixed (reads under flush debt; intentionally stressful):

go run ./cmd/unified_bench -dbs treedb -profile fast -keys 900000 -valsize 128 -batchsize 1000 \
  -test sequential_write,random_write,dataset_write_random,dataset_write_sorted,batch_write,batch_random,batch_delete,batch_small_seq,random_delete,random_read \
  -treedb-cache-stats-before-reads -progress=false

Settled (reads after a durability boundary):

go run ./cmd/unified_bench -dbs treedb -profile fast -keys 900000 -valsize 128 -batchsize 1000 \
  -test sequential_write,random_write,dataset_write_random,dataset_write_sorted,batch_write,batch_random,batch_delete,batch_small_seq,random_delete,random_read \
  -checkpoint-between-tests -progress=false

Repro: compression matrix (TreeDB dict + LevelDB block compression)

Run TreeDB twice (dict on/off) and LevelDB twice (block compression on/off) in one invocation:

./bin/unified-bench -test batch_write,random_write,batch_delete -dbs treedb,leveldb -profile fast -keys 4000000 -format markdown \
  -treedb-force-value-pointers \
  -treedb-vlog-dict both \
  -leveldb-block-compression both

To sweep dict-frame encoder knobs (zstd level × entropy coding), use:

./bin/unified-bench -test batch_write -dbs treedb -profile fast -keys 1000000 -format markdown \
  -treedb-force-value-pointers \
  -treedb-vlog-dict on \
  -treedb-vlog-dict-frame-encode-level all \
  -treedb-vlog-dict-frame-entropy both
