eval/

directory
v0.2.0 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Apr 30, 2026 License: MIT

Directories

Path Synopsis
Package corpus extracts plain-text turns from Claude Code session transcripts (`~/.claude/projects/<project>/<sessionUUID>.jsonl`).
Package corpus extracts plain-text turns from Claude Code session transcripts (`~/.claude/projects/<project>/<sessionUUID>.jsonl`).
Package dataset owns the on-disk layout for memmy-eval datasets.
Package dataset owns the on-disk layout for memmy-eval datasets.
Package embedcache caches embedding vectors keyed by (model_id, dim, sha256(text)) so repeated ingest runs over the same corpus do not re-embed unchanged chunks.
Package embedcache caches embedding vectors keyed by (model_id, dim, sha256(text)) so repeated ingest runs over the same corpus do not re-embed unchanged chunks.
Package harness is the composition layer that turns a Claude Code session directory into a memmy store, runs query batteries against it, and captures the per-node state changes that result.
Package harness is the composition layer that turns a Claude Code session directory into a memmy store, runs query batteries against it, and captures the per-node state changes that result.
Package inspect is a read-only window into a memmy Neo4j database used by the eval harness to capture per-node state (weight, last touch, edge degree) before and after a query.
Package inspect is a read-only window into a memmy Neo4j database used by the eval harness to capture per-node state (weight, last touch, edge degree) before and after a query.
Package manifest defines the on-disk metadata records that document what produced an eval dataset and what an eval run actually exercised.
Package manifest defines the on-disk metadata records that document what produced an eval dataset and what an eval run actually exercised.
Package metrics computes the IR + dynamics scores the eval harness reports per run.
Package metrics computes the IR + dynamics scores the eval harness reports per run.
Package queries owns the labeled-query side of the eval framework.
Package queries owns the labeled-query side of the eval framework.
Package sweep runs an eval harness over a matrix of memmy configs to find a parameter setting that scores best on a labeled query set.
Package sweep runs an eval harness over a matrix of memmy configs to find a parameter setting that scores best on a labeled query set.

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL