Documentation
¶
Overview ¶
Package bench implements a LongMemEval-style evaluation harness for Yaad. It measures retrieval accuracy (R@K), MRR, and token efficiency.
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type QA ¶
type QA struct {
Question string
ExpectedNodeID string // ID of the node that should be retrieved
ExpectedContent string // or match by content substring
}
QA is a single question-answer pair for evaluation.
func CodingBenchQAs ¶
func CodingBenchQAs() []QA
CodingBenchQAs returns an extended set of 50 coding-specific QA pairs for more rigorous evaluation. Seed your DB with realistic coding memories first.
func DefaultQAs ¶
func DefaultQAs() []QA
DefaultQAs returns a built-in set of memory QA pairs. Covers the same categories as LongMemEval: single-hop, multi-hop, temporal, preference.
Click to show internal directories.
Click to hide internal directories.