Documentation
¶
Index ¶
- func Coverage(want, got map[string]struct{}) float64
- func IntersectionSize(a, b map[string]struct{}) int
- func Jaccard(a, b map[string]struct{}) float64
- func JaccardSignificantWords(a, b string) float64
- func OverlapCoefficient(a, b map[string]struct{}) float64
- func SignificantWordSet(s string) map[string]struct{}
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
func IntersectionSize ¶
IntersectionSize returns the count of tokens present in both sets.
func JaccardSignificantWords ¶
JaccardSignificantWords returns Jaccard similarity over lower-cased word tokens after stripping common punctuation and dropping tokens shorter than three bytes.
func OverlapCoefficient ¶
OverlapCoefficient returns intersection/min(len(a), len(b)) for two token sets.
func SignificantWordSet ¶
SignificantWordSet tokenizes text for lightweight natural-language similarity.
Types ¶
This section is empty.
Click to show internal directories.
Click to hide internal directories.