Documentation
¶
Overview ¶
Package tokenizer provides basic text tokenization functionality.
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type Tokenizer ¶
type Tokenizer struct {
// contains filtered or unexported fields
}
Tokenizer provides basic text tokenization functionality. This is a highly simplified example (whitespace tokenization). A feature-complete tokenizer would implement subword algorithms (BPE, WordPiece, SentencePiece).
Click to show internal directories.
Click to hide internal directories.