wordlevel

package
v0.2.4
Warning

This package is not in the latest version of its module.

Published: Apr 10, 2025 | License: Apache-2.0 | Imports: 6 | Imported by: 0

Documentation

Constants

This section is empty.

Variables

This section is empty.

Functions

This section is empty.

Types

type WordLevel

type WordLevel struct {
	// contains filtered or unexported fields
}

WordLevel is a model for building a word-level tokenizer.

func New

func New(vocab map[string]int, unkToken string) (*WordLevel, error)

New creates a new WordLevel from the given vocab and unknown token.

func NewWordLevel

func NewWordLevel() *WordLevel

NewWordLevel initializes a new WordLevel.

func NewWorldLevelFromFile

func NewWorldLevelFromFile(vocabFile string, unkToken string) (*WordLevel, error)

NewWorldLevelFromFile initializes a WordLevel from a vocab file.

func (*WordLevel) GetVocab

func (wl *WordLevel) GetVocab() (retVal map[string]int)

GetVocab returns the model's vocab.

func (*WordLevel) GetVocabSize

func (wl *WordLevel) GetVocabSize() (retVal int)

GetVocabSize returns the size of the vocab.

func (*WordLevel) IdToToken

func (wl *WordLevel) IdToToken(id int) (string, bool)

IdToToken returns the token for the given id, if it exists.

func (*WordLevel) Save

func (wl *WordLevel) Save(dir string, nameOpt ...string) (err error)

Save saves the vocab to a file in the given directory.

func (*WordLevel) TokenToId

func (wl *WordLevel) TokenToId(token string) (int, bool)

TokenToId returns the id of the given token, if it exists.
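
TokenToId and IdToToken both return a value plus a bool, which matches Go's comma-ok idiom. The following is a minimal sketch of how such a pair could be backed by a forward map and a derived reverse map. The `lookup` type, its fields, and `newLookup` are hypothetical stand-ins for illustration, not the package's actual unexported fields.

```go
package main

import "fmt"

// lookup is a hypothetical stand-in for the model's unexported state.
type lookup struct {
	vocab  map[string]int // token -> id
	vocabR map[int]string // id -> token, derived from vocab
}

// newLookup builds the reverse map once so both directions are map lookups.
func newLookup(vocab map[string]int) *lookup {
	r := make(map[int]string, len(vocab))
	for tok, id := range vocab {
		r[id] = tok
	}
	return &lookup{vocab: vocab, vocabR: r}
}

// TokenToId mirrors the documented signature; the bool reports a hit.
func (l *lookup) TokenToId(token string) (int, bool) {
	id, ok := l.vocab[token]
	return id, ok
}

// IdToToken mirrors the documented signature; the bool reports a hit.
func (l *lookup) IdToToken(id int) (string, bool) {
	tok, ok := l.vocabR[id]
	return tok, ok
}

func main() {
	l := newLookup(map[string]int{"[UNK]": 0, "hello": 1, "world": 2})
	if id, ok := l.TokenToId("hello"); ok {
		fmt.Println("hello ->", id)
	}
	if _, ok := l.IdToToken(99); !ok {
		fmt.Println("id 99 is not in the vocab")
	}
}
```

Checking the bool rather than the returned value matters because id 0 and the empty string are also the zero values returned on a miss.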

func (*WordLevel) Tokenize

func (wl *WordLevel) Tokenize(token string) ([]tokenizer.Token, error)

Tokenize transforms the given input into tokens.
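
As a rough illustration of word-level tokenization, the sketch below looks a word up in the vocab and falls back to the unknown token when the word is missing. It assumes, without confirmation from this page, that an error is returned when the unknown token itself is absent from the vocab. The `token` type and the `tokenizeWord` function are hypothetical and much simpler than `tokenizer.Token` and the real method.

```go
package main

import (
	"errors"
	"fmt"
)

// token is a simplified, hypothetical stand-in for tokenizer.Token.
type token struct {
	ID    int
	Value string
}

// tokenizeWord maps one word to one token.
// Assumption: unknown words map to unkToken, and it is an error
// if unkToken is not in the vocab either.
func tokenizeWord(vocab map[string]int, unkToken, word string) ([]token, error) {
	if id, ok := vocab[word]; ok {
		return []token{{ID: id, Value: word}}, nil
	}
	if id, ok := vocab[unkToken]; ok {
		return []token{{ID: id, Value: unkToken}}, nil
	}
	return nil, errors.New("unknown token is missing from the vocab")
}

func main() {
	vocab := map[string]int{"[UNK]": 0, "hello": 1}
	for _, w := range []string{"hello", "goodbye"} {
		toks, err := tokenizeWord(vocab, "[UNK]", w)
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Printf("%q -> id %d (%s)\n", w, toks[0].ID, toks[0].Value)
	}
}
```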

type WordLevelBuilder

type WordLevelBuilder struct {
	// contains filtered or unexported fields
}

WordLevelBuilder is a builder for the WordLevel model.

func NewWordLevelBuilder

func NewWordLevelBuilder() *WordLevelBuilder

NewWordLevelBuilder creates a WordLevelBuilder with default values

func (*WordLevelBuilder) Build

func (wlb *WordLevelBuilder) Build() *WordLevel

Build builds a WordLevel using the builder's configuration.

func (*WordLevelBuilder) UnkToken

func (wlb *WordLevelBuilder) UnkToken(unkToken string)

UnkToken sets the `UNK` token for the vocab.

func (*WordLevelBuilder) Vocab

func (wlb *WordLevelBuilder) Vocab(vocab map[string]int)

Vocab sets the vocab (token -> id) mapping.
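
The setter signatures above return nothing, so builder calls are written as separate statements rather than chained. The sketch below illustrates that call pattern with hypothetical stand-in types (`model`, `builder`, `newBuilder`) that mirror the documented method shapes. The default values shown are placeholders, since the real defaults are not listed on this page.

```go
package main

import "fmt"

// model and builder are hypothetical stand-ins that mirror the
// documented method shapes of WordLevel and WordLevelBuilder.
type model struct {
	vocab    map[string]int
	unkToken string
}

type builder struct {
	config model
}

// newBuilder starts from placeholder defaults (an assumption).
func newBuilder() *builder {
	return &builder{config: model{vocab: map[string]int{}, unkToken: "<unk>"}}
}

// Vocab sets the token -> id mapping; it returns nothing, so no chaining.
func (b *builder) Vocab(vocab map[string]int) { b.config.vocab = vocab }

// UnkToken sets the unknown token; it returns nothing, so no chaining.
func (b *builder) UnkToken(unkToken string) { b.config.unkToken = unkToken }

// Build returns a model holding the current configuration.
func (b *builder) Build() *model {
	m := b.config
	return &m
}

func main() {
	b := newBuilder()
	b.Vocab(map[string]int{"[UNK]": 0, "hello": 1})
	b.UnkToken("[UNK]")
	m := b.Build()
	fmt.Println("vocab size:", len(m.vocab), "unk token:", m.unkToken)
}
```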
