Documentation
¶
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
Types ¶
type Environment ¶
type Environment interface {
Reset() (map[string]interface{}, error)
Step(action map[string]interface{}) (map[string]interface{}, error)
Observation() map[string]interface{}
Close() error
}
Environment RL 训练环境接口
type ToolEnv ¶
type ToolEnv struct {
// contains filtered or unexported fields
}
ToolEnv 基于工具的RL环境
func (*ToolEnv) Observation ¶
type Trajectory ¶
type Trajectory struct {
TaskID string `json:"task_id"`
Steps []TrajectoryStep `json:"steps"`
FinalReward float64 `json:"final_reward"`
}
Trajectory 轨迹
type TrajectoryStep ¶
type TrajectoryStep struct {
Observation map[string]interface{} `json:"observation"`
Action map[string]interface{} `json:"action"`
Reward float64 `json:"reward"`
}
TrajectoryStep 轨迹步骤
Click to show internal directories.
Click to hide internal directories.