orchestrate

package
v1.3.3 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Feb 21, 2026 License: Apache-2.0 Imports: 10 Imported by: 0

Documentation

Index

Constants

This section is empty.

Variables

This section is empty.

Functions

func GetAllSiteKeys

func GetAllSiteKeys(appCfg *config.AppConfig) []string

GetAllSiteKeys returns all site keys from the config

func ValidateSiteKeys

func ValidateSiteKeys(appCfg *config.AppConfig, siteKeys []string) error

ValidateSiteKeys checks that all provided site keys exist in the config

Types

type Orchestrator

type Orchestrator struct {
	// contains filtered or unexported fields
}

Orchestrator manages parallel crawling of multiple sites

func NewOrchestrator

func NewOrchestrator(appCfg *config.AppConfig, siteKeys []string, resume bool, log *logrus.Entry) *Orchestrator

NewOrchestrator creates a new orchestrator for parallel site crawling

func (*Orchestrator) Cancel

func (o *Orchestrator) Cancel()

Cancel cancels all running crawls

func (*Orchestrator) GetProgress

func (o *Orchestrator) GetProgress() []crawler.CrawlerProgress

GetProgress returns the current progress of all sites

func (*Orchestrator) Run

func (o *Orchestrator) Run() []SiteResult

Run starts crawling all sites in parallel and waits for completion

type SiteResult

type SiteResult struct {
	SiteKey        string
	Success        bool
	Error          error
	PagesProcessed int64
	Duration       time.Duration
}

SiteResult contains the result of crawling a single site

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL