Documentation
¶
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type Scraper ¶
type Scraper struct {
URL *urlpkg.URL // contains the main URL to parse, will be modified in case of a redirect
Client download.HttpClient
Fs afero.Fs // filesystem
// ETagsDB stores ETags (hashes of file state) for each URL
ETagsDB *db.DB
// contains filtered or unexported fields
}
Scraper contains all scraping data, starts the process and handles the concurrency. It includes the logic to decide what URLs to include/exclude and when to stop.
func (*Scraper) Downloader ¶
Click to show internal directories.
Click to hide internal directories.