scrape-go
A tiny scraper built with Golang. Work in progress!
I'm very aware of gocolly/colly but would love to experiment with scraping and learning a bit more than by using a framework directly from the start.
See TODO.
A well meant advice
For heavens sake please don't use this tool for anything serious. I'm mainly experimenting here and don't want to build a tool for the whole world. I'm mainly figuring out how far I can get with my Go knowledge, how to make my code work, and once I reach my limit, I try to learn enough to just about clear the next hurdle.
License
See LICENSE.
Usage
$ scrape-go -h
Usage of scrape-go:
-allow-duplicate-urls
Allow duplicated links. Only takes effect if URL scraping is active.
-base-url string
The base URL to start with.
-images
Set to true if you want to scrape images.
-images-dir string
The directory to save scraped images to. (default "./images/")
-link-depth int
Maximum scraping depth. (default 5)
-url-outfile string
The file to write scraped URLs to. (default "urls.txt")
-urls
Set to true if you want to scrape deep link URLs.
Try it
go run cmd/scrape/main.go -urls -base-url=https://torbentechblog.com