Documentation
¶
Overview ¶
Amerigo is an individual website crawler. It is named after Amerigo Vespucci, the 15th century Italian explorer and cartographer.
It crawls a website to output the pages and their relationships. It is designed to take a the website graph and output the serialization as it crawls. This output is then fed to tools which may use the data as they please.
Directories
¶
| Path | Synopsis |
|---|---|
|
Package crawler implements an internal website crawler.
|
Package crawler implements an internal website crawler. |
|
Package page deals with representing HTML pages and their connections
|
Package page deals with representing HTML pages and their connections |
|
Package resource implements the collection of URIs and their types
|
Package resource implements the collection of URIs and their types |
Click to show internal directories.
Click to hide internal directories.