Directories
¶
| Path | Synopsis |
|---|---|
|
cmd
|
|
|
sparkflow-server
command
Command sparkflow-server starts the Sparkflow orchestrator server.
|
Command sparkflow-server starts the Sparkflow orchestrator server. |
|
sparkflow-worker
command
Command sparkflow-worker starts a Sparkflow worker node.
|
Command sparkflow-worker starts a Sparkflow worker node. |
|
sparkflowctl
command
Command sparkflowctl is the CLI tool for interacting with a Sparkflow server.
|
Command sparkflowctl is the CLI tool for interacting with a Sparkflow server. |
|
internal
|
|
|
hadoop
Package hadoop provides HDFS and YARN client implementations using the WebHDFS and YARN Resource Manager REST APIs.
|
Package hadoop provides HDFS and YARN client implementations using the WebHDFS and YARN Resource Manager REST APIs. |
|
server
Package server provides the main gRPC and HTTP server for the Sparkflow orchestrator.
|
Package server provides the main gRPC and HTTP server for the Sparkflow orchestrator. |
|
spark
Package spark provides Spark Connect client and spark-submit wrapper for submitting and monitoring Spark jobs.
|
Package spark provides Spark Connect client and spark-submit wrapper for submitting and monitoring Spark jobs. |
|
worker
Package worker provides the worker node implementation that receives and executes tasks dispatched by the scheduler.
|
Package worker provides the worker node implementation that receives and executes tasks dispatched by the scheduler. |
|
pkg
|
|
|
consensus
Package consensus provides Raft-based distributed consensus for leader election and state replication using hashicorp/raft.
|
Package consensus provides Raft-based distributed consensus for leader election and state replication using hashicorp/raft. |
|
dag
Package dag provides directed acyclic graph structures for workflow definitions.
|
Package dag provides directed acyclic graph structures for workflow definitions. |
|
executor
Package executor defines the Executor interface and provides implementations for running tasks via shell, Python, Docker, Spark, Hadoop, and HDFS.
|
Package executor defines the Executor interface and provides implementations for running tasks via shell, Python, Docker, Spark, Hadoop, and HDFS. |
|
observability
Package observability provides OpenTelemetry tracing, Prometheus metrics, and structured logging setup for Sparkflow components.
|
Package observability provides OpenTelemetry tracing, Prometheus metrics, and structured logging setup for Sparkflow components. |
|
retry
Package retry provides retry policies with various backoff strategies and a dead letter queue for tasks that exhaust their retries.
|
Package retry provides retry policies with various backoff strategies and a dead letter queue for tasks that exhaust their retries. |
|
scheduler
Package scheduler provides task scheduling strategies including FIFO, priority-based, and distributed scheduling with cron support.
|
Package scheduler provides task scheduling strategies including FIFO, priority-based, and distributed scheduling with cron support. |
|
store
Package store defines the storage interface and provides in-memory and PostgreSQL implementations for DAG and task run persistence.
|
Package store defines the storage interface and provides in-memory and PostgreSQL implementations for DAG and task run persistence. |
Click to show internal directories.
Click to hide internal directories.