tipoca-stream

module
v1.0.0-beta.1 Latest Latest
Warning

This package is not in the latest version of its module.

Go to latest
Published: Jun 17, 2021 License: Apache-2.0

README

tipoca-stream

CI Status


Near realtime cloud native data pipeline. Just another data pipeline.

Install

The pipeline is a combination of services deployed independently. This repo holds the code for the redshiftsink only.

  • RedshiftSink Please follow REDSHIFTSINK.md to install the RedshiftSink Kubernetes Operator. Creating the RedshiftSink resource installs Batcher and Loader pods in the cluster. These pods sinks the data from Kafka topics to Redshift, it takes care of the database migration when required. Redshiftsink has a rich masking support. It also supports table reloads in Redshift when masking configurations are modified in Github.
      kubectl get redshiftsink
  • Kafka Install Kafka using Strimzi CRDs or self hosted or managed kafka.
      kubectl get kafka
  • Producer Install Producer using Strimzi CRDs and Debezium. Creating the kafkaconnect and kafkaconnector creates a kafkaconnect pod in the cluster which start streaming the data from the source(MYSQL, RDS, etc..) to Kafka.
      kubectl get kafkaconnect
      kubectl get kafkaconnector

The project has pluggable libraries which can be composed to solve any other data pipeline use case.

Contribute

Please follow this to bring a change.

Thanks

Directories

Path Synopsis
api
v1
Package v1 contains API Schema definitions for the tipoca v1 API group +kubebuilder:object:generate=true +groupName=tipoca.k8s.practo.dev
Package v1 contains API Schema definitions for the tipoca v1 API group +kubebuilder:object:generate=true +groupName=tipoca.k8s.practo.dev
cmd
redshiftbatcher command
redshiftloader command
redshiftsink command
pkg
git

Jump to

Keyboard shortcuts

? : This menu
/ : Search site
f or F : Jump to
y or Y : Canonical URL