# Bigtable Automation

## Background
This directory contains the Google Cloud Function involved in BT cache
generation. The Cloud Function is first notified by a Borg pipeline, via
`init.txt`, when the cache files have been uploaded to GCS. In response, it
creates a new BT table, scales up the node count (for faster loads; base cache
only), kicks off the CsvImport Dataflow job and registers `launched.txt`.
After the Dataflow job has run to completion, it notifies this Cloud Function
again, via `completed.txt`. In response, the function scales down the node
count (base cache only). In the future, for the branch cache, it will also
notify Mixer.
All of the above `.txt` files are created in a per-cache directory, which
allows for concurrent cache builds and tracking of past state.
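The control-file flow above can be sketched as a dispatch on the name of the
GCS object that triggered the function. This is an illustrative sketch, not
the actual implementation: the `GCSEvent` struct, paths, and return values are
hypothetical, and the real handler performs BT and Dataflow calls instead of
returning strings.

```go
package main

import (
	"fmt"
	"path/filepath"
	"strings"
)

// GCSEvent mirrors the subset of a GCS trigger payload needed here
// (hypothetical field set; the real event carries more fields).
type GCSEvent struct {
	Name string // e.g. "control/base/2023_01_01/init.txt"
}

// handleTrigger dispatches on the control file that was written. The
// directory portion identifies one cache build, which is what lets
// concurrent builds proceed without colliding.
func handleTrigger(e GCSEvent) (string, error) {
	cacheDir := filepath.Dir(e.Name)
	switch filepath.Base(e.Name) {
	case "init.txt":
		// Create the BT table, scale up nodes, launch Dataflow,
		// then register launched.txt.
		return fmt.Sprintf("launch import for %s", cacheDir), nil
	case "launched.txt":
		// State marker only; nothing to do on this trigger.
		return "", nil
	case "completed.txt":
		// Scale the node count back down (base cache only).
		return fmt.Sprintf("finalize import for %s", cacheDir), nil
	default:
		if strings.HasSuffix(e.Name, ".txt") {
			return "", fmt.Errorf("unknown control file: %s", e.Name)
		}
		return "", nil
	}
}

func main() {
	msg, _ := handleTrigger(GCSEvent{Name: "control/base/2023_01_01/init.txt"})
	fmt.Println(msg)
}
```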
There are two distinct Cloud Functions: one used for production imports, with
entry point `ProdBTImportController`, and the other used for private imports,
with entry point `PrivateBTImportController`. The two functions have the same
workflow as described below, except that their GCS folder structures differ.
The production entry point can be tested locally through `local/main.go`.
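One way the two entry points can share a single workflow is for each to
delegate to a common handler with its own folder configuration. A minimal
sketch, assuming hypothetical config fields and prefixes (the real GCS layout
is not shown in this document):

```go
package main

import "fmt"

// importConfig captures what differs between the two deployments:
// the GCS folder layout (prefixes here are illustrative).
type importConfig struct {
	controlPrefix string
	dataPrefix    string
}

// btImportController is the shared workflow; both exported entry
// points delegate to it with their own config.
func btImportController(cfg importConfig, object string) string {
	return fmt.Sprintf("processing %s under %s", object, cfg.controlPrefix)
}

// ProdBTImportController is the production entry point.
func ProdBTImportController(object string) string {
	return btImportController(
		importConfig{controlPrefix: "prod/control", dataPrefix: "prod/data"}, object)
}

// PrivateBTImportController is the private-import entry point.
func PrivateBTImportController(object string) string {
	return btImportController(
		importConfig{controlPrefix: "private/control", dataPrefix: "private/data"}, object)
}

func main() {
	fmt.Println(ProdBTImportController("init.txt"))
}
```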
## Validate BT Import End-to-end using (GCS | BT) Test Environment

- First start the Cloud Function locally, as follows:

  `./base/local/deploy.sh`

- A test branch cache exists in this folder. In case a test was run with that
  cache before, clean up first:

  `./base/local/test.sh cleanup`

- Fake an init trigger from the Borg pipeline:

  `./base/local/test.sh init`

  To validate this step:

- Fake a completion trigger from the Dataflow job:

  `./base/local/test.sh completed`

  Validate this step by confirming that the `prophet-test` BT instance now has
  1 node.
## Deployment
After validating the change in the test environment, deploy to PROD by running:

`./base/gcp/deploy.sh base`

`./base/gcp/deploy.sh branch`

When this completes, look at `prophet-cache-trigger` on the GCP console to
verify the new version.
To deploy the private GCF, identify the environment, pick the corresponding
yaml files in `private/*.yaml`, and run:

`./base/custom/deploy.sh <env>`