integrations/

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

Links

Open Source Insights

README ¶

Integrations Architecture

The integration system compiles provider builders into a single provider package per definition. That package covers setup, auth, credential collection, client construction, operations, mappings, and webhooks.

Core Model

Integration (per-tenant record)
│
│  user input ·········· non-secret config
│  provider data ······· active credentialRef
│  metadata ············ external account / workspace / tenant identity
│
└─► Definition (reusable provider package, shared across integrations)
    │
    │  OperatorConfig ·· process-wide secrets and config the builder needs
    │  UserInput ······· per-integration config collected from the user
    │
    ├── Credential Registrations ─────────────────────────────────────────────┐
    │   Declare what secrets the provider needs (API keys, tokens, etc.)      │
    │   and how to collect them from the user                                 │
    │                                                                         │
    ├── Client Registrations ─────────────────────────────────────────────────┤
    │   Factories that build SDK/API clients from credentials                │
    │   Cached by Keystore per integration so we don't rebuild on every call  │
    │                                                                         │
    ├── Mappings ─────────────────────────────────────────────────────────────┤
    │   CEL expressions that transform provider data into our                 │
    │   internal data model                                                   │
    │                                                        ╭─ references ──╯
    │                                                        ▼
    ├── Connections ───────────────────────────────────────────────────────────
    │   Wire together credential slots + clients from above, and configure:
    │     Auth ············· how to acquire credentials (browser, app-install, etc.)
    │     Disconnect ······· how to tear down (revoke tokens, uninstall app, etc.)
    │     Validation op ···· how to verify credentials work before saving
    │     Resolve ·········· how to identify the external account/workspace (what did we connect to)
    │
    ├── Operations ────────────────────────────────────────────────────────────
    │   Actions the provider can perform (sync data, send messages, etc.)
    │     Client ref ······· which client to use if a client is needed
    │     Ingest ··········· which internal data models this op can produce
    │     Policy ··········· inline vs queued, reconcile-eligible or not
    │                        (Reconcile ops auto-dispatch on first connection)
    │
    └── Webhooks ──────────────────────────────────────────────────────────────
        Receive inbound events from the provider
          Events ··········· each event type has its own handler and ingest
                             reconciled on first successful connection

Who runs what

Component	What it does
Builders	Compile definitions at startup
Registry	Validate and index definitions
Runtime	Integration lifecycle, credential updates, operation dispatch, ingest, webhook routing
Keymaker	Temporary auth sessions and callback validation
Keystore	Long-lived credentials and pooled client caching

Where data lives

We store these separately because they change at different rates and for different reasons - credential rotation shouldn't require touching user config, and temporary auth callbacks shouldn't share a lifecycle with long-lived secrets.

Data	Scope	Changes when	Examples
Operator config	Process-wide	Deploy	Client IDs, secrets, redirect URLs
User input	Per integration	Reconfigure	Non-secret config fields
Credentials	Per integration + slot	Rotation	OAuth tokens, API keys
Provider data	Per integration	Connection change	Active `credentialRef`
Integration metadata	Per integration	First connection	Account name, workspace, tenant ID
Webhook rows	Per integration + webhook	Creation	Endpoint identity, verification secret
Keymaker auth data	Per auth flow	Ephemeral	Callback nonces (not the stored credential)

Setup flow

The runtime walks through the same reconciliation path regardless of how the credential arrives (direct input, OAuth exchange, or app-install callback). The connection's fields determine which steps actually run, so adding auth or validation to a connection automatically changes the setup behavior without touching the runtime:

Resolve or create the integration
If user input is provided, validate it against the definition's schema and persist
Resolve the connection from credentialRef
If Auth is configured, run the auth flow (start/redirect/callback) to produce the credential
Validate the credential against the slot's schema
If a ValidationOperation is configured, execute it as a health check
Save the credential to keystore
Persist the active connection in provider data
If Integration.Resolve is configured, derive and save integration metadata
On first successful connection: reconcile webhooks and dispatch reconcile-eligible operations

sequenceDiagram
    participant Caller
    participant API as API / Auth Callback
    participant Runtime
    participant Provider
    participant Keystore
    participant Store as Integration Store
    participant Reconcile as Reconcile Loop

    Caller->>API: configure integration or start auth
    API->>Runtime: ensure integration
    alt Auth configured on connection
        Runtime->>Provider: start/complete auth flow
        Provider-->>Runtime: credential + integration identity
    else Direct credential
        Caller->>Runtime: credential via API
    end
    opt ValidationOperation configured
        Runtime->>Provider: health-check credential
    end
    Runtime->>Keystore: save credential
    opt Integration.Resolve configured
        Runtime->>Provider: derive integration metadata
    end
    Runtime->>Store: save provider data, metadata, webhooks
    opt First successful connection
        Runtime->>Reconcile: start reconcile cycle
    end
    Reconcile->>Runtime: dispatch reconcile-eligible operations
    Runtime->>Provider: execute operations
    Provider-->>Runtime: raw results or ingest payloads

Execution Model

All entrypoints - manual API calls, the reconcile scheduler, the workflow engine, and inbound webhooks - converge on the same execution and ingest path. This means every provider operation gets the same credential resolution, client caching, run tracking, and ingest behavior regardless of what triggered it. The runtime loads the integration and definition, builds or reuses a client, runs the operation, and either returns a raw result or feeds the output through mappings into our internal data model.

Manual callers can run inline operations synchronously or queue persistent runs
Reconciliation dispatches the definition's non-inline operations after the first successful connection
Workflow-triggered runs add workflow metadata and complete through the same run path
Webhooks either dispatch operations or emit ingest directly; delivery IDs provide idempotency when available
Push-style integrations use the same flow but skip per-integration secret credentials and recurring polling (this is basically only SCIM but a good example of the built in flexibility to accommodate the wide variety of capabilities under the same roof)

Ingest Handoff

Provider operations and webhook handlers usually stop at provider-shaped payloads. We split it this way so provider code only worries about collecting data from the external API, and doesn't need to know anything about our internal models or persistence. The shared ingest path takes over from there:

Verify the emitted schema is declared by the operation or webhook
Resolve the mapping for that schema and variant
Apply integration-level and definition-shipped CEL filters
Apply the CEL map expression
Decode and persist the result into our internal data model

Stable Names and Schema-Derived IDs

Many identifiers in a definition (credential slot names, operation names, webhook event names) end up stored in the database, used in routing, and returned in API responses. Renaming them is a breaking change, so how we produce them matters.

We considered a few approaches:

Database-generated IDs - these are opaque, so you can't look at a credential slot or operation and know what it is without a lookup. They also create a chicken-and-egg problem: the definition needs to reference its own parts before they're persisted
Developer-assigned slugs - these work, but they're hand-written strings that duplicate information already captured by the Go type. Two developers can independently pick conflicting names, and there's no compile-time signal when that happens
Go type-derived names (what we do) - the name of the Go struct that defines a credential, operation config, or webhook event is the identifier. The compiler already enforces uniqueness within a package, so collisions between definitions are structurally impossible without deliberate effort. Renaming a type is a visible, reviewable change

The tradeoff is that Go type names become part of the public interface. Renaming GitHubAppToken to GitHubInstallationToken is a migration, not a refactor. We accept this because the alternative - invisible string drift between code and stored identifiers - is worse.

A few identifiers (definition IDs, webhook names) are still set explicitly because they don't map cleanly to a single Go type. Client IDs are ephemeral and in-process only, so they don't need stability at all.

Topics for operations and webhook events are built deterministically from definition ID + operation/event name. Once names are unique within a definition, topic uniqueness follows automatically. Refs are bound once at package scope and reused everywhere, so there's one declaration site per identifier.

Evolving a Definition

Adding a new credential or auth mode

Choose the credential-slot name first - this becomes part of the public interface
Define the credential schema around the final persisted secret, not temporary auth data
Register the slot and add or update a connection that selects it
On that connection, declare every participating slot, enabled client, validation logic, auth logic, integration metadata derivation, and disconnect cleanup
If existing clients are shared across modes, make credential resolution deterministic
Test new install, update-in-place, mode switching, and disconnect

Keep non-secret selectors in user input or integration metadata. Don't overload credential payloads with display or routing data.

Adding a new operation

Choose the operation name before writing logic - if it's schema-derived, the root config type name is part of the public interface
Define a typed config schema even if the input is small
Decide whether the operation is inline or queued, client-backed or clientless, and raw-result or ingest-producing
Register it with one execution path only. If it emits ingest, declare every internal model it can produce
Add or update mappings for each emitted schema and variant
Test provider logic first, then runtime validation and queueing behavior, then ingest behavior when applicable

CEL guidance

Available variables are envelope, variant, resource, action, and payload.

Use CEL for filtering and projection, not heavy normalization. If the expression starts encoding provider-specific control flow, that logic belongs in provider code instead.

Empty filter means allow the payload through
Empty map means use the raw payload as-is
Prefer small, explicit object construction over large ad hoc expressions
Preserve the fields the target model needs for required, upsert, and lookup fields
Use integration-level filters for tenant scoping and definition-shipped filters for payload semantics

Builder gotchas

The selecting credentialRef must belong to the connection that uses it
Auth and disconnect must refer only to slots in that same connection
A credential slot with no user-facing schema is meaningful, not incomplete. Auth-managed slots work this way because the browser or app-install flow produces the credential, so there's nothing to render in a manual form
Adding a credential schema to an auth-managed slot changes setup UX: it now exposes fields for manual entry
A client can support multiple slots, but the build path still has to resolve one unambiguous credential shape
An operation must have exactly one execution path
Ingest-producing operations must declare what internal models they emit
Integration metadata should represent the external system being connected, not transient setup details
The first successful connection can activate recurring behavior such as reconciliation, not just mark the row connected

`integrationgenerated` and Ent Annotations

The integration ingest layer is partly handwritten and partly generated during entc post-generation. We use generation so provider definitions can target internal models without hand-maintaining the same metadata, topic shapes, and baseline persistence wiring in multiple places.

Ent integration-mapping annotations
└── entc post-generation hook
    ├── internal/ent/integrationgenerated
    │   ├── generated schema metadata
    │   └── generated typed ingest types and topics
    └── internal/integrations/operations
        ├── generated ingest routing
        └── per-schema persistence stubs

The annotations do three jobs:

Schema-level opt-in marks an internal model as part of the integration mapping surface
Schema-level options can request stock persistence scaffolding or exclude fields from that surface
Field-level flags mark allowed mapped fields and distinguish general fields, upsert keys, lookup keys, and fields sourced from integration context

Generation produces three things the runtime actually uses:

Shared metadata describing supported models and their allowed, required, upsert, and lookup keys
Typed second-stage ingest types and topics for each supported model
Ingest routing plus per-schema persistence stubs

"Stock persistence" means generated starting point, not final implementation. Straightforward upserts may work unchanged; anything that needs extra lookup resolution, edge resolution, or conflict handling usually needs custom persistence logic.

Inside operations, the safe rule is:

Anything marked generated and DO NOT EDIT should be regenerated, not hand-edited
Anything marked as a starting point is intended to be customized
Handwritten cross-cutting orchestration is also safe to change, but those edits affect every provider rather than one model

Directories ¶

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL

Path	Synopsis
auth Package auth provides shared authentication helpers and protocol types for integration auth flows	Package auth provides shared authentication helpers and protocol types for integration auth flows
definitions
awssecurityhub Package awssecurityhub defines the consolidated AWS Security Hub and Audit Manager integration definition.	Package awssecurityhub defines the consolidated AWS Security Hub and Audit Manager integration definition.
azureentraid Package azureentraid provides the Azure Entra ID integration definition for integrations	Package azureentraid provides the Azure Entra ID integration definition for integrations
azuresecuritycenter Package azuresecuritycenter provides the Azure Security Center integration definition for integrations	Package azuresecuritycenter provides the Azure Security Center integration definition for integrations
catalog Package catalog exposes the built-in reference definition builders for integrations	Package catalog exposes the built-in reference definition builders for integrations
cloudflare Package cloudflare provides the Cloudflare integration definition for integrations	Package cloudflare provides the Cloudflare integration definition for integrations
gcpscc Package gcpscc provides the GCP Security Command Center integration definition for integrations	Package gcpscc provides the GCP Security Command Center integration definition for integrations
githubapp Package githubapp defines the GitHub App reference definition for integrations	Package githubapp defines the GitHub App reference definition for integrations
googleworkspace Package googleworkspace provides the Google Workspace integration definition for integrations	Package googleworkspace provides the Google Workspace integration definition for integrations
microsoftteams Package microsoftteams provides the Microsoft Teams integration definition for integrations	Package microsoftteams provides the Microsoft Teams integration definition for integrations
okta Package okta provides the Okta integration definition for integrations	Package okta provides the Okta integration definition for integrations
scim Package scim defines the SCIM reference definition for integrations	Package scim defines the SCIM reference definition for integrations
slack Package slack provides the Slack integration definition for integrations	Package slack provides the Slack integration definition for integrations
operations Package operations dispatches and executes definition-scoped operations for integrations	Package operations dispatches and executes definition-scoped operations for integrations
providerkit Package providerkit provides shared helpers used by integration definition implementations which assist in building consistent and robust integrations while reducing boilerplate.	Package providerkit provides shared helpers used by integration definition implementations which assist in building consistent and robust integrations while reducing boilerplate.
registry Package registry stores definition registrations for the greenfield integration runtime	Package registry stores definition registrations for the greenfield integration runtime
runtime Package runtime wires the integrations services into one executable runtime	Package runtime wires the integrations services into one executable runtime
types Package types defines the greenfield integration definition and registry contracts	Package types defines the greenfield integration definition and registry contracts