mcp-ynab

command module

v0.2.0 Latest Latest Go to latest Published: Apr 4, 2026 License: MIT Imports: 23 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/bold-minds/mcp-ynab

Links

Open Source Insights

README ¶

mcp-ynab

A read-only Model Context Protocol server for the YNAB budgeting API. Lets an LLM (Claude Desktop, Cursor, Claude Code, or any MCP-compatible client) inspect your plans, accounts, categories, transactions, and monthly summaries — without the ability to modify anything.

Built in Go against the official MCP Go SDK. One binary, stdio transport, no inbound network surface, OS-keyring token storage.

Why read-only

YNAB personal access tokens are long-lived and full-scope — they grant read and write on every plan on the account, and YNAB has no bulk-undo for mutations. An LLM that can call deleteTransaction is one prompt-injected transaction memo away from scrambling a budget with no recourse. This server deliberately exposes only the read endpoints, and the design has no code path that mutates YNAB data.

Tools

Read tools (always available)

Tool	Description
`list_plans`	List all YNAB plans (called "budgets" in the YNAB UI) owned by the authenticated user.
`list_accounts`	List accounts in a plan with current balances. Closed accounts excluded by default. Delta-synced.
`list_categories`	List all categories in a plan with this month's assigned / activity / balance amounts and goal details.
`list_transactions`	List transactions for a plan, most recent first. Filter by `since_date`, approval state (`type`), or scope — one of `account_id`, `category_id`, or `payee_id`. Unfiltered variant is delta-synced. Category / payee scoping flattens split transactions. Default limit 100, max 500.
`list_months`	Monthly rollup summaries (income, budgeted, activity, to_be_budgeted, age_of_money) for recent months, most recent first. Use for month-over-month trends. Default limit 6, max 60.
`get_month`	Full plan month with per-category breakdown. Accepts `"current"` or `YYYY-MM-01`. Use for the Sunday ritual / True Expenses check.
`list_scheduled_transactions`	Recurring and future-dated scheduled transactions in date_next order (soonest first). Optional `upcoming_days` filter.
`list_payees`	List payees in a plan. Optional `name_contains` performs a case-insensitive substring match — use to resolve payee names to IDs before calling `list_transactions` with `payee_id` or `ynab_spending_check` with `excluded_payee_ids`.

Task-shaped tools (composition over primitives)

Tool	Description
`ynab_status`	One-call Sunday ritual dashboard: Ready-to-Assign, overspent categories (with credit card payment categories excluded), debt accounts with optional APR enrichment, liquid accounts (checking/savings/cash), days-since-last-reconciled per account, unapproved count, and next-7-days scheduled cash flow with recurrence expansion.
`ynab_spending_check`	"Did I stay under $500 on groceries this week?" Sums net outflow across one or more categories over a date range, compares to a budget, returns `on_plan` verdict and offending transactions when over budget. Supports `excluded_payee_ids` for carve-outs like "except Chipotle on date nights".
`ynab_weekly_checkin`	Week-over-week comparison of income, outflows, and unapproved count, plus month-over-month newly-overspent categories and age-of-money delta.
`ynab_debt_snapshot`	Current debt balances + avalanche payoff projection. Integer basis-points simulation, no floats in the compounding loop. Optional `extra_per_month_milliunits` runs a comparison scenario. Returns structured negative-amortization error with shortfall amount when minimums can't cover interest.
`ynab_waterfall_assignment`	Advisory, no writes. Walks a priority waterfall given per-category `need_milliunits` the skill has computed, returns proposed allocations and remainder. The LLM presents the plan; if approved, issues `update_category_budgeted` calls separately.

Write tools (opt-in, require `YNAB_ALLOW_WRITES=1`)

Write tools are not registered at startup unless YNAB_ALLOW_WRITES=1 is set in the MCP server's environment. When disabled, they do not appear in tools/list and cannot be called at all.

Tool	Description
`create_transaction`	Create a new transaction. Asks the MCP client to confirm via elicitation. Amounts > $10K require an `amount_override_milliunits` echo-back acknowledgment. Provide an `import_id` to dedupe idempotently on retry.
`update_category_budgeted`	Change the assigned amount on a single category for a single plan month. The primitive for Rule 3 money moves during the Sunday ritual. Returns before/after snapshots of budgeted and balance.
`update_transaction`	Partial update of a transaction: category, payee, memo, approved state, cleared state, flag color. Amount changes are structurally not supported — the input struct has no amount field, enforced by a reflection regression test.
`approve_transaction`	Convenience wrapper setting `approved=true` on a transaction. Skips per-call elicitation to support batch daily pending-cleanup workflows. The `YNAB_ALLOW_WRITES=1` env-var gate remains the primary defense.

All read and task-shaped tools advertise readOnlyHint: true in their MCP annotations. Write tools advertise readOnlyHint: false.

Tool counts:

Without YNAB_ALLOW_WRITES: 13 tools (8 reads + 5 task-shaped)
With YNAB_ALLOW_WRITES=1: 17 tools (add 4 writes)

What each tool enables

Use case	Tool(s)
"What's my grocery spend this week?"	`list_categories` → find grocery id → `list_transactions` with `category_id`
Debt snapshot (all debt accounts + balances)	`list_accounts` (filter to creditCard / loan / debt types)
True Expenses check / overspending detection	`list_categories` or `get_month`
Month-over-month trend ("am I spending more than last month?")	`list_months`
"What's coming up this month?" (rent, subscriptions)	`list_scheduled_transactions` with `upcoming_days: 30`
Ready-to-Assign for waterfall conversation	`get_month` → `to_be_budgeted` field
Pattern detection across a payee	`list_transactions` with `payee_id`
Eating plan audit	`list_categories` → grocery/restaurant ids → `list_transactions` with `category_id` and `since_date`

Money representation

Every monetary value in a tool response is a Money object with two fields:

"balance": { "milliunits": 123456, "decimal": "123.456" }

milliunits is the authoritative int64 value — YNAB's native format, exact across every currency.
decimal is a pre-formatted string with 3 fractional digits (milliunit precision).

No code path in this server uses float64 for currency. Formatting is performed via integer arithmetic only. See money.go.

Install

Homebrew / prebuilt binaries

Download a release from GitHub Releases — static binaries for Linux, macOS, and Windows (amd64 and arm64).

Docker

docker pull ghcr.io/bold-minds/mcp-ynab:latest

Distroless-based image (no shell, no package manager), runs as non-root, static binary, stdio-only — no exposed ports.

From source

go install github.com/bold-minds/mcp-ynab@latest

Requires Go 1.25 or newer.

Configure

Get a YNAB personal access token

Log in to YNAB, go to Account Settings → Developer Settings, and create a Personal Access Token. Copy it once — you cannot retrieve it later.

Provide the token to the server

Three options, in order of preference. The server checks them in the order below and uses the first source that provides a non-empty value.

1. OS keyring (recommended)

Store the token once in your operating system's native credential store (macOS Keychain, Linux Secret Service, Windows Credential Manager):

echo -n "your-ynab-personal-access-token" | mcp-ynab store-token

The token is read from stdin — it never appears on the command line, so it never lands in shell history or /proc/PID/cmdline. Subsequent runs of mcp-ynab pick it up automatically with no environment variables needed.

To rotate, re-run store-token with the new value.

2. File-based secret

Useful for Docker secrets, systemd LoadCredential, or Kubernetes secret volumes:

export YNAB_API_TOKEN_FILE=/run/secrets/ynab_token

Keep the file at chmod 600.

3. Environment variable (plaintext)

The simplest option, but the token lives in plaintext in process environment:

export YNAB_API_TOKEN=your_personal_access_token

Wire it into your MCP client

Claude Desktop

Edit ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or the equivalent on your OS. If you have stored the token in the OS keyring, you don't need an env block at all:

{
  "mcpServers": {
    "ynab": {
      "command": "/path/to/mcp-ynab"
    }
  }
}

If you prefer file-based:

{
  "mcpServers": {
    "ynab": {
      "command": "/path/to/mcp-ynab",
      "env": {
        "YNAB_API_TOKEN_FILE": "/Users/you/.config/ynab/token"
      }
    }
  }
}

Do not paste the raw token into claude_desktop_config.json — that file is typically world-readable and may be backed up by iCloud / Time Machine. Use the keyring instead.

Cursor

Edit ~/.cursor/mcp.json:

{
  "mcpServers": {
    "ynab": {
      "command": "mcp-ynab"
    }
  }
}

(Token comes from the keyring.)

Other MCP clients

Any client that speaks the MCP stdio transport can launch mcp-ynab as a subprocess. Configure the command to run the binary and provide the token via YNAB_API_TOKEN, YNAB_API_TOKEN_FILE, or the OS keyring (recommended).

Security model

What this server protects against

Threat	Mitigation
Prompt-injected instructions telling the LLM to delete/modify financial data	No write tools exist. There is no code path that issues a `POST`, `PATCH`, `PUT`, or `DELETE` to YNAB.
Token leakage via log statements / `%+v` on a config struct	Token is wrapped in a redacting `Token` type. `String`, `GoString`, `Format`, `MarshalJSON`, `MarshalText` all return `[REDACTED]`. Raw value only accessible via a package-private `reveal()` called in exactly ONE place.
Token leakage via HTTP-client errors (axios-style config-in-error pattern)	Adversarial regression test: pathological RoundTripper returns errors containing the literal token; every tool's error path is asserted not to echo it.
Exfiltration of the YNAB token via a rogue URL (SSRF / spec injection)	Custom `http.RoundTripper` refuses any request whose hostname is not `api.ynab.com` (case-insensitive, port-tolerant). Strips `Authorization` defensively. Refuses all redirects.
Token leakage via YNAB error responses surfaced to the LLM	All YNAB errors go through `sanitize()`, which strips `Bearer <token>` and `Authorization:` patterns. No code path formats the raw token into an error.
Runaway LLM exhausting YNAB's per-token rate limit (200 req/hour)	Token-bucket rate limiter: 1 request per 20 seconds refill rate with a burst of 10 (max 190 calls/hour steady-state). Enforced in the RoundTripper.
Unbounded write access by an LLM with credentials	Write tools are opt-in. Unless `YNAB_ALLOW_WRITES=1` is set at MCP server startup, write tools are not registered at all and cannot be invoked. When writes are enabled, every write goes through an MCP elicitation confirmation, an amount safety cap with echo-back override for >$10K transactions, and returns before/after state in the response so the calling skill can persist an audit record.
Wrong-amount updates on existing transactions	`update_transaction` has no amount field on its input struct — amount changes are structurally impossible via this tool. Regression test enforces the field's absence via reflection.
Hung upstream	30-second per-request timeout, 8 MB response body cap.
Plaintext token storage	OS keyring is the recommended storage path; file-based and env-var are fallbacks documented with tradeoffs.
Inbound network attack on the server	stdio transport only — no listening socket, no HTTP endpoints.
Floating-point drift in money arithmetic	All currency stored as int64 milliunits. Formatting is integer arithmetic only. No `float64` anywhere in the money path.

Defense-in-depth

Token type: Token struct blocks every standard format/serialize path. UnmarshalJSON and UnmarshalText refuse, so an attacker-controlled payload cannot inject a valid Token.
Schema validation: tool arguments are validated against SDK-derived JSON Schemas before any handler runs. Missing required fields produce JSON-RPC -32602 invalid params protocol errors at the SDK layer — our handlers are never reached.
Error sanitization at the tool boundary: every handler wraps its error through sanitizedErr before returning, as a final defense even if an inner error escapes without going through the client's own sanitization.
No shell execution: no os/exec call anywhere in the credential or runtime path.
No hardcoded credentials: tests use fake sentinel values.

What it does NOT protect against

A fully compromised upstream api.ynab.com (TLS verification is on, but no certificate pinning).
A fully compromised MCP client (the client sees every tool response in plaintext).
An adversarial user on your own machine with read access to your keyring / token file.
The bare token value appearing in a YNAB error response body as a non-Bearer-prefixed substring (unlikely in practice).

Enabling writes

By default, mcp-ynab is read-only. To enable write tools, set YNAB_ALLOW_WRITES=1 in the MCP server's environment:

{
  "mcpServers": {
    "ynab": {
      "command": "mcp-ynab",
      "env": {
        "YNAB_ALLOW_WRITES": "1"
      }
    }
  }
}

When writes are enabled:

The 4 write tools appear in tools/list alongside the 13 read and task-shaped tools.
Every write goes through an MCP elicitation confirmation prompt (on clients that support it — Claude Code does, Claude Desktop does not; the env-var gate is the sole defense on clients without elicitation).
Amounts > $10,000 milliunits require an amount_override_milliunits parameter equal to the main amount, forcing the LLM to explicitly re-assert the value.
Every write returns before/after state in its response; the calling skill is responsible for persisting an audit trail from those responses (the MCP itself writes no logs to disk).

Known limitations

Delta sync is unfiltered-only. list_accounts and unfiltered list_transactions use last_knowledge_of_server for bandwidth savings. Filtered list_transactions (with since_date, type, or scope) always does a full fetch — YNAB's delta semantics on filtered endpoints are under-documented.
Delta cache has no TTL or size cap in v0.2.0. For single-session MCP processes (minutes to hours) this is fine; for long-running daemons it grows unbounded. See delta.go for the memory profile discussion.
English-only assumptions — see docs/ASSUMPTIONS.md for the full list, notably the "Credit Card Payments" group name match in ynab_status.
twiceAMonth schedules are approximated as 15-day advances in the recurrence iterator because YNAB's API doesn't expose the user's two anchor days. For 7-day dashboard windows this under-counts by at most one occurrence. See docs/ASSUMPTIONS.md.
Amount cap is currency-agnostic at 10 million milliunits. Correct for USD ($10K); tighter than intended for currencies with different subunit scales (e.g., JPY ~$70 USD equivalent).

Roadmap

Deferred from v0.2:

Currency-aware amount caps — fetch the plan's currency_format.iso_code and use a per-currency threshold table.
Bounded delta cache — LRU eviction or time-based TTL once long-running session patterns reveal whether it matters.
twiceAMonth accurate expansion — use date_first and date_next to derive the two anchor days per month.
Delta sync for filtered reads — once YNAB's documentation clarifies the semantics with query filters.
Write tool: delete_transaction — highest-risk write, not shipping until there is a strong user case.

Development

go test -race ./...        # run tests with race detector
go vet ./...
go build ./...
staticcheck ./...
govulncheck ./...

Docker build:

docker build -t mcp-ynab:dev --build-arg VERSION=dev .

Running locally against the real YNAB API

echo -n "your-token" | mcp-ynab store-token   # one-time, uses OS keyring
./mcp-ynab                                     # run the server

The server reads JSON-RPC messages from stdin and writes responses to stdout. All logs go to stderr — the binary never touches stdout from its own code, or it would corrupt the transport framing.

Adding a tool

Define input/output structs in tools.go with json and jsonschema tags.
Add a handler method on *Client.
Register it in registerTools with an appropriate ToolAnnotations hint.
Add a test in tools_test.go using the testClient helper.
Add a subprocess test in subprocess_test.go if the tool has required arguments, to confirm the SDK validation layer rejects missing fields.

The MCP SDK automatically derives JSON Schemas from struct types and validates incoming arguments before the handler runs.

License

MIT — see LICENSE.

Documentation ¶

Overview ¶

SPDX-License-Identifier: MIT

Source Files ¶

View all Source files

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL

README ¶

mcp-ynab

Why read-only

Tools

Read tools (always available)

Task-shaped tools (composition over primitives)

Write tools (opt-in, require YNAB_ALLOW_WRITES=1)

What each tool enables

Money representation

Install

Homebrew / prebuilt binaries

Docker

From source

Configure

Get a YNAB personal access token

Provide the token to the server

1. OS keyring (recommended)

2. File-based secret

3. Environment variable (plaintext)

Wire it into your MCP client

Claude Desktop

Cursor

Other MCP clients

Security model

What this server protects against

Defense-in-depth

What it does NOT protect against

Enabling writes

Known limitations

Roadmap

Development

Running locally against the real YNAB API

Adding a tool

License

Documentation ¶

Overview ¶

Source Files ¶

Write tools (opt-in, require `YNAB_ALLOW_WRITES=1`)