middleware

package

v0.1.4 Latest Latest Go to latest Published: Feb 12, 2026 License: MIT Imports: 11 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/github/gh-aw-mcpg

Links

Open Source Insights

README ¶

jqschema Middleware

This middleware package implements the jqschema functionality from the gh-aw shared agentic workflow as a tool call middleware for the MCP Gateway.

Features

Automatic JSON Schema Inference: Uses the jq schema transformation logic to automatically infer the structure and types of JSON responses
Payload Storage: Stores complete response payloads in /tmp/gh-awmg/tools-calls/{randomID}/payload.json
Response Rewriting: Returns a transformed response containing:
- First 500 characters of the payload (for quick preview)
- Inferred JSON schema showing structure and types
- Query ID for tracking
- File path to complete payload
- Metadata (original size, truncation status)

How It Works

The middleware wraps tool handlers and intercepts their responses:

Random ID Generation: Each tool call gets a unique random ID (32 hex characters)
Original Handler Execution: The original tool handler is called normally
Payload Storage: The complete response is saved to disk
Schema Inference: The jq schema transformation is applied to extract types and structure
Response Rewriting: A new response is returned with preview + schema

Usage

The middleware is automatically applied to all backend MCP server tools (except sys___* tools).

Example Response

Original response:

{
  "total_count": 1000,
  "items": [
    {
      "login": "user1",
      "id": 123,
      "verified": true
    },
    {
      "login": "user2",
      "id": 456,
      "verified": false
    }
  ]
}

Transformed response:

{
  "agentInstructions": "The payload was too large for an MCP response. The complete original response data is saved as a JSON file at payloadPath. The file contains valid JSON that can be parsed directly. The payloadSchema shows the structure and types of fields in the full response, but not the actual values. To access the full data with all values, read and parse the JSON file at payloadPath.",
  "payloadPath": "/tmp/gh-awmg/tools-calls/a1b2c3d4e5f6.../payload.json",
  "payloadPreview": "{\"total_count\":1000,\"items\":[{\"login\":\"user1\",\"id\":123,\"verified\":true}...",
  "payloadSchema": {
    "total_count": "number",
    "items": [
      {
        "login": "string",
        "id": "number",
        "verified": "boolean"
      }
    ]
  },
  "originalSize": 234
}

Understanding the response:

payloadPath: Points to a JSON file containing the complete original response data
payloadSchema: Shows the structure and types (e.g., "string", "number", "boolean") but NOT the actual values
payloadPreview: First 500 characters of the JSON for quick reference
originalSize: Size of the full response in bytes

Reading the payload.json file: The file at payloadPath contains the original response data in valid JSON format. You can read it using standard tools:

# Using cat and jq
cat /tmp/gh-awmg/tools-calls/a1b2c3d4e5f6.../payload.json | jq .

# Using Node.js
const data = JSON.parse(fs.readFileSync(payloadPath, 'utf8'));

# Using Python
import json
with open(payload_path) as f:
    data = json.load(f)

Implementation Details

jq Schema Filter

The middleware uses the same jq filter logic as the gh-aw jqschema utility:

def walk(f):
  . as $in |
  if type == "object" then
    reduce keys[] as $k ({}; . + {($k): ($in[$k] | walk(f))})
  elif type == "array" then
    if length == 0 then [] else [.[0] | walk(f)] end
  else
    type
  end;
walk(.)

This recursively walks the JSON structure and replaces values with their type names.

Go Implementation

The middleware is implemented using gojq, a pure Go implementation of jq, eliminating the need to spawn external processes.

Configuration

The middleware can be controlled via the ShouldApplyMiddleware function:

func ShouldApplyMiddleware(toolName string) bool {
    // Currently excludes sys tools
    return !strings.HasPrefix(toolName, "sys___")
}

Future Enhancements

Selective Middleware Mounting: A configuration system could be added to:

Enable/disable middleware per backend server
Configure which tools get middleware applied
Set custom truncation limits
Configure storage locations
Add multiple middleware types with ordering

Example future config structure:

[middleware.jqschema]
enabled = true
truncate_at = 500
storage_path = "/tmp/gh-awmg/tools-calls"
exclude_tools = ["sys___*"]
include_backends = ["github", "tavily"]

Testing

The middleware includes comprehensive tests:

Unit tests: Test individual functions (random ID generation, schema transformation, payload storage)
Integration tests: Test complete middleware flow with mock handlers
Edge cases: Test error handling, large payloads, truncation behavior

Run tests:

make test-unit
# or
go test ./internal/middleware/...

Directory Structure

Payloads are stored in:

/tmp/gh-awmg/tools-calls/
  ├── {randomID1}/
  │   └── payload.json
  ├── {randomID2}/
  │   └── payload.json
  └── ...

Benefits

Reduced Token Usage: Preview + schema is much smaller than full responses
Better Understanding: Schema shows structure without verbose data
Audit Trail: Complete payloads are saved for later inspection
Debugging: Query IDs enable tracking and correlation
Performance: Pure Go implementation (no external process spawning)

References

Original jqschema utility: gh-aw/.github/workflows/shared/jqschema.md
gojq library: github.com/itchyny/gojq

Documentation ¶

Index ¶

Constants
func ShouldApplyMiddleware(toolName string) bool
func WrapToolHandler(...) ...
type PayloadMetadata

Constants ¶

View Source

const PayloadTruncatedInstructions = "" /* 373-byte string literal not displayed */

PayloadTruncatedInstructions is the message returned to clients when a payload has been truncated and saved to the filesystem

Variables ¶

This section is empty.

Functions ¶

func ShouldApplyMiddleware ¶

func ShouldApplyMiddleware(toolName string) bool

ShouldApplyMiddleware determines if the middleware should be applied to a tool Currently applies to all tools, but can be configured to filter specific tools

func WrapToolHandler ¶

func WrapToolHandler(
	handler func(context.Context, *sdk.CallToolRequest, interface{}) (*sdk.CallToolResult, interface{}, error),
	toolName string,
	baseDir string,
	sizeThreshold int,
	getSessionID func(context.Context) string,
) func(context.Context, *sdk.CallToolRequest, interface{}) (*sdk.CallToolResult, interface{}, error)

WrapToolHandler wraps a tool handler with jqschema middleware This middleware: 1. Generates a random ID for the query 2. Extracts session ID from context (or uses "default") 3. If payload size > sizeThreshold: saves to {baseDir}/{sessionID}/{queryID}/payload.json and returns metadata 4. If payload size <= sizeThreshold: returns original response directly (no file storage) 5. For large payloads: returns first 500 chars of payload + jq inferred schema

Types ¶

type PayloadMetadata ¶ added in v0.0.106

type PayloadMetadata struct {
	AgentInstructions string      `json:"agentInstructions"`
	PayloadPath       string      `json:"payloadPath"`
	PayloadPreview    string      `json:"payloadPreview"`
	PayloadSchema     interface{} `json:"payloadSchema"`
	OriginalSize      int         `json:"originalSize"`
	QueryID           string      `json:"-"` // Internal use only, not serialized to clients
}

PayloadMetadata represents the metadata response returned when a payload is too large and has been saved to the filesystem

Source Files ¶

View all Source files

jqschema.go

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL