🗂️ Mache

Mache (/mɑʃe/ mah-shay): From the French mâché, meaning "crushed and ground" (as in papier-mâché). Just as waste paper is shredded and remolded into strong, complex shapes, Mache remolds raw data into navigable filesystems.
Mache: The Graph-Native Filesystem
We realized that JSON, YAML, Source Code, and Filesystems are all just Graphs.
Mache is the engine that aligns them. It treats your structured data not as text to be parsed, but as a Graph to be mounted. By bridging the gap between your Data's structure (ASTs, Objects) and your OS's structure (Directories, Inodes), Mache allows you to traverse complex logic as easily as you traverse a directory tree.
And because it's a Graph, Mache gives you the ultimate tool to query it: SQL.
The Data Graph The Mache Bridge The OS Graph
(JSON / Code / YAML) (Query & Transform) (Filesystem)
[Root Object] / (Mount)
/ | \ / | \
{key} {key} [Arr] <==== SQL Projection ====> /key/ /key/ /Arr/
| | | | | |
"val" {obj} {item} file dir/ dir/
The "Aha!" Moment: Isomorphism
If Data = Graph and Filesystem = Graph, then the fact that they don't map to each other 1:1 is a failure of modern operating systems. Mache fixes that alignment.
- SQL as the Operator: The SQLite sidecar isn't just a feature; it's the mathematical operator for the graph.
- Schema as Topology: The schema isn't configuration; it's the topology definition for the graph projection.
- FUSE as Traversal: FUSE is just the standard interface for graph traversal (
cd = edge traversal, ls = node enumeration).
Table of Contents
Status
Mache is in early development. The core pipeline (schema + ingestion + FUSE mount) works end-to-end across multiple data sources.
Feature Matrix
| Capability |
Status |
Notes |
| Graph Filesystem |
Stable |
FUSE bridge (macOS/Linux) mounts any graph as a directory tree. |
| Hybrid SQL Index |
Active |
In-memory SQLite sidecar for instant, zero-copy queries. |
| Plan 9 Interface |
Live |
Synthetic ctl files for executing complex SQL via shell. |
| Write-Back |
Beta |
Edits to the mount splice atomically back into source code. |
| Tree-sitter Parsing |
Active |
Native support for Go and Python ASTs. |
| Schema Inference |
Experimental |
Auto-derives topology using Formal Concept Analysis (FCA). |
Quick Start
Prerequisites
- macOS:
brew install --cask fuse-t and brew install go-task
- Linux:
apt-get install libfuse-dev and install Task
Building
git clone https://github.com/agentic-research/mache.git
cd mache
# Build (checks for fuse-t on macOS, builds and codesigns)
task build
# Run tests
task test
Usage
# Mount a SQLite database (instant — zero-copy, direct SQL queries)
./mache --schema examples/nvd-schema.json --data results.db /tmp/nvd
# Mount with zero-config schema inference (no schema authoring needed)
./mache --infer --data results.db /tmp/nvd
# Mount a JSON file (ingests into memory)
./mache --schema schema.json --data data.json /tmp/mount
Example: NVD Vulnerability Database
Mount 323K NVD CVE records as a browsable filesystem, sharded by year and month.
(Data can be generated using Venturi):
./mache --schema examples/nvd-schema.json \
--data /path/to/nvd/results.db \
/tmp/nvd
/tmp/nvd/
by-cve/
2024/
01/
CVE-2024-0001/
description # "A buffer overflow in FooBar..."
published # "2024-01-15T00:00:00Z"
raw.json # Full JSON record
Example: Projecting JSON Data
Given a data.json with users, you can project it into a users/ directory where each file contains specific fields.
Example: Projecting Source Code
Mache auto-detects .go and .py files. Use tree-sitter queries in your schema to map AST nodes (functions, types) to directories.
Write-Back Mode
With --writable, file nodes backed by tree-sitter source code become editable.
# Mount Go source with write-back enabled
./mache -w -s examples/go-schema.json -d . /tmp/mache-src
When you edit a file in the mount, Mache splices the content back into the original source file and runs goimports.
How It Works
Mache uses a Topology Schema to map data from SQLite, JSON, or source code into a filesystem structure.
- Direct Mode: For SQLite, it queries the DB on-demand (zero-copy).
- Ingest Mode: For JSON/Code, it loads data into an in-memory graph.
- Inference: With
--infer, it uses Formal Concept Analysis to guess the best folder structure.
See Architecture for details.
Documentation
Contributing
See CONTRIBUTING.md for details.
License
Apache License 2.0. See LICENSE.