toskill

module

v0.0.0-...-cc74e3f Latest Latest Go to latest Published: Mar 1, 2026 License: MIT

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/byadhddev/toskill

Links

Open Source Insights

README ¶

toskill

Autonomous AI agent swarm that transforms web articles into structured, reusable AI skills.

Built with GitHub Copilot SDK for Go + the open skills ecosystem

What It Does

Give toskill any URL — a blog post, research paper, tutorial, or documentation page — and it autonomously:

Extracts — Discovers and uses browser automation to read the page content
Curates — Analyzes the content, detects the domain, creates a structured knowledge base
Builds — Transforms the knowledge base into a distributable AI skill with progressive disclosure

Each agent self-discovers the tools it needs at runtime using the open skills ecosystem. No hardcoded dependencies.

Install

From source (requires Go 1.24+)

go install github.com/byadhddev/toskill/cmd/toskill@latest

From release binary

# Linux (amd64)
curl -sSL https://github.com/byadhddev/toskill/releases/latest/download/toskill-linux-amd64 -o toskill
chmod +x toskill && sudo mv toskill /usr/local/bin/

# macOS (Apple Silicon)
curl -sSL https://github.com/byadhddev/toskill/releases/latest/download/toskill-darwin-arm64 -o toskill
chmod +x toskill && sudo mv toskill /usr/local/bin/

Build from source

git clone https://github.com/byadhddev/toskill.git
cd toskill
make build       # → bin/toskill
make install     # → $GOPATH/bin/toskill

Prerequisites

GitHub Copilot CLI — installed and authenticated (for Auto/CLI modes)
GitHub CLI (gh) — for GitHub storage integration (optional)
Node.js 18+ — for npx skills (skill discovery)

Quick Start

# Interactive mode — guided setup wizard (recommended)
toskill

# Or run directly (SDK auto-manages the Copilot CLI process)
toskill run https://example.com/blog/interesting-article

# Check what was generated
toskill status

Authentication

toskill supports multiple ways to connect to the AI backend. Choose the method that fits your setup.

Method	Flag	Use Case	Copilot Sub?
Auto (default)	`--auth auto`	SDK auto-manages CLI process	Yes
External CLI	`--auth cli-url`	Connect to headless server	Yes
GitHub Token	`--auth github-token`	Explicit PAT/OAuth token	Yes
Environment Var	`--auth env-var`	CI/CD, automation	Yes
BYOK	`--auth byok`	Your own API keys	No

Auto (Recommended)

The SDK spawns and manages its own CLI process. Uses stored Copilot credentials, env tokens, or gh CLI auth automatically. No manual headless server needed.

toskill run https://example.com/article

External CLI Server

Connect to a running headless Copilot CLI:

# Terminal 1: start headless server
copilot --headless --port 44321

# Terminal 2: run toskill
toskill run --auth cli-url --copilot-url localhost:44321 https://example.com/article

GitHub Token

Provide an explicit token (gho_, ghu_, or github_pat_ prefix):

toskill run --copilot-token gho_xxxx https://example.com/article

Environment Variables

Set a token env var — the SDK auto-detects it:

export COPILOT_GITHUB_TOKEN=your-token  # or GH_TOKEN, or GITHUB_TOKEN
toskill run https://example.com/article

BYOK (Bring Your Own Key)

Use your own API keys from OpenAI, Anthropic, or Azure. No Copilot subscription required.

toskill run --auth byok \
  --byok-provider openai \
  --byok-url https://api.openai.com/v1 \
  --byok-key sk-xxx \
  --model gpt-4o \
  https://example.com/article

Auth Priority (Auto mode)

Explicit --copilot-token
Env vars: COPILOT_GITHUB_TOKEN → GH_TOKEN → GITHUB_TOKEN
Stored Copilot CLI OAuth credentials
gh CLI auth (gh auth token)

Interactive Mode

Run toskill with no arguments for a step-by-step wizard:

⚡ toskill — Autonomous Skill Builder

┃ 🔑 Authentication
┃ > Auto — SDK manages CLI (Recommended)
┃   External CLI — connect to headless server
┃   GitHub Token — explicit PAT / OAuth token
┃   Environment Variable — COPILOT_GITHUB_TOKEN / GH_TOKEN
┃   BYOK — Bring Your Own Key (OpenAI / Anthropic / Azure)

┃ 🔗 URLs to process
┃ > https://example.com/article

┃ 📦 Storage
┃ > GitHub Repository
┃   Local (./skill-store/)

┃ ✅ Logged in as yourname
┃ 📂 Select repository
┃ > yourname/toskill-store
┃   yourname/other-repo
┃   + Create new repository

┃ 🧠 Model
┃ > claude-opus-4.6 (Recommended)

┃ ⚙️  Use different models per phase?
┃ > No

┃ 🚀 Run pipeline? Yes, let's go!

The wizard:

Offers 5 auth methods — Auto (SDK-managed), External CLI, GitHub Token, Env Vars, BYOK
Auto-detects your GitHub login via gh CLI — no tokens to copy-paste
Lists your repos for selection, or lets you create a new one
Dynamically loads available models from the connected AI backend
Supports per-phase model selection (e.g., cheap model for extraction, premium for building)

Usage

toskill                        Interactive wizard
toskill <command> [flags] [args...]

Commands:
  run <url1> [url2] ...     Full pipeline: extract → curate → build
  extract <url1> [url2] ... Extract content from URLs only
  curate [article-paths]    Curate articles into a knowledge base
  build <kb-name>           Build a skill from a knowledge base
  build --auto              Build skills from all knowledge bases
  status                    Show current pipeline state
  remove                    Interactively select and delete artifacts
  reset                     Wipe all artifacts (local and/or GitHub)
  config show               Show configuration
  config set <key> <value>  Set a persistent config value
  version                   Print version

Flags:
  --auth <method>          Auth: auto|cli-url|github-token|env-var|byok
  --copilot-url <addr>     Headless CLI server address (with --auth cli-url)
  --copilot-token <token>  GitHub token for Copilot (with --auth github-token)
  --byok-provider <type>   BYOK provider: openai|anthropic|azure
  --byok-url <url>         BYOK base URL
  --byok-key <key>         BYOK API key
  --output <dir>           Output directory (default: ./skill-store/)
  --model <name>           LLM model (default: claude-opus-4.6)
  --extract-model <name>   Model override for extraction phase
  --curate-model <name>    Model override for curation phase
  --build-model <name>     Model override for skill building phase
  --github-repo <repo>     GitHub repo (e.g. 'owner/toskill-store')
  --github-token <tok>     GitHub token for storage (auto-detected from gh CLI)
  --verbose                Verbose output

GitHub Storage

Artifacts are committed directly to a GitHub repository after each pipeline phase.

Authentication is automatic — toskill uses your gh CLI login. No manual token needed.

# If you have gh CLI authenticated:
toskill run --github-repo yourname/toskill-store https://example.com/article

# Or use the interactive wizard — it lists your repos
toskill

If gh CLI is not installed or not authenticated, toskill falls back to: GITHUB_TOKEN env var → config file → local-only storage.

What gets committed:

articles/{slug}.md — extracted content
knowledge-bases/{name}/KB.md — curated knowledge base
skills/{name}/SKILL.md — distributable skill
skills/{name}/references/*.md — supporting reference material

Managing Artifacts

Remove specific items

toskill remove

Interactive multi-select lets you pick individual articles, knowledge bases, or skills to delete — from both local and GitHub simultaneously. Supports toggle, filter, and select-all.

Reset everything

toskill reset

Choose to wipe local store only, GitHub repo only, or both. Includes a confirmation step before any deletion.

Per-Phase Models

Use different models for each pipeline stage to optimize cost vs quality:

# Fast extraction, premium skill building
toskill run \
  --extract-model claude-haiku-4.5 \
  --curate-model claude-sonnet-4.5 \
  --build-model claude-opus-4.6 \
  https://example.com/article

# Or persist in config
toskill config set extract-model claude-haiku-4.5
toskill config set build-model claude-opus-4.6

Configuration

Settings are loaded in order (later overrides earlier):

Config file (~/.config/toskill/config)
Environment variables
CLI flags

# Persistent config
toskill config set copilot-url localhost:44321
toskill config set model claude-opus-4.6
toskill config set github-repo yourname/toskill-store

# Environment variables
export COPILOT_CLI_URL=localhost:44321
export TOSKILL_OUTPUT=./my-skills
export TOSKILL_MODEL=claude-opus-4.6
export GITHUB_TOKEN=ghp_xxx

Valid config keys: auth-method, copilot-url, output, model, extract-model, curate-model, build-model, github-repo, github-token, redact-paths

Path Redaction

Hide your home directory path in all output — useful for screenshots, recordings, and sharing:

# One-time via flag
toskill run --redact https://example.com/article

# Persist in config
toskill config set redact-paths true

With --redact, paths like /home/user/ai/toskill/skill-store/skills/... become ~/ai/toskill/skill-store/skills/....

Token Usage Tracking

toskill tracks token consumption across all pipeline phases and displays a summary at the end:

📊 Token Usage
   Extract: 15.2K in / 3.1K out (cache: 8.0K read) [2 premium req]
   Curate:  8.4K in / 2.0K out [1 premium req]
   Build:   22.1K in / 5.8K out (cache: 12.0K read) [3 premium req]
   ─────────────────────────────
   Total: 45.7K in / 10.9K out (cache: 20.0K read) [6 premium reqs]

Use --verbose for per-turn token breakdowns during execution.

Skill Evolution

Evolve existing skills with new knowledge instead of creating from scratch:

# CLI: evolve a specific skill with new content
toskill run --evolve --skill-name my-skill https://example.com/new-article

# Build phase only: evolve from an existing knowledge base
toskill build --evolve --skill-name my-skill new-kb-name

In interactive mode, the wizard asks whether to create a new skill or evolve an existing one, and lists available skills to choose from.

How evolution works:

Reads the existing SKILL.md and all references
Merges new knowledge from the knowledge base
Preserves all existing content (never removes)
Adds a changelog entry noting what was added

Output Structure

skill-store/
├── articles/                        # Raw extracted content
│   └── example-com-blog-article.md
├── knowledge-bases/                 # Curated knowledge bases
│   └── web-security/
│       └── KB.md
└── skills/                          # Distributable AI skills
    └── web-security/
        ├── SKILL.md
        └── references/
            ├── techniques.md
            └── hardening-guide.md

How It Works

Three AI agents run in sequence inside a single binary. Each agent autonomously discovers and loads the skills it needs from the open skills ecosystem.

1. Content Extractor

Finds the agent-browser skill → loads its instructions → uses browser automation CLI to open the URL, wait for load, extract title and full body text → saves structured markdown.

2. Knowledge Curator

Reads extracted articles → auto-detects the domain → checks for existing knowledge bases to merge with → creates or updates a comprehensive KB with zero information loss. All code examples, data, and technical detail are preserved verbatim.

3. Skill Builder

Finds the skill-creator skill → loads its guidelines → transforms a knowledge base into a proper distributable skill following progressive disclosure: quick reference up front, detailed content in references/.

Architecture

toskill run <urls>
    │
    ├── Content Extractor (in-process)
    │     Tools: find_skill, install_skill, load_skill, run_command, save_result
    │
    ├── Knowledge Curator (in-process)
    │     Tools: read_article, list_knowledge_bases, write_knowledge_base
    │
    └── Skill Builder (in-process)
          Tools: find_skill, load_skill, read_knowledge_base, write_skill

All agents run in-process — single binary, no subprocesses, no separate services.

Stack:

GitHub Copilot SDK for Go — AI backend
charmbracelet/huh — interactive terminal forms
skills.sh — open skill discovery ecosystem

Contributing

Contributions are welcome! Here's how to get started:

Fork the repository
Create a branch for your feature: git checkout -b feat/my-feature
Make your changes and ensure the build passes: make build
Commit with a descriptive message
Push and open a Pull Request

Development Setup

git clone https://github.com/byadhddev/toskill.git
cd toskill
make build          # Build the binary
make test           # Run tests
make release        # Cross-compile for all platforms

Areas for Contribution

New agent types — add agents for different content sources (PDFs, videos, repos)
Skill formats — support additional output formats beyond SKILL.md
Model providers — extend beyond Copilot SDK to other LLM backends
Caching — skip re-extraction for already-processed URLs
Batch processing — parallel URL extraction for large sets
Dashboard — the web UI in ../dashboard/ needs work (see issues)

Code Structure

cmd/toskill/main.go          # CLI entry point, flag parsing, pipeline orchestration
pkg/config/config.go          # Config loading (file → env → flags)
pkg/extractor/extractor.go    # Content extraction agent
pkg/curator/curator.go         # Knowledge curation agent
pkg/builder/builder.go         # Skill builder agent
pkg/tools/tools.go             # Shared agent tools (find/install/load skill, run command)
pkg/ghstore/ghstore.go         # GitHub REST API storage client
pkg/ghauth/ghauth.go           # GitHub CLI auth integration
pkg/headless/headless.go       # Auto-start headless Copilot CLI server
pkg/interactive/interactive.go # Interactive wizard (charmbracelet/huh)

License

MIT License

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

Directories ¶

Path	Synopsis
cmd
toskill command
pkg
builder
config
curator
extractor
ghauth
ghstore
headless
interactive
tools

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL