Cloudflare AI ToolSmith

Give AI agents instant access to any API. Upload an OpenAPI specification, then chat with an AI that can autonomously call your APIs in real time.

Live Demo

Try it now (no setup required):

Live Site: https://main.toolsmith-ui.pages.dev

Sample APIs are pre-loaded. Just open the link and start chatting!

Overview

ToolSmith is an AI agent platform built on Cloudflare Workers that transforms OpenAPI specifications into executable AI skills. Upload an API spec, and the AI assistant gains the ability to call those APIs during conversations: it executes real HTTP requests and returns the results, and you do not have to write any integration code.

Architecture

graph TB
    A[Upload OpenAPI Spec] --> B[Parse Operations]
    B --> C[SkillRegistry DO<br/>stores per-user skills]

    D[User Chats] --> E[Load User Skills]
    E --> F[AI + Function Calling]
    F -->|Chooses Skills| G[Execute HTTP Request]
    G --> H[Stream Results]
    H --> I[AI Interprets & Responds]

    J[SessionState DO<br/>chat history] -.->|Context| D
    K[X-User-ID Header] -.->|Isolates| C

    style C fill:#ffd166,stroke:#333,stroke-width:2px,color:#000
    style J fill:#7dd3fc,stroke:#333,stroke-width:2px,color:#000

One-Minute Quickstart

git clone https://github.com/lesprgm/cf_ai_toolsmith.git && cd cf_ai_toolsmith
npm install && (cd ui && npm install)
echo "ENCRYPTION_KEY=$(openssl rand -base64 32)" > .dev.vars
npm run dev &    # Start worker on http://localhost:8787
npm run dev:ui   # Start UI on http://localhost:5173
# Open http://localhost:5173 → Upload examples/petstore.yaml → Chat: "List available pets"

Screenshot: the AI autonomously calling a weather API skill during a conversation.

Screenshot: the AI autonomously calling a flight tracking API skill during a conversation.

How to Run Everything

Required Environment Setup

1. Environment Variables

# Create .dev.vars file (for local development):
ENCRYPTION_KEY=<base64-encoded-32-byte-random-key>   # Generate: openssl rand -base64 32
ENVIRONMENT=development
LOG_LEVEL=debug

# For production deployment:
wrangler secret put ENCRYPTION_KEY            # Prompts for the value; generate it the same way as above

2. Durable Objects Configuration

Required bindings in wrangler.toml:

[[durable_objects.bindings]]
name = "SESSION_STATE"
class_name = "SessionState"
script_name = "cf-ai-toolsmith"

[[durable_objects.bindings]]
name = "SKILL_REGISTRY"
class_name = "SkillRegistry"
script_name = "cf-ai-toolsmith"

# Migration required on first deploy:
[[migrations]]
tag = "v1"
new_classes = ["SessionState", "SkillRegistry"]

3. Workers AI Binding

Model used: @cf/meta/llama-3.3-70b-instruct-fp8-fast

[ai]
binding = "AI"

4. Multi-Tenancy Header

All API requests must include:

X-User-ID: <unique-user-identifier>

This header isolates skills and chat history per user. Each unique X-User-ID maps to a separate Durable Object instance.
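
As a rough illustration of the isolation described above, the sketch below validates the header value before it is used to pick a Durable Object. The helper name `resolveUserId`, the 128-character limit, and the 400 status are assumptions made for this example, not ToolSmith's confirmed behavior. In a Worker, the returned ID would typically be passed to the namespace's `idFromName(...)` method so that the same header value always reaches the same instance.

```typescript
// Hypothetical helper: validates the X-User-ID header value.
type UserIdResult =
  | { ok: true; userId: string }
  | { ok: false; status: number; error: string };

function resolveUserId(headerValue: string | null | undefined): UserIdResult {
  const userId = (headerValue ?? "").trim();
  if (userId === "") {
    return { ok: false, status: 400, error: "Missing X-User-ID header" };
  }
  // Assumed limit, chosen only for illustration.
  if (userId.length > 128) {
    return { ok: false, status: 400, error: "X-User-ID is too long" };
  }
  // Same header value -> same ID -> same Durable Object instance.
  return { ok: true, userId };
}
```

Rejecting a missing header outright, rather than falling back to a shared default user, keeps one caller's skills from leaking into another's session.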

Quick Verification

# 1. Start services (each command blocks, so run them in separate terminals)
npm run dev      # Worker on :8787
npm run dev:ui   # UI on :5173

# 2. Test worker health
curl http://localhost:8787/api/health

# 3. Test skill registration
curl -X POST http://localhost:8787/api/skills/register \
  -H "X-User-ID: test-user" \
  -H "Content-Type: application/json" \
  -d '{"apiName":"Test","spec":{...},"baseUrl":"https://api.example.com"}'

# 4. Test chat (requires registered skills)
curl -X POST http://localhost:8787/api/chat \
  -H "X-User-ID: test-user" \
  -H "Content-Type: application/json" \
  -d '{"message":"Hello"}'

Core Components

SkillRegistry DO - Per-user API skill storage with encrypted credentials
Skill Parser - Converts OpenAPI specs to AI tool schemas, executes skills via HTTP
Chat Orchestrator - Loads skills, orchestrates function calling, streams results
SessionState DO - Maintains conversation history per session
Skills UI - Upload, manage, and delete registered APIs
Chat UI - Real-time streaming chat with skill execution display

Workflow

1. Register APIs as Skills

Upload OpenAPI Spec → System parses operations → Skills stored per-user

Example: Upload GitHub's OpenAPI spec
Result: 200+ operations become AI skills (listRepositories, createIssue, etc.)
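
The parsing step can be sketched as follows. This is a simplified illustration, not the project's actual Skill Parser: it handles only operation-level `parameters` (no request bodies or `$ref` resolution), and the function name `operationsToToolSchemas` and the fallback naming scheme are assumptions.

```typescript
// Minimal subset of an OpenAPI 3 document needed for this sketch.
interface OpenApiParameter {
  name: string;
  in: string;
  required?: boolean;
  description?: string;
  schema?: { type?: string };
}
interface OpenApiOperation {
  operationId?: string;
  summary?: string;
  parameters?: OpenApiParameter[];
}
interface OpenApiSpec {
  paths: Record<string, Record<string, OpenApiOperation>>;
}

interface ToolProperty { type: string; description?: string }

// OpenAI-style function tool schema.
interface ToolSchema {
  type: "function";
  function: {
    name: string;
    description: string;
    parameters: { type: "object"; properties: Record<string, ToolProperty>; required: string[] };
  };
}

const HTTP_METHODS = ["get", "post", "put", "patch", "delete"];

function operationsToToolSchemas(spec: OpenApiSpec): ToolSchema[] {
  const tools: ToolSchema[] = [];
  for (const [path, item] of Object.entries(spec.paths)) {
    for (const method of HTTP_METHODS) {
      const op = item[method];
      if (!op) continue;
      const properties: Record<string, ToolProperty> = {};
      const required: string[] = [];
      for (const p of op.parameters ?? []) {
        const prop: ToolProperty = { type: p.schema?.type ?? "string" };
        if (p.description) prop.description = p.description;
        properties[p.name] = prop;
        if (p.required) required.push(p.name);
      }
      // Fall back to a method + path name when operationId is absent.
      const slug = path.replace(/[^a-zA-Z0-9]+/g, "_").replace(/^_+|_+$/g, "");
      tools.push({
        type: "function",
        function: {
          name: op.operationId ?? `${method}_${slug}`,
          description: op.summary ?? `${method.toUpperCase()} ${path}`,
          parameters: { type: "object", properties, required },
        },
      });
    }
  }
  return tools;
}
```

Each operation becomes one tool, so a spec with hundreds of operations yields hundreds of tool schemas; a real implementation may need to limit how many are sent to the model per request.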

2. Chat with Skill-Enabled AI

User Question → AI loads your skills → AI chooses appropriate skill(s) → Executes HTTP request → Streams results

User: "What's the weather in NYC?"

AI process:
1. Loads user's registered Weather API skill
2. Chooses: getCurrentWeather(city="New York")
3. Executes: GET https://api.weather.com/current?city=New+York
4. Returns: {"temp": 68, "conditions": "Sunny"}
5. Responds: "The current weather in NYC is 68°F and sunny."
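
Step 3 above, turning a chosen skill and its arguments into a concrete request URL, might look like the sketch below. The helper `buildSkillUrl` is hypothetical. It fills `{placeholders}` in the path and sends the remaining arguments as query parameters. Note that it encodes spaces as `%20` rather than `+`; both forms are accepted in query strings.

```typescript
// Hypothetical helper: fills {placeholders} in the path template and
// appends all remaining arguments as a query string.
function buildSkillUrl(
  baseUrl: string,
  pathTemplate: string,
  args: Record<string, string | number | boolean>,
): string {
  const used = new Set<string>();
  const path = pathTemplate.replace(/\{([^}]+)\}/g, (_match: string, name: string) => {
    if (!(name in args)) {
      throw new Error(`Missing path parameter: ${name}`);
    }
    used.add(name);
    return encodeURIComponent(String(args[name]));
  });
  const query = Object.keys(args)
    .filter((key) => !used.has(key))
    .map((key) => `${encodeURIComponent(key)}=${encodeURIComponent(String(args[key]))}`)
    .join("&");
  // Drop trailing slashes on the base URL to avoid "//" in the result.
  const url = baseUrl.replace(/\/+$/, "") + path;
  return query ? `${url}?${query}` : url;
}
```

For the weather example, `buildSkillUrl("https://api.weather.com", "/current", { city: "New York" })` produces `https://api.weather.com/current?city=New%20York`.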

3. Multi-Skill Orchestration

AI can chain multiple skills autonomously:

User: "Find all Cloudflare repos with >1000 stars and save to Airtable"

AI executes:
1. listRepositories(org="cloudflare")
2. Filters locally: stars > 1000
3. createRecords(base="projects", records=[...])
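
The glue between steps 1 and 3 could be sketched like this. The `stargazers_count` field follows the GitHub REST API's repository objects; the Airtable field names (`Name`, `URL`, `Stars`) and both helper names are hypothetical and would depend on the target base.

```typescript
// Subset of a GitHub repository object relevant to this example.
interface Repo {
  name: string;
  html_url: string;
  stargazers_count: number;
}

// Record payload in the { fields: {...} } shape; field names are assumed.
interface RecordPayload {
  fields: { Name: string; URL: string; Stars: number };
}

// Step 2: keep only repositories with more than minStars stars.
function selectPopularRepos(repos: Repo[], minStars: number): Repo[] {
  return repos.filter((repo) => repo.stargazers_count > minStars);
}

// Prepare the input for step 3 (createRecords).
function toRecords(repos: Repo[]): RecordPayload[] {
  return repos.map((repo) => ({
    fields: { Name: repo.name, URL: repo.html_url, Stars: repo.stargazers_count },
  }));
}
```

Keeping the filter outside the model call means the threshold is applied deterministically instead of relying on the model to compare numbers.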

User Input

Skills Page

Click "Skills" in navigation
Upload OpenAPI spec (JSON/YAML) or paste directly
Enter API name (e.g., "Weather API")
Enter API key (if required)
Click "Register API"
View registered skills in table

Chat Page

Click "Chat" in navigation
Type natural language queries
Watch AI autonomously execute skills
See detailed results before AI's response

Example Interactions

Single API:

User: "What's the weather in Paris?"
AI: Executes getCurrentWeather(location="Paris")

Multi-API:

User: "List my GitHub starred repos about 'ai' and save to Airtable"
AI: Executes listStarredRepos() → filters → createRecords()

Memory & State

Durable Objects Storage

SkillRegistry (workers/durable_objects/SkillRegistry.ts)

Stores per-user registered APIs and their skills
Key structure: user:{userId} → UserSkills object
Contains: API name, base URL, encrypted API key, skill definitions
Multi-tenant isolation via X-User-ID header
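
To make the key structure concrete, here is an in-memory stand-in for the registry. It is a sketch only: the real class persists to Durable Object storage rather than a `Map`, and the field and method names below are assumptions inferred from the description above, not taken from `SkillRegistry.ts`.

```typescript
interface Skill {
  name: string;
  method: string;
  path: string;
  description: string;
}
interface RegisteredApi {
  apiName: string;
  baseUrl: string;
  encryptedApiKey?: string; // ciphertext only; never the plain key
  skills: Skill[];
}
interface UserSkills {
  apis: RegisteredApi[];
}

// Key structure from the description above: user:{userId}
function storageKey(userId: string): string {
  return `user:${userId}`;
}

class InMemorySkillRegistry {
  private store = new Map<string, UserSkills>();

  // Registers an API, replacing any existing entry with the same name.
  register(userId: string, api: RegisteredApi): void {
    const key = storageKey(userId);
    const current = this.store.get(key) ?? { apis: [] };
    const others = current.apis.filter((a) => a.apiName !== api.apiName);
    this.store.set(key, { apis: [...others, api] });
  }

  list(userId: string): RegisteredApi[] {
    return this.store.get(storageKey(userId))?.apis ?? [];
  }

  // Returns true if an API with that name existed and was removed.
  remove(userId: string, apiName: string): boolean {
    const key = storageKey(userId);
    const current = this.store.get(key);
    if (!current) return false;
    const remaining = current.apis.filter((a) => a.apiName !== apiName);
    if (remaining.length === current.apis.length) return false;
    this.store.set(key, { apis: remaining });
    return true;
  }
}
```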

SessionState (workers/durable_objects/SessionState.ts)

Maintains chat conversation history per session
Key structure: history → Message[] array
Enables context-aware conversations across page refreshes
Auto-trimmed when history exceeds token limits
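
One way the auto-trimming could work is sketched below. The four-characters-per-token estimate is a crude heuristic rather than a real tokenizer, and `trimHistory` is a hypothetical name; the actual trimming strategy in `SessionState.ts` may differ.

```typescript
interface Message {
  role: "system" | "user" | "assistant" | "tool";
  content: string;
}

// Rough heuristic: about 4 characters per token. Not a real tokenizer.
function estimateTokens(message: Message): number {
  return Math.ceil(message.content.length / 4);
}

// Keeps the most recent messages whose estimated total fits in maxTokens.
function trimHistory(history: Message[], maxTokens: number): Message[] {
  const kept: Message[] = [];
  let total = 0;
  // Walk from newest to oldest so the latest turns survive.
  for (const message of [...history].reverse()) {
    const cost = estimateTokens(message);
    if (total + cost > maxTokens) break;
    kept.unshift(message); // restore chronological order
    total += cost;
  }
  return kept;
}
```

Dropping the oldest turns first preserves the immediate context the model needs to resolve follow-up questions.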

Data Flow

User uploads spec → SkillRegistry DO stores skills
User chats → Worker loads skills from SkillRegistry
AI function calls → Worker executes HTTP request
Results → SessionState DO stores in history

LLM Integration

Model

@cf/meta/llama-3.3-70b-instruct-fp8-fast via Cloudflare Workers AI

Function Calling

// 1. Load the user's registered skills
const skills = await loadUserSkills(userId);

// 2. Convert them to OpenAI-compatible tool schemas
const tools = skillsToAIToolSchemas(skills);

// 3. Call the model with the tools attached
const response = await AI.run(model, {
  messages: [...history, userMessage],
  tools,
  tool_choice: "auto",
});

// 4. The model returns tool calls; the worker executes each one and
//    collects the results so they can be passed back to the model
const results = [];
for (const call of response.tool_calls ?? []) {
  results.push(await executeSkill(call.name, call.arguments));
}
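
Before a tool call is executed, it can help to normalize it into one shape. The sketch below assumes two input shapes: a flat `{ name, arguments }` object, as used in the snippet above, and the OpenAI-style `{ function: { name, arguments } }` form where `arguments` is a JSON string. Which shape a given model returns is an assumption to verify against its documentation; `normalizeToolCall` is a hypothetical helper.

```typescript
interface NormalizedToolCall {
  name: string;
  arguments: Record<string, unknown>;
}

// The two input shapes this sketch accepts (both are assumptions).
interface RawToolCall {
  name?: string;
  arguments?: unknown;
  function?: { name?: string; arguments?: unknown };
}

function normalizeToolCall(raw: RawToolCall): NormalizedToolCall {
  const name = raw.name ?? raw.function?.name;
  if (!name) {
    throw new Error("Tool call has no name");
  }
  const rawArgs = raw.arguments ?? raw.function?.arguments ?? {};
  // Arguments may arrive as a JSON string or as an already-parsed object.
  const parsed: unknown = typeof rawArgs === "string" ? JSON.parse(rawArgs) : rawArgs;
  if (typeof parsed !== "object" || parsed === null || Array.isArray(parsed)) {
    throw new Error(`Arguments for ${name} must be a JSON object`);
  }
  return { name, arguments: parsed as Record<string, unknown> };
}
```

Validating here means a malformed call fails with a clear error before any HTTP request is sent.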

Capabilities

Autonomous function calling based on user intent
Parameter extraction from natural language
Multi-turn context using chat history
Error interpretation and recovery
Multi-skill orchestration
Streaming responses via Server-Sent Events
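
For the streaming capability, each update sent to the browser is a Server-Sent Events frame: an optional `event:` line, a `data:` line, and a blank line as terminator. The sketch below formats one frame; the helper name and the event name `skill_result` are hypothetical and may not match what the Chat Orchestrator emits.

```typescript
// Formats one Server-Sent Events frame with a JSON payload.
// JSON.stringify escapes newlines, so the payload always fits on one data line.
function formatSseEvent(event: string, data: unknown): string {
  const payload = JSON.stringify(data ?? null);
  return `event: ${event}\ndata: ${payload}\n\n`;
}

// Example frame for a completed skill execution (event name is assumed).
const frame = formatSseEvent("skill_result", { temp: 68, conditions: "Sunny" });
```

On the client, an `EventSource` listener registered for the same event name would receive the JSON string in `event.data`.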

Decision-Making

When to use skills: Based on user intent and available operations
Which skill to use: Matches query to skill descriptions
What parameters to pass: Extracts from query or requests clarification
How to handle errors: Retries or explains limitations

Getting Started

Prerequisites

Node.js 18+
Cloudflare account with Workers AI and Durable Objects enabled
Wrangler CLI authenticated (wrangler login)

Local Development

git clone https://github.com/lesprgm/cf_ai_toolsmith.git
cd cf_ai_toolsmith
npm install
(cd ui && npm install)

npm run dev
npm run dev:ui

Deploy to Production

wrangler deploy
cd ui && npm run build
npx wrangler pages deploy dist --project-name=toolsmith

See DEPLOYMENT.md for complete instructions.

Testing

npm test                  # Run full suite (124 tests)
npm run test:unit         # Unit tests
npm run test:integration  # Integration tests
npm run test:coverage     # Coverage report

Technical Stack

Backend:

Cloudflare Workers (V8 isolates)
Llama 3.3 70B via Workers AI
Durable Objects (SkillRegistry, SessionState)

Frontend:

React 18 with TypeScript
Vite build tool
Tailwind CSS

Testing:

Vitest (124 passing tests)

Future Improvements

ToolSmith will continue to grow from a chat-based skill execution environment into a complete agent automation platform. Planned improvements include:

Workflow Automation: Support natural language instructions such as "send the weather report to Slack every morning" that automatically generate and deploy scheduled Workers chaining multiple skills together.
One-Click Cloudflare Deployment: Allow users to export tested agents as standalone Cloudflare Workers with cron triggers, secrets, and webhooks preconfigured for reuse across projects or integrations.
Multi-Step Agent Composition: Enable users to chain APIs, for example "fetch data, filter results, then post to Slack," within a single deployable flow.
Integration Hub: Add built-in connectors for Slack, Zapier, Notion, and Google Sheets to extend agents beyond ToolSmith.
Monitoring and Insights: Provide dashboards for execution logs, latency tracking, and success metrics to help users observe and optimize deployed agents.

These enhancements will move ToolSmith toward becoming a full chat-to-deployment automation system built on Cloudflare’s edge network.

License

MIT License - see LICENSE for details.
