vasted

vasted is a CLI that launches on-demand Vast.ai GPU workers for llama.cpp GGUF inference and exposes a stable OpenAI-compatible /v1 endpoint.

Built by deeflect.com · Follow on X: x.com/deeflectcom

Demo

Why `vasted`

Stable client endpoint while worker URLs rotate.
Setup wizard for local machine and VPS deployments.
Non-interactive automation mode for agents/CI.
OpenAI-compatible proxy for tools that expect /v1 APIs.
Session usage and cost tracking.
Optional Telegram bot control commands.

Requirements

Python 3.12+
uv
Vast.ai account + API key
Optional: Telegram bot token (telegram extra)

Install

From PyPI (recommended)

uv tool install vasted
vasted --version

Upgrade:

uv tool upgrade vasted

From source (development)

git clone https://github.com/deeflect/vasted.git
cd vasted
uv sync --extra dev

Run CLI commands from the repo:

uv run vasted --help

Git install (latest main)

uv tool install "git+https://github.com/deeflect/vasted.git"

Quick Start

If installed as a tool:

vasted setup
vasted up
vasted status --verbose

From source checkout:

uv run vasted setup
uv run vasted up
uv run vasted status --verbose

Client connection values after setup:

Base URL: http://<host>:<port>/v1
Auth header: Authorization: Bearer <token>

When proxy_host is 0.0.0.0, use your real machine/VPS IP or domain in clients.

Automation / Unattended Mode

Use non-interactive commands to avoid prompts:

uv run vasted setup --non-interactive \
  --vast-api-key "$VASTED_API_KEY" \
  --bearer-token "$VASTED_BEARER_TOKEN" \
  --client openclaw \
  --deployment-mode local_pc \
  --model qwen3-coder-30b \
  --quality balanced \
  --gpu-mode auto

uv run vasted up --non-interactive --yes --jinja --model qwen3-coder-30b --quality balanced --gpu-mode auto --no-serve
uv run vasted status --verbose
uv run vasted usage
uv run vasted down --force

Environment variables accepted by setup --non-interactive:

VASTED_API_KEY
VASTED_BEARER_TOKEN
VASTED_CLIENT (openclaw, opencode, custom)
VASTED_LLAMA_JINJA (true/false)
VASTED_MODEL, VASTED_QUALITY, VASTED_GPU_MODE, VASTED_GPU_PRESET
VASTED_DEPLOYMENT_MODE, VASTED_PROXY_HOST, VASTED_PROXY_PORT, VASTED_PUBLIC_HOST

Client Profiles and Jinja Behavior

setup supports client presets that define default llama.cpp --jinja behavior:

--client openclaw: jinja on by default
--client opencode: jinja off by default
--client custom: keep/manual behavior

Per launch override is still available:

uv run vasted up --jinja
uv run vasted up --no-jinja

Command Reference

vasted setup [--non-interactive] [--manual] [--client openclaw|opencode|custom]
vasted up [--model ...] [--quality ...] [--gpu-mode auto|manual] [--gpu-preset ...] [--profile ...] [--max-price ...] [--jinja|--no-jinja] [--yes] [--non-interactive] [--serve|--no-serve]
vasted down [--force]
vasted status [--verbose]
vasted logs [--instance-id N] [--tail N]
vasted usage
vasted token show [--full]
vasted token rotate
vasted rotate-token
vasted config show
vasted profile list|add|use|remove
vasted completions <bash|zsh|fish>

Telegram Bot (Optional)

Install telegram extra and run:

uv sync --extra telegram
uv run python bot.py

Development

uv run ruff check .
uv run mypy app tests bot.py
uv run pytest -q

Project Layout

app/commands/*: CLI command handlers
app/service.py: worker lifecycle + launch policy
app/proxy.py: OpenAI-compatible reverse proxy
app/vast.py: Vast API integration + startup script generation
app/usage.py: token/time/cost accounting
app/user_config.py: persistent config + keyring integration
app/state.py: runtime state persistence
bot.py: optional Telegram control plane

Security

Keep Vast API keys and bearer tokens private.
Prefer localhost binds unless remote access is required.
See SECURITY.md for disclosure policy.

Contributing

See CONTRIBUTING.md and run the validation commands before opening a PR.

License

MIT — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.github		.github
app		app
docs/assets		docs/assets
tests		tests
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
RELEASING.md		RELEASING.md
SECURITY.md		SECURITY.md
bot.py		bot.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock
vasted		vasted

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

vasted

Demo

Why `vasted`

Requirements

Install

From PyPI (recommended)

From source (development)

Git install (latest main)

Quick Start

Automation / Unattended Mode

Client Profiles and Jinja Behavior

Command Reference

Telegram Bot (Optional)

Development

Project Layout

Security

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

vasted

Demo

Why vasted

Requirements

Install

From PyPI (recommended)

From source (development)

Git install (latest main)

Quick Start

Automation / Unattended Mode

Client Profiles and Jinja Behavior

Command Reference

Telegram Bot (Optional)

Development

Project Layout

Security

Contributing

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Why `vasted`

Packages