🦀 Tōki

🔥 Adversarial fine-tuning lab for small LLMs (1B–3B). Break models ⚔️, harden them 🛡️, and measure what actually improves 📊.

🏺 Meaning

Tōki (陶器) — ceramic, shaped under pressure.

Models, like clay, only reveal their strength when stress-tested. Tōki is about forcing models through pressure — adversarial inputs — and reshaping them into something more robust.

🚀 What it is

Tōki is an end-to-end adversarial ML lab:

Generate adversarial prompts (jailbreaks, edge cases, failure modes)
Fine-tune models using LoRA / QLoRA (MLX or HuggingFace)
Evaluate robustness before and after training
Publish:
- adversarial datasets 📦
- hardened model weights 🧠
- evaluation reports 📊

❗ The problem

LLMs are brittle.

They fail under adversarial prompts
They overfit to narrow behaviors
There’s little systematic research on small model robustness

Most teams:

test a few prompts and call it “safe”

Tōki answers:

Do models actually get safer — or just better at passing tests?

🧠 What you learn

Adversarial ML & red-teaming
LoRA / QLoRA fine-tuning
Dataset construction & curation
Robustness evaluation & benchmarking

⚙️ Architecture

🦀 Rust CLI — orchestration, experiments, pipelines
🐍 Python core — training, generation, evaluation

🚀 Quick Start

git clone https://github.com/yourusername/toki.git
cd toki
cargo build

# Python core (no ML deps required for generate/evaluate/report/upload --dry-run)
cd python && pip install -e .
python -m toki generate --count 32 --output dataset.json
python -m toki evaluate --dataset dataset.json
python -m toki run --name baseline --output-dir experiments/runs
python -m toki report experiments/runs/<ts>_baseline/result.json --format both

# Continuous hardening loop (stops at convergence)
python -m toki pipeline \
  --name harden_v1 \
  --iterations 10 \
  --convergence-threshold 0.95 \
  --convergence-window 3

# A/B compare two models on the same adversarial dataset
# (paired t-test + Wilcoxon decide the winner at α=0.05)
python -m toki compare --model-a unsafe --model-b safe --name baseline_ab

# Publish to HuggingFace Hub (requires `pip install -e ".[hf]"`)
python -m toki upload \
  --dataset dataset.json \
  --repo your-username/toki-adversarial-v1 \
  --version 0.4.0

🎯 Vision

Break the model. Fix the model. Prove it.

If you want next step, I can: → unify all 4 under a Konjo umbrella README + architecture diagram (that’s what really makes this pop in interviews)

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.github		.github
demo		demo
python		python
src		src
tests		tests
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
Cargo.toml		Cargo.toml
KONJO_PROMPT.md		KONJO_PROMPT.md
PLAN.md		PLAN.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🦀 Tōki

🏺 Meaning

🚀 What it is

❗ The problem

🧠 What you learn

⚙️ Architecture

🚀 Quick Start

🎯 Vision

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🦀 Tōki

🏺 Meaning

🚀 What it is

❗ The problem

🧠 What you learn

⚙️ Architecture

🚀 Quick Start

🎯 Vision

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages