Shady Thinker

GPU-accelerated LLM inference and online learning in pure Rust, powered by WGPU shaders.

What is this?

Shady Thinker is a from-scratch implementation of LLM inference and fine-tuning using Rust + WGPU. No Python, no CUDA — just shaders doing the thinking.

Unlike the reference projects, this is a single-process Rust binary with no Python or Node backend.

Inspiration

TensorBend — a Hugging Face demo showcasing GPU shader-based tensor operations. The original is obfuscated, but it demonstrated that full LLM inference through compute shaders is viable.
JIT-LoRA (code) — a paper and reference implementation for runtime LoRA adaptation, enabling online learning without a separate training pipeline.

Methodology

The shader kernels were developed from descriptions of the required operations rather than ported line-by-line from the reference code. Claude first documented the necessary kernels (matmul, softmax, RMSNorm, RoPE, etc.) at a functional level in SHADER_RESEARCH.md, and only then began implementing them as standalone WGSL compute shaders in a clean repository targeting the WGPU/naga toolchain.

This approach — rewriting from a spec rather than translating source — avoids inheriting obfuscation or framework-specific patterns from the originals, and produces shaders that compose cleanly as building blocks for the full inference pipeline.

Engineering choices around kernel fusion are informed by the SOTA techniques used in TensorBend.

Features

WGSL compute shaders for matmul, softmax, RMSNorm, RoPE, and more
Feature-gated modules: chat for tokenizer support
Designed for portability across any GPU backend WGPU supports (Vulkan, Metal, DX12)

Building

# Core library only
cargo build

# With chat/tokenizer support
cargo build --features chat

Future Work

JIT-LoRA online learning — runtime LoRA adaptation based on the JIT-LoRA paper, enabling on-the-fly fine-tuning during inference. This is gated behind the jit-lora feature flag and is currently a work in progress.
```
cargo build --features jit-lora
```

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 107 Commits
examples		examples
src		src
tests		tests
.gitignore		.gitignore
AUDIO_TODOS.md		AUDIO_TODOS.md
CLAUDE.md		CLAUDE.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SHADER_RESEARCH.md		SHADER_RESEARCH.md
build.rs		build.rs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Shady Thinker

What is this?

Inspiration

Methodology

Features

Building

Future Work

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Shady Thinker

What is this?

Inspiration

Methodology

Features

Building

Future Work

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages