lightdash
diff --git a/‎docs.json‎
Lines changed: 17 additions & 3 deletions b/‎docs.json‎
Lines changed: 17 additions & 3 deletions
diff --git a/‎self-host/nats-workers.mdx‎
Lines changed: 0 additions & 41 deletions b/‎self-host/nats-workers.mdx‎
Lines changed: 0 additions & 41 deletions
diff --git a/‎self-host/nats-workers/overview.mdx‎
Lines changed: 182 additions & 0 deletions b/‎self-host/nats-workers/overview.mdx‎
Lines changed: 182 additions & 0 deletions
@@ -295,8 +295,14 @@
               "self-host/enterprise-on-prem",
               "self-host/self-host-lightdash-docker-compose",
               "self-host/update-lightdash",
-              "self-host/pre-aggregates",
-              "self-host/nats-workers",
+              {
+                "group": "NATS workers",
+                "pages": [
+                  "self-host/nats-workers/overview",
+                  "self-host/nats-workers/warehouse-workers",
+                  "self-host/nats-workers/pre-aggregate-workers"
+                ]
+              },
               {
                 "group": "Customize deployment",
                 "pages": [
@@ -423,6 +429,14 @@
       "source": "/references/pre-aggregates",
       "destination": "/references/pre-aggregates/overview"
     },
+    {
+      "source": "/self-host/pre-aggregates",
+      "destination": "/self-host/nats-workers/pre-aggregate-workers"
+    },
+    {
+      "source": "/self-host/nats-workers",
+      "destination": "/self-host/nats-workers/warehouse-workers"
+    },
     {
       "source": "/guides/ai-analyst",
       "destination": "/guides/ai-agents"
@@ -677,4 +691,4 @@
       "display": "simple"
     }
   }
-}
+}
@@ -0,0 +1,182 @@
+---
+title: "NATS workers"
+sidebarTitle: "Overview"
+description: "Scale Lightdash query processing with dedicated NATS worker pods using the Helm chart."
+---
+
+<Badge color="blue" size="md" shape="pill">Helm chart</Badge>
+
+<Callout icon="wrench" color="#6B7280">
+  This page is for engineering teams self-hosting their own Lightdash instance.
+</Callout>
+
+By default, Lightdash processes all queries on the main API server. NATS workers move query execution onto dedicated pods, improving responsiveness under load and letting you scale query capacity independently.
+
+Lightdash uses [NATS](https://nats.io/) — a lightweight, high-performance messaging system — with [JetStream](https://docs.nats.io/nats-concepts/jetstream), its built-in persistent streaming layer, to distribute work between the API server and worker pods.
+
+NATS powers two opt-in features in Lightdash:
+
+<CardGroup cols={2}>
+  <Card title="Warehouse workers" icon="database" horizontal href="/self-host/nats-workers/warehouse-workers">
+    Process interactive and background warehouse queries on dedicated pods.
+  </Card>
+  <Card title="Pre-aggregate workers" icon="layer-group" horizontal href="/self-host/nats-workers/pre-aggregate-workers">
+    Materialize pre-aggregates and serve queries from DuckDB.
+  </Card>
+</CardGroup>
+
+## Requirements
+
+- **Helm chart** version **2.7.2** or later
+- **Lightdash** version [**0.2675.0**](https://hub.docker.com/r/lightdash/lightdash/tags) or later. Older images will fail with `MODULE_NOT_FOUND`.
+
+<Note>
+  Upgrading the Helm chart alone does not change how Lightdash works. NATS features are entirely opt-in — your existing deployment will behave exactly the same until you explicitly enable the new Helm values described below.
+</Note>
+
+## Architecture
+
+```mermaid
+flowchart LR
+    API[Lightdash API] -->|publish job| NATS[NATS JetStream]
+    NATS -->|deliver message| Worker[Worker pod<br/>concurrency: 100]
+    Worker -->|return result| API
+```
+
+The Lightdash API publishes jobs to NATS JetStream. Worker pods consume messages from their stream and process them concurrently (default 100 concurrent jobs per pod).
+
+## Enabling NATS
+
+You should be using the [Helm chart](/self-host/self-host-lightdash) to deploy Lightdash with NATS workers.
+
+### Warehouse queries only
+
+If you only need to scale warehouse query processing:
+
+```yaml
+nats:
+  enabled: true
+warehouseNatsWorker:
+  enabled: true
+```
+
+### With pre-aggregates
+
+To also enable [pre-aggregates](/references/pre-aggregates/overview), add the pre-aggregate worker and required storage configuration:
+
+```yaml
+nats:
+  enabled: true
+warehouseNatsWorker:
+  enabled: true
+preAggregateNatsWorker:
+  enabled: true
+
+# S3 storage for materialized data
+configMap:
+  S3_ENDPOINT: "https://s3.us-east-1.amazonaws.com"
+  PRE_AGGREGATE_RESULTS_S3_BUCKET: "my-lightdash-pre-aggs"
+  PRE_AGGREGATE_RESULTS_S3_REGION: "us-east-1"
+secrets:
+  PRE_AGGREGATE_RESULTS_S3_ACCESS_KEY: "your-access-key"
+  PRE_AGGREGATE_RESULTS_S3_SECRET_KEY: "your-secret-key"
+```
+
+<Note>
+  Pre-aggregates require an [Enterprise license key](/self-host/customize-deployment/enterprise-license-keys) and a **dedicated S3 bucket** separate from your main Lightdash results cache bucket. See [Pre-aggregate workers](/self-host/nats-workers/pre-aggregate-workers) for details.
+</Note>
+
+### What each setting does
+
+We recommend enabling these incrementally so you can validate each step:
+
+| Setting | What changes | Risk level |
+| --- | --- | --- |
+| `nats.enabled: true` | Deploys the NATS StatefulSet. No queries are routed through it yet. | Low — just adds infrastructure, no behavior change. |
+| `warehouseNatsWorker.enabled: true` | All warehouse query execution moves from the API server to dedicated worker pods via NATS. | Higher — changes the query execution path for all users. |
+| `preAggregateNatsWorker.enabled: true` | Enables the pre-aggregates feature and deploys the pre-aggregate worker. | Low — queries only use pre-aggregates after you [define them in your dbt YAML](/references/pre-aggregates/getting-started). Until then, nothing changes. |
+
+## Auto-configured environment variables
+
+The chart automatically sets these environment variables in the shared ConfigMap — you do not need to set them manually:
+
+| Variable | Set when | Value |
+| --- | --- | --- |
+| `NATS_ENABLED` | `nats.enabled: true` | `"true"` |
+| `NATS_URL` | `nats.enabled: true` | `nats://<release>-nats:4222` |
+
+Additional environment variables are auto-configured per worker deployment — see [Warehouse workers](/self-host/nats-workers/warehouse-workers) and [Pre-aggregate workers](/self-host/nats-workers/pre-aggregate-workers) for details.
+
+## NATS JetStream configuration
+
+JetStream supports two storage backends — we default to memory store, but you can switch to file store depending on your needs.
+
+### Memory store vs file store
+
+| | Memory store (default) | File store |
+| --- | --- | --- |
+| **How it works** | Messages are held in RAM | Messages are persisted to disk |
+| **Performance** | Faster — no disk I/O overhead | Slower — writes go through disk |
+| **Persistence** | Messages are lost if NATS restarts | Messages survive NATS restarts |
+| **Infrastructure** | No PersistentVolumeClaim needed | Requires a PersistentVolumeClaim |
+| **When to use** | Most deployments. Lightdash messages are small (just a query UUID) and are deleted once processed. | High message volume exceeding available RAM, or if you need messages to survive NATS pod restarts. |
+
+For more details, see the NATS documentation on [JetStream storage](https://docs.nats.io/nats-concepts/jetstream/streams#storage-overhead) and [memory vs file store](https://docs.nats.io/running-a-nats-service/nats_admin/jetstream_admin/streams#extracting-stream-details).
+
+### Default configuration
+
+Our recommended default configuration:
+
+```yaml
+nats:
+  enabled: true
+  config:
+    cluster:
+      enabled: false          # single-node NATS, no clustering
+    jetstream:
+      enabled: true
+      fileStore:
+        enabled: false         # no disk persistence
+      memoryStore:
+        enabled: true
+        maxSize: 1Gi           # max memory for message storage
+```
+
+| Setting | Default | Description |
+| --- | --- | --- |
+| `nats.config.jetstream.memoryStore.enabled` | `true` | Enable memory-backed storage |
+| `nats.config.jetstream.memoryStore.maxSize` | `1Gi` | Maximum memory for JetStream message storage |
+| `nats.config.jetstream.fileStore.enabled` | `false` | Enable disk-backed storage |
+| `nats.config.cluster.enabled` | `false` | Single-node NATS (no clustering) |
+
+The 1Gi memory store default is sufficient for most workloads. If you need to increase it:
+
+```yaml
+nats:
+  config:
+    jetstream:
+      memoryStore:
+        maxSize: 2Gi
+```
+
+To switch to file store instead, disable memory store and enable file store with a PersistentVolumeClaim:
+
+```yaml
+nats:
+  config:
+    jetstream:
+      memoryStore:
+        enabled: false
+      fileStore:
+        enabled: true
+        dir: /data
+        pvc:
+          size: 10Gi
+```
+
+### Pod disruption
+
+NATS is a stateful component — if the NATS pod restarts, in-flight messages are lost (queries will be retried by users). The chart protects against unplanned eviction with:
+
+- `cluster-autoscaler.kubernetes.io/safe-to-evict: "false"` annotation
+- `PodDisruptionBudget` with `maxUnavailable: 0`