This README provides a quick and practical guide for preparing data, configuring training, and running LoRA fine-tuning for Kandinsky models.
## Step 1: Setup

After cloning this repo, don't forget to run:

```bash
git submodule update --init --remote
```

Download all required pretrained models with `kandinsky5/download_models.py` and place them into:

```
kandinsky5/weights
```
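A typical invocation, assuming the script takes no required arguments, might look like:

```bash
# Assumed invocation; check the script for its actual arguments.
python kandinsky5/download_models.py
```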
## Step 2: Prepare the Data

Prepare a directory containing pairs:

- `*.mp4` or `*.png` — the media sample
- `*.txt` — caption for the same sample
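For example, a valid dataset directory might look like this (file names are illustrative; pairing captions by matching file stem is an assumption based on the "pairs" wording):

```
data/
├── clip_0001.mp4
├── clip_0001.txt
├── image_0002.png
└── image_0002.txt
```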
## Step 3: Encode the Data

Then:

- Open `encode/encode.sh`
- Set correct local paths for the input data and output directories
- Run:

```bash
bash encode/encode.sh
```

This will generate:

```
cache/latents_image/
cache/text_embeds/
```
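The layout below is a sketch; per-sample file names are illustrative, and it assumes the encode step also writes the unconditional embedding `null.pt` referenced in Step 5:

```
cache/
├── latents_image/
│   ├── clip_0001.pt
│   └── ...
└── text_embeds/
    ├── clip_0001.pt
    ├── ...
    └── null.pt
```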
## Step 4: Choose the Training Config

- T2I → `configs/lora_image.yaml`
- T2V / I2V → `configs/lora_video.yaml`

Update in the selected config:

- `experiment_dir`
- `log_dir`
- `checkpoint_dir`
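For example (paths are illustrative, and the keys are assumed to sit at the top level of the YAML):

```yaml
experiment_dir: /path/to/experiments/my_lora_run
log_dir: /path/to/experiments/my_lora_run/logs
checkpoint_dir: /path/to/experiments/my_lora_run/checkpoints
```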
## Step 5: Edit the Dataloader Config

Then edit the dataloader configs: `configs/data/lora_*_dataloader.yaml`.

Set:

- `latents_dir` → path to the latents from Step 3
- `text_embeds_dir` → path to the text embeds from Step 3
- `uncond_embed` → `text_embeds_dir` + `/null.pt`
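A minimal sketch, assuming these keys sit at the top level of the dataloader YAML:

```yaml
latents_dir: /path/to/cache/latents_image
text_embeds_dir: /path/to/cache/text_embeds
uncond_embed: /path/to/cache/text_embeds/null.pt
```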
## Step 6: Edit the Trainer Config

Edit:

`configs/trainer/lora*.yaml`

Configure:

- `devices` → number of GPUs
- Optional: LoRA architecture parameters
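A sketch of the relevant settings; `devices` comes from this README, while the LoRA parameter names below (`rank`, `alpha`) are hypothetical placeholders, so check the shipped config for the real keys:

```yaml
devices: 4   # number of GPUs to train on

# Optional LoRA architecture parameters (key names are illustrative):
lora:
  rank: 32
  alpha: 32
```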
## Step 7: Run Training

Choose the correct config inside `train.sh`:

- `configs/lora_video.yaml` for T2V / I2V
- `configs/lora_image.yaml` for T2I

Set `--nproc_per_node` to your number of GPUs, then run:

```bash
bash train.sh
```

Note: FSDP is enabled by default.
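For reference, assuming `train.sh` wraps `torchrun` (the entry-point file name and `--config` flag below are assumptions), the launch line inside it might resemble:

```bash
# Hypothetical launch line; adjust --nproc_per_node to your GPU count.
torchrun --nproc_per_node=4 train.py --config configs/lora_video.yaml
```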