[Guide] Training Hunyuan Video LoRA
Since Hunyuan Video added LoRA support, the community has been actively exploring LoRA training, and more than 80 LoRA models have been created so far. On ComfyOnline, users have collectively trained more than 70 LoRA models since LoRA training was introduced, with a single user training as many as 12. This guide explains in detail how to train a Hunyuan Video LoRA on ComfyOnline. The dataset preparation steps described here also apply to training a Hunyuan Video LoRA on any other platform.
Preparing the Dataset
Image Dataset:
- Image Resolution
The aspect ratio of each image should not exceed 2; that is, the longer side should be at most twice the shorter side. The optimal resolutions are 512 x 512, 512 x 768, or 768 x 512.
- Number of Images
Use 20 to 40 images for training.
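The two image rules above (aspect ratio and image count) can be checked before uploading. The following is a minimal sketch, not part of ComfyOnline: the function names are hypothetical, and it assumes "aspect ratio" means the longer side divided by the shorter side. It takes image sizes as plain (width, height) pairs so it has no dependencies.

```python
from typing import Dict, List, Tuple


def check_image_size(width: int, height: int, max_ratio: float = 2.0) -> bool:
    """Return True if the longer side is at most max_ratio times the shorter side."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    long_side, short_side = max(width, height), min(width, height)
    return long_side / short_side <= max_ratio


def check_dataset(
    sizes: Dict[str, Tuple[int, int]],
    min_images: int = 20,
    max_images: int = 40,
    max_ratio: float = 2.0,
) -> List[str]:
    """Return a list of human-readable problems; an empty list means the checks pass."""
    problems = []
    # Rule from the guide: use 20 to 40 images.
    if not min_images <= len(sizes) <= max_images:
        problems.append(
            f"expected {min_images} to {max_images} images, found {len(sizes)}"
        )
    # Rule from the guide: aspect ratio should not exceed 2
    # (interpreted here as long side / short side, which is an assumption).
    for name, (width, height) in sorted(sizes.items()):
        if not check_image_size(width, height, max_ratio):
            ratio = max(width, height) / min(width, height)
            problems.append(f"{name}: aspect ratio {ratio:.2f} exceeds {max_ratio}")
    return problems
```

The sizes can come from any image library; with Pillow, for example, `Image.open(path).size` returns a (width, height) pair.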
Caption Dataset:
Each caption file should have the same name as its corresponding image file. For example:
1.png corresponds to the caption 1.txt.
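A quick way to confirm that every image has a matching caption file is to scan the dataset folder. This is a minimal sketch under stated assumptions: the function name is hypothetical, and the list of image extensions is a guess, since the guide only shows a .png example.

```python
from pathlib import Path
from typing import List

# Assumption: these are the image types in the dataset; the guide only shows .png.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg", ".webp"}


def find_missing_captions(dataset_dir: str) -> List[str]:
    """Return the names of images that have no same-named .txt caption file."""
    missing = []
    for image in sorted(Path(dataset_dir).iterdir()):
        if image.suffix.lower() not in IMAGE_EXTENSIONS:
            continue
        # 1.png must be accompanied by 1.txt in the same folder.
        if not image.with_suffix(".txt").is_file():
            missing.append(image.name)
    return missing
```

An empty result means every image is paired with a caption.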
File Content:
Describe the image, but leave out any trait you want the LoRA to learn implicitly. For instance, if you are training a character LoRA, the character has red hair in every image, and you don’t want to include “red hair” in prompts when using the LoRA, omit any mention of red hair in the caption files. The LoRA will automatically learn that the character has red hair, and no explicit prompt is needed to generate red-haired characters.
Example caption for a training image of the character:
Natsu_Dragneel stands in the street, hands full of flames.
Since this is a character LoRA, the caption avoids describing the character explicitly and instead focuses on the environment. Including details about clothing would also be beneficial here.
Note: Even if you include some details about the character in the captions, it is not a problem. You can add the same descriptions when using the LoRA, and it will still work fine.
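If you do decide to keep a trait out of the captions, a small script can list the caption files that still mention it. This is only an illustrative sketch: the function name is hypothetical, and it does a plain case-insensitive substring match, so it will not catch rephrasings such as "crimson hair".

```python
from pathlib import Path
from typing import List


def captions_mentioning(dataset_dir: str, phrase: str) -> List[str]:
    """Return the names of .txt caption files whose text contains the phrase."""
    needle = phrase.lower()
    matches = []
    for caption_file in sorted(Path(dataset_dir).glob("*.txt")):
        text = caption_file.read_text(encoding="utf-8")
        # Case-insensitive substring match; rephrasings are not detected.
        if needle in text.lower():
            matches.append(caption_file.name)
    return matches
```

For example, `captions_mentioning("dataset", "red hair")` would return the captions to review by hand, where "dataset" is a placeholder folder name.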
Compress the Files into a ZIP Archive
Prepare the image and caption files, then compress them into a single ZIP file.
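Any archive tool works for this step. For a scripted alternative, the sketch below uses Python's standard zipfile module. The function name is hypothetical, and placing all files at the top level of the archive (no subfolder) is an assumption; the guide does not state which layout ComfyOnline expects.

```python
import zipfile
from pathlib import Path


def zip_dataset(dataset_dir: str, zip_path: str) -> int:
    """Write every file in dataset_dir into zip_path and return the file count."""
    source = Path(dataset_dir)
    # Collect the files first so the archive itself is never included,
    # even if zip_path is inside dataset_dir.
    files = sorted(p for p in source.iterdir() if p.is_file())
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for path in files:
            # arcname=path.name stores the file at the archive root (assumed layout).
            archive.write(path, arcname=path.name)
    return len(files)
```

Calling `zip_dataset("dataset", "dataset.zip")` would produce a single archive containing the images and their captions side by side; both names are placeholders.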
Starting the Training Process
Open the training page:
https://www.comfyonline.app/explore/app/hunyuan-video-lora-train
Upload the Dataset
Upload the prepared ZIP file.
Enter a Trigger Word
Provide a unique trigger word that will activate the LoRA. This trigger word will automatically be added to the caption files. If the caption already includes the trigger word as a prefix, it will not be added again.
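ComfyOnline applies this rule for you, so no scripting is required. To make the behavior concrete, here is a sketch of the rule as described above. It is not the platform's actual code: the function name is hypothetical, and the comma separator between the trigger word and the caption is an assumption.

```python
def add_trigger_word(caption: str, trigger_word: str, separator: str = ", ") -> str:
    """Prefix the caption with the trigger word unless it already starts with it."""
    caption = caption.strip()
    # If the caption already has the trigger word as a prefix, leave it unchanged.
    if caption.startswith(trigger_word):
        return caption
    # An empty caption becomes just the trigger word.
    if not caption:
        return trigger_word
    # The separator is an assumption; the guide does not say how the word is joined.
    return f"{trigger_word}{separator}{caption}"
```

Applying the function twice gives the same result as applying it once, which matches the statement that the trigger word is not added again.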
Wait for Training to Complete
Training takes approximately one hour.
Once done, you’ll find the model_id and LoRA file on the right-hand side. You can download the LoRA file for local use or upload it to platforms like Hugging Face or Civitai.
Using the Trained LoRA
- Upload to Hugging Face
- Upload to Civitai
- Use Directly with ComfyOnline model_id
For more details, refer to:
ComfyOnline Dynamic LoRA in Workflow
With this workflow, you can create amazing videos. Start training your own Hunyuan Video LoRA today!