ComfyUI ImageCaptioner

A ComfyUI extension for generating captions for your images. Runs on your own system, no external services used, no filter.

Uses various VLMs with APIs to generate captions for images. You can give instructions or ask questions in natural language.

Try asking for:

workflow

Installation

git clone https://github.com/neverbiasu/ComfyUI-ImageCaptioner into your custom_nodes folder
- e.g. custom_nodes\ComfyUI-ImageCaptioner
Open a console/Command Prompt/Terminal etc
Change to the custom_nodes/ComfyUI-ImageCaptioner folder you just created
- e.g. cd C:\ComfyUI_windows_portable\ComfyUI\custom_nodes\ComfyUI-ImageCaptioner or wherever you have it installed
Run pip install -r requirements.txt

Add the node via image -> ImageCaptioner

Supports tagging and outputting multiple batched inputs.

U need to get the API of dashscope from the document