ComfyUI Node: Image to Text 🐼
Authored by zhongpei
Created 9 months ago
Updated 5 months ago
272 stars
Category
fofo🐼/image2prompt
Inputs
model IMAGE2TEXT_MODEL
image IMAGE
query
- Describe this photograph.
- What is this?
- Please describe this image in detail.
- As an AI image tagging expert, please provide precise tags for these images to enhance CLIP model's understanding of the content. Employ succinct keywords or phrases, steering clear of elaborate sentences and extraneous conjunctions. Prioritize the tags by relevance. Your tags should capture key elements such as the main subject, setting, artistic style, composition, image quality, color tone, filter, and camera specifications, and any other tags crucial for the image. When tagging photos of people, include specific details like gender, nationality, attire, actions, pose, expressions, accessories, makeup, composition type, age, etc. For other image categories, apply appropriate and common descriptive tags as well. Recognize and tag any celebrities, well-known landmark or IPs if clearly featured in the image. Your tags should be accurate, non-duplicative, and within a 20-75 word count range.
custom_query STRING
print_log BOOLEAN
Outputs
STRING
Extension: Comfyui_image2prompt
Nodes:Image to Text, Loader Image to Text Model.
Authored by zhongpei