ComfyOnline
Florence2Run

ComfyUI Node: Florence2Run

Authored by kijai

Created 4 months ago

Updated 24 days ago

672 stars

Category

Florence2

Inputs

image IMAGE

florence2_model FL2MODEL

text_input STRING

task

  • region_caption
  • dense_region_caption
  • region_proposal
  • caption
  • detailed_caption
  • more_detailed_caption
  • caption_to_phrase_grounding
  • referring_expression_segmentation
  • ocr
  • ocr_with_region
  • docvqa

fill_mask BOOLEAN

keep_model_loaded BOOLEAN

max_new_tokens INT

num_beams INT

do_sample BOOLEAN

output_mask_select STRING

Outputs

IMAGE

MASK

STRING

Extension: ComfyUI-Florence2

Nodes to use Florence2 VLM for image vision tasks: object detection, captioning, segmentation and ocr

Authored by kijai

related extension: