Object Fusion Pipeline
Motivation
Have you ever fantasized about crafting a single masterpiece from elements of different photos? What if I told you that manually prompting each object's shape, color, and features for diffusion could be a thing of the past? And let's be real, using GPT to describe each object step by step can feel like chasing an AI in slow motion.
Here, I introduce a complete pipeline that takes two images as input and lets you choose which objects from them to fuse seamlessly.
Features
- Combine objects from two different images into a single scene.
- Easy selection and customization of objects to be fused.
- Optimized integration with ComfyUI.
Some Examples
Installation
- Clone this repo into the custom_nodes directory in ComfyUI:
    git clone https://github.com/ducido/ObjectFusion_ComfyUI_nodes
- Clone these amazing repositories and follow their instructions:
  - ComfyUI-SD3-nodes
    Note: Place the three CLIP models in models/clip.
  - ComfyUI-Custom-Scripts
  - img2txt-comfyui-nodes
- Install the required packages:
    conda create -n objectfusion python=3.10 -y
    conda activate objectfusion
    pip install -r custom_nodes/ObjectFusion_ComfyUI_nodes/requirements.txt
    wget https://huggingface.co/camenduru/YoloWorld-EfficientSAM/resolve/main/efficient_sam_s_gpu.jit -P custom_nodes/ObjectFusion_ComfyUI_nodes/Custom_ComfyUI_YoloWorld_EfficientSAM
- Or install with ComfyUI-Manager, which is much easier and faster.
Note
All the folders except CROP_OBJECT come from other repositories. Thank you to their authors for the amazing work; I appreciate it. I have made some minor modifications to fit this project. Here are the details:
- Custom_ComfyUI-YoloWorld-EfficientSAM
  - Added 2 output fields: BBOX, categories.
  - Displayed ID also in the IMAGE output (e.g., {ID} - {class} - {confidence}).
-
  - Removed input field: prompt.
  - Added 4 input fields: object1, desc_obj1, object2, desc_obj2.
  - Changed all default values to "" because the newest ComfyUI frontend treats a None value as a bug.
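To make the input-field changes listed above concrete, here is a minimal sketch of a ComfyUI-style node whose string inputs all default to "" rather than None, plus a helper that builds a label in the {ID} - {class} - {confidence} pattern. It only assumes the usual ComfyUI node conventions (INPUT_TYPES, RETURN_TYPES, FUNCTION, CATEGORY). The class name, category, build_prompt logic, format_label helper, and two-decimal rounding are all hypothetical and are not taken from this repo's code.

```python
class ObjectFusionPromptSketch:
    """Hypothetical node: string inputs default to "" instead of None."""

    # The four input fields named in the notes above.
    FIELD_NAMES = ("object1", "desc_obj1", "object2", "desc_obj2")

    @classmethod
    def INPUT_TYPES(cls):
        # Give every field an explicit empty-string default; None is avoided.
        return {
            "required": {
                name: ("STRING", {"default": ""}) for name in cls.FIELD_NAMES
            }
        }

    RETURN_TYPES = ("STRING",)
    FUNCTION = "build_prompt"
    CATEGORY = "ObjectFusion/sketch"  # hypothetical category name

    def build_prompt(self, object1, desc_obj1, object2, desc_obj2):
        # Assumed behavior: join non-empty "name: description" pairs.
        parts = []
        for name, desc in ((object1, desc_obj1), (object2, desc_obj2)):
            if name:
                parts.append(f"{name}: {desc}" if desc else name)
        return (", ".join(parts),)


def format_label(obj_id, class_name, confidence):
    # Follows the "{ID} - {class} - {confidence}" pattern;
    # rounding to two decimals is an assumption.
    return f"{obj_id} - {class_name} - {confidence:.2f}"
```

Because each default is an empty string, the node can skip an unused slot with a simple truthiness check instead of handling None separately.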
Contributing
Contributions are welcome! Please open an issue or submit a pull request.
License
This project is licensed under the MIT License.