ComfyOnline
ComfyUI-FunAudioLLM
<!-- * @Author: SpenserCai * @Date: 2024-10-04 11:28:28 * @version: * @LastEditors: SpenserCai * @LastEditTime: 2024-10-06 11:56:47 * @Description: file content -->

ComfyUI-FunAudioLLM

Comfyui custom node for FunAudioLLM include CosyVoice and SenseVoice

Features

CosyVoice

  • CosyVoice Version: 2024-10-04
  • Support SFT,Zero-shot,Cross-lingual,Instruct
  • Support CosyVoice-300M-25Hz in zero-shot and cross-lingual
  • Support SFT's 25Hz(unoffical)
  • <details> <summary>Save and load speaker model in zero-shot</summary> <img src="./assets/SaveSpeakerModel.png" alt="zh-CN" /> <br> <img src="./assets/LoadSpeakerModel.png" alt="zh-CN" /> </details>

SenseVoice

  • SenseVoice Version: 2024-10-04
  • Support SenseVoice-Small
  • <details> <summary>Support Punctuation segment (need turn off use_fast_mode)</summary> <img src="./assets/SenseVoice.png" alt="zh-CN" /> <br> <img src="./assets/PuncSegment.png" alt="zh-CN" /> </details>

How use

apt update
apt install ffmpeg

## in ComfyUI/custom_nodes
git clone https://github.com/SpenserCai/ComfyUI-FunAudioLLM
cd ComfyUI-FunAudioLLM
pip install -r requirements.txt

Windows

In windows need use conda to install pynini

conda install -c conda-forge pynini=2.1.6
pip install -r requirements.txt

If your network is unstable, you can pre-download the model from the following sources and place it in the appropriate directory.

WorkFlow

<img src="./assets/Workflow_FunAudioLLM.png" alt="zh-CN" />