wan video effect collection

flux redux image to anime

flux redux image to anime

cogVideoX-Fun Video to Video

SDT_SileroVADApply

ComfyUI Node: Apply Silero VAD

Authored by kale4eat

Created 7 months ago

Updated about a month ago

12 stars

Category

speech-dataset-toolkit/ai/SileroVAD

Inputs

model SILERO_VAD

audio AUDIO

threshold FLOAT

min_speech_duration_ms INT

max_speech_duration_s FLOAT

min_silence_duration_ms INT

window_size_samples INT

speech_pad_ms INT

Outputs

SILERO_VAD_TIMESTAMPS

Extension: ComfyUI-speech-dataset-toolkit

Basic audio tools using torchaudio for ComfyUI. It is assumed to assist in the speech dataset creation for ASR, TTS, etc.

Authored by kale4eat

related extension:

photo styling v2 - flux pulid + redux + style lora + depth controlnet

photo styling v2 - flux pulid + redux + style lora + depth controlnet

flux redux anime to real

flux redux anime to real

hunyuanvideo video to video