ComfyOnline
SDT_SileroVADApply

ComfyUI Node: Apply Silero VAD

Authored by kale4eat

Created 7 months ago

Updated about a month ago

12 stars

Category

speech-dataset-toolkit/ai/SileroVAD

Inputs

model SILERO_VAD

audio AUDIO

threshold FLOAT

min_speech_duration_ms INT

max_speech_duration_s FLOAT

min_silence_duration_ms INT

window_size_samples INT

speech_pad_ms INT

Outputs

SILERO_VAD_TIMESTAMPS

Extension: ComfyUI-speech-dataset-toolkit

Basic audio tools using torchaudio for ComfyUI. It is assumed to assist in the speech dataset creation for ASR, TTS, etc.

Authored by kale4eat