ComfyUI Node: Apply Silero VAD
Authored by kale4eat
Created 7 months ago
Updated about a month ago
12 stars
Category
speech-dataset-toolkit/ai/SileroVAD
Inputs
model SILERO_VAD
audio AUDIO
threshold FLOAT
min_speech_duration_ms INT
max_speech_duration_s FLOAT
min_silence_duration_ms INT
window_size_samples INT
speech_pad_ms INT
Outputs
SILERO_VAD_TIMESTAMPS
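The input names above appear to mirror the keyword arguments of Silero VAD's `get_speech_timestamps` utility, so the node presumably forwards them to it. The sketch below is not that implementation. It is a simplified, dependency-free illustration of how such parameters typically turn per-window speech probabilities into sample-indexed segments. The function name `probs_to_timestamps`, the 16 kHz sampling rate, and the default values are assumptions made for illustration, and `max_speech_duration_s` (splitting of overly long segments) is deliberately left out.

```python
from typing import Dict, List, Optional


def probs_to_timestamps(
    probs: List[float],
    window_size_samples: int = 512,
    sampling_rate: int = 16000,  # assumed; not listed among the node inputs
    threshold: float = 0.5,
    min_speech_duration_ms: int = 250,
    min_silence_duration_ms: int = 100,
    speech_pad_ms: int = 30,
) -> List[Dict[str, int]]:
    """Hypothetical helper: convert one speech probability per window into
    {"start", "end"} segments measured in samples."""
    min_speech = sampling_rate * min_speech_duration_ms // 1000
    min_silence = sampling_rate * min_silence_duration_ms // 1000
    pad = sampling_rate * speech_pad_ms // 1000
    total = len(probs) * window_size_samples

    segments: List[Dict[str, int]] = []
    start: Optional[int] = None       # first sample of the open segment
    silence_at: Optional[int] = None  # where the current silent run began

    for i, p in enumerate(probs):
        pos = i * window_size_samples
        if p >= threshold:
            silence_at = None  # speech resumed, so cancel the pending close
            if start is None:
                start = pos
        elif start is not None:
            if silence_at is None:
                silence_at = pos
            # Close the segment once silence has lasted min_silence samples.
            if pos + window_size_samples - silence_at >= min_silence:
                # Keep it only if the speech part is long enough.
                if silence_at - start >= min_speech:
                    segments.append({"start": start, "end": silence_at})
                start, silence_at = None, None

    # Flush a segment that is still open when the audio ends.
    if start is not None:
        end = silence_at if silence_at is not None else total
        if end - start >= min_speech:
            segments.append({"start": start, "end": end})

    # Pad each side, clamp to the audio bounds, and avoid overlapping
    # the previous padded segment.
    padded: List[Dict[str, int]] = []
    for seg in segments:
        s = max(0, seg["start"] - pad)
        e = min(total, seg["end"] + pad)
        if padded and s < padded[-1]["end"]:
            s = padded[-1]["end"]
        padded.append({"start": s, "end": e})
    return padded
```

In this sketch, raising `threshold` or `min_speech_duration_ms` discards more borderline speech, while raising `min_silence_duration_ms` merges segments separated by short pauses. `speech_pad_ms` widens each kept segment so that word onsets and tails are not clipped.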
Extension: ComfyUI-speech-dataset-toolkit
Basic audio tools for ComfyUI, built on torchaudio. They are intended to assist in creating speech datasets for ASR, TTS, and similar tasks.