ComfyUI Node: Transcribe by kotoba-whisper (Long-Form)
Authored by kale4eat
Created 7 months ago
Updated about a month ago
12 stars
Category
speech-dataset-toolkit/ai/kotoba-whisper
Inputs
model KOTOBA_WHISPER_LONG
audio AUDIO
Outputs
STRING
KOTOBA_WHISPER_SEGMENTS
Extension: ComfyUI-speech-dataset-toolkit
Basic audio tools using torchaudio for ComfyUI. It is assumed to assist in the speech dataset creation for ASR, TTS, etc.
Authored by kale4eat