SDT_KotobaWhisperTranscribeLong

ComfyUI Node: Transcribe by kotoba-whisper (Long-Form)

Authored by kale4eat

Created 7 months ago

Updated about a month ago

12 stars

Category

speech-dataset-toolkit/ai/kotoba-whisper

Inputs

model KOTOBA_WHISPER_LONG

audio AUDIO

Outputs

STRING

KOTOBA_WHISPER_SEGMENTS
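
The inputs and outputs listed above map onto ComfyUI's usual custom-node declaration (`INPUT_TYPES`, `RETURN_TYPES`, `FUNCTION`, `CATEGORY`). What follows is a minimal sketch of that shape, not the extension's actual source. The class name `KotobaWhisperTranscribeLongSketch`, the method name `transcribe`, and the assumption that the model is a callable returning a dict with `"text"` and `"chunks"` keys are all hypothetical and used only for illustration.

```python
class KotobaWhisperTranscribeLongSketch:
    """Hypothetical skeleton of a node with the sockets listed above."""

    # Category path as shown in the listing.
    CATEGORY = "speech-dataset-toolkit/ai/kotoba-whisper"
    # Two outputs: the full transcript and the per-segment data.
    RETURN_TYPES = ("STRING", "KOTOBA_WHISPER_SEGMENTS")
    # Name of the method ComfyUI calls when the node executes.
    FUNCTION = "transcribe"

    @classmethod
    def INPUT_TYPES(cls):
        # Two required sockets: the loaded model and the audio to transcribe.
        return {
            "required": {
                "model": ("KOTOBA_WHISPER_LONG",),
                "audio": ("AUDIO",),
            }
        }

    def transcribe(self, model, audio):
        # Assumption: `model` is callable and returns a dict with
        # "text" (str) and "chunks" (list of timestamped segments).
        result = model(audio)
        return (result["text"], result["chunks"])


# Registration tables that ComfyUI reads from a custom-node package.
NODE_CLASS_MAPPINGS = {
    "SDT_KotobaWhisperTranscribeLong": KotobaWhisperTranscribeLongSketch,
}
NODE_DISPLAY_NAME_MAPPINGS = {
    "SDT_KotobaWhisperTranscribeLong": "Transcribe by kotoba-whisper (Long-Form)",
}
```

The method returns a tuple whose order matches `RETURN_TYPES`, which is how ComfyUI pairs returned values with output sockets. A stand-in callable can replace the real model to exercise the skeleton without loading any weights.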

Extension: ComfyUI-speech-dataset-toolkit

Basic audio tools for ComfyUI, built on torchaudio. The extension is intended to assist in creating speech datasets for ASR, TTS, and similar tasks.

Authored by kale4eat