Skip to Content
Back to Marketplace

faster-whisper

Local speech-to-text using faster-whisper, a CTranslate2 reimplementation of OpenAI's Whisper, for 4-6x faster transcription with identical accuracy.

4,554downloads10installs10stars
v1.5.1
cmdopAI & Agentsspeaker diarization, speech-to-text, subtitles, transcription3/2/2026

Overview

Local speech-to-text using faster-whisper, a CTranslate2 reimplementation of OpenAI's Whisper, for 4-6x faster transcription with identical accuracy. With GPU acceleration, expect ~20x realtime transcription.

Key Features

  • Transcribe audio/video files
  • Generate subtitles (SRT, VTT, ASS, LRC, TTML)
  • Identify speakers (diarization labels)
  • Transcribe from URLs (YouTube links and direct audio URLs)
  • Batch process files (glob patterns, directories, skip-existing support)

How It Works

This skill uses faster-whisper to transcribe audio files locally, without relying on API costs or online services. It supports 99+ languages, auto-detection, and multilingual transcription.

Use Cases

  • Transcribe meetings, interviews, podcasts, lectures, and YouTube videos
  • Generate subtitles for broadcast-standard formats
  • Identify speakers in audio files
  • Transcribe podcast feeds and YouTube links
  • Batch process files with ETA shown automatically
  • Transcribe audio with specific terms or jargon-heavy content
  • Preprocess noisy audio before transcription
  • Stream output and clip time ranges
  • Search the transcript and detect chapters
  • Export speaker audio and spreadsheet output

Reviews

No reviews yet.