OpenAI's Whisper API, AssemblyAI, Deepgram, and Rev are excellent paid speech-to-text services — they bill per minute and need an API key and billing on file. The Forge Transcription API runs OpenAI's Whisper model for you, server-side, for free: POST an audio or video file and get back plain text or ready-to-use SRT/VTT subtitles. No key, no per-minute meter, no signup.
| Feature | Forge Transcription | Whisper API / AssemblyAI / Deepgram / Rev |
|---|---|---|
| Transcribe in the browser | ✓ | ✓ (dashboards / consumer apps) |
| Free REST API for scripts & CI | ✓ (POST /transcribe) | × (API key + billing required) |
| Direct SRT & VTT subtitle output | ✓ | partial (often JSON; subtitle formatting is extra) |
| Video files (audio auto-extracted) | ✓ (MP4, WEBM, etc.) | varies by provider |
| Per-minute charge | × ($0) | ✓ (metered or subscription) |
| Speaker diarization | × | ✓ (AssemblyAI / Deepgram) |
| Real-time streaming | × | ✓ (Deepgram / AssemblyAI) |
| Requires signup / API key | × | ✓ |
The paid services charge per minute of audio and gate everything behind an API key. Forge Transcription is one curl away with no account:
curl -X POST -F "file=@audio.mp3" \
https://forge-node.tail2b516d.ts.net/transcribe
The same call works from Python, Node, Bash, or a cron job. No login, no billing, no key.
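As a rough Python counterpart to the curl command, the sketch below builds the same multipart upload with only the standard library. The endpoint and the `file` field name come from the curl example; the helper names `build_multipart` and `transcribe`, the timeout, and the assumption that the response is UTF-8 text are illustrative and not taken from the API docs.

```python
import mimetypes
import urllib.request
import uuid
from pathlib import Path

# Endpoint and form field name ("file") are taken from the curl example.
ENDPOINT = "https://forge-node.tail2b516d.ts.net/transcribe"


def build_multipart(field: str, filename: str, data: bytes) -> tuple[bytes, str]:
    """Encode one file as a multipart/form-data body, as curl -F does."""
    boundary = uuid.uuid4().hex
    mime = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        f"Content-Type: {mime}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + data + tail, f"multipart/form-data; boundary={boundary}"


def transcribe(path: str, timeout: float = 300.0) -> str:
    """POST a local audio or video file and return the response body as text."""
    p = Path(path)
    body, content_type = build_multipart("file", p.name, p.read_bytes())
    req = urllib.request.Request(
        ENDPOINT,
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        # Assumption: the service answers with UTF-8 text (transcript or subtitles).
        return resp.read().decode("utf-8")
```

Calling `transcribe("audio.mp3")` would then return whatever the endpoint sends back for that file. The generous default timeout is a guess, on the assumption that transcription of longer files takes a while.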
Need a caption file rather than a transcript blob? Ask for SRT or VTT directly — no post-processing step to turn JSON timestamps into a subtitle file:
curl -X POST -F "file=@talk.mp3" \
https://forge-node.tail2b516d.ts.net/transcribe -o subtitles.srt
Swap the output filename to subtitles.vtt for WebVTT. Drop the file straight into a video player or YouTube's subtitle uploader.
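Before handing a subtitle file to a player or uploader, it can be worth checking that it parses. The sketch below reads the conventional SubRip layout (a cue number, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` timing line, then the text); it assumes the saved response follows that layout, and the names `Cue` and `parse_srt` are illustrative.

```python
import re
from dataclasses import dataclass

# SubRip timing line, e.g. "00:00:01,000 --> 00:00:04,200".
TIMING = re.compile(
    r"(\d{2,}):(\d{2}):(\d{2}),(\d{3})\s*-->\s*(\d{2,}):(\d{2}):(\d{2}),(\d{3})"
)


@dataclass
class Cue:
    start: float  # seconds from the start of the media
    end: float
    text: str


def _seconds(h: str, m: str, s: str, ms: str) -> float:
    return int(h) * 3600 + int(m) * 60 + int(s) + int(ms) / 1000


def parse_srt(srt: str) -> list[Cue]:
    """Parse SubRip text into cues, skipping blocks without a timing line."""
    cues = []
    normalized = srt.replace("\r\n", "\n").strip()
    for block in re.split(r"\n\s*\n", normalized):
        lines = block.split("\n")
        # The timing line is normally second, after the cue number;
        # also accept it first in case the number is missing.
        for i, line in enumerate(lines[:2]):
            match = TIMING.match(line.strip())
            if match:
                g = match.groups()
                text = "\n".join(lines[i + 1:]).strip()
                cues.append(Cue(_seconds(*g[:4]), _seconds(*g[4:]), text))
                break
    return cues
```

An empty result from `parse_srt(open("subtitles.srt", encoding="utf-8").read())` suggests the file is not SubRip, for example because plain text came back instead.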
Upload an MP4, WEBM, or other video file and the audio track is extracted and transcribed automatically. You do not need to run ffmpeg first to pull the audio.
Forge Transcription runs OpenAI's open Whisper model (the base model) on the server. You get Whisper-quality output without standing up a GPU, installing the model, or paying the hosted Whisper API's per-minute rate.
The base model trades some accuracy for speed; for noisy or heavily accented multi-speaker audio, a larger model or a paid service is the right call.
The API is free to use: no signup, no API key. It is rate-limited per IP to keep abuse low. Details are in the Transcription API docs.
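Because requests are rate-limited per IP, a batch script may want to pause and retry rather than fail outright. The sketch below is one way to do that. It assumes the limit is signalled with HTTP 429, which this page does not state, and the name `with_backoff`, the retry count, and the delays are all illustrative.

```python
import time
import urllib.error


def with_backoff(call, retries=4, base_delay=2.0, sleep=time.sleep):
    """Run call(), retrying with exponential backoff when it is rate-limited.

    Assumption: the server reports rate limiting as HTTP 429. Any other
    HTTP error is re-raised immediately.
    """
    for attempt in range(retries + 1):
        try:
            return call()
        except urllib.error.HTTPError as err:
            if err.code != 429 or attempt == retries:
                raise
            # Wait 2 s, 4 s, 8 s, ... with the default base_delay.
            sleep(base_delay * 2 ** attempt)
```

Wrap whatever function performs the upload in a zero-argument callable (a `lambda` works) and pass it as `call`. The `sleep` parameter exists so that the waiting can be replaced in tests.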
Upload audio (MP3, WAV, M4A, etc.) or video (MP4, WEBM, etc.). Output is plain text, SRT, or WebVTT subtitles. Files up to 25 MB.
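A local check before uploading can catch the obvious problems early. The sketch below tests a file against the 25 MB limit and the formats named above. It assumes "25 MB" means 25 × 1024 × 1024 bytes, which the page does not specify, and the extension set covers only the formats listed here, so an unlisted extension is a warning rather than proof the file will be rejected.

```python
import os

# Assumption: "25 MB" is binary megabytes; the page does not say which.
MAX_UPLOAD_BYTES = 25 * 1024 * 1024

# Only the formats named on this page; the "etc." means the real list is longer.
KNOWN_EXTENSIONS = {".mp3", ".wav", ".m4a", ".mp4", ".webm"}


def check_upload(path: str) -> list[str]:
    """Return human-readable problems; an empty list means nothing obvious is wrong."""
    problems = []
    size = os.path.getsize(path)
    if size == 0:
        problems.append("file is empty")
    if size > MAX_UPLOAD_BYTES:
        problems.append(
            f"file is {size} bytes, over the {MAX_UPLOAD_BYTES}-byte limit"
        )
    ext = os.path.splitext(path)[1].lower()
    if ext not in KNOWN_EXTENSIONS:
        problems.append(f"extension {ext!r} is not among the formats named here")
    return problems
```

A script could print the returned problems and skip the upload when the list is not empty.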
Transcription uses OpenAI's Whisper base model, running server-side. It handles clear audio well; noisy or heavily accented multi-speaker audio may need a larger or paid model.
Try the Transcription tool, or browse the full free API catalog for 15 more developer APIs in the same no-signup style.