InstantTranscriber API
Speech-to-text API for product teams
Upload audio or video, create a transcription job, and receive clean text, timestamps, speaker labels, summaries, and export links from one REST API.
Use Cases
- User interview repositories
- Call recording workflows
- Voice note and media ingestion
Latency
Jobs usually process at roughly 10x audio speed, depending on file length, queue depth, and requested post-processing.
Pricing
API usage is counted by completed audio duration. API Plan overage can be capped and is billed at $0.49 per audio hour.
Code Example
Start a transcription job
Use a bearer API key from account settings. Send files directly or import a public URL, then poll status or wait for a webhook before fetching exports.
curl -X POST https://api.instanttranscriber.com/v1/transcriptions \
-H "Authorization: Bearer $INSTANTTRANSCRIBER_API_KEY" \
-F "[email protected]" \
-F "speaker_labels=enhanced" \
-F "timestamps=true"Limits
Plan and job constraints
Free: 1 API audio hour per UTC month
Premium: 8 API audio hours per billing month
API Plan: 100 API audio hours per billing month
Webhooks
Completion callback example
Webhooks are at-least-once notifications. Use the transcript ID and status as your idempotency key, then fetch the transcript by ID for the authoritative result.
{
"id": "0a2c9f72-0f0b-42f3-a30b-15dc82619500",
"status": "completed",
"download_urls": {
"txt": "https://api.instanttranscriber.com/export/0a2c9f72.txt",
"docx": "https://api.instanttranscriber.com/export/0a2c9f72.docx"
}
}FAQ
Speech-to-Text API questions
Can I use the API for both audio and video?
Yes. The API accepts common audio and video files when the media contains a usable audio stream.
Does the speech-to-text API support webhooks?
Yes. Set callback_url and optionally callback_secret to receive a completion callback.
Build with the Speech-to-Text API
Create an account, generate an API key, and send your first transcription job.