InstantTranscriber API

Speech-to-text API for product teams

Upload audio or video, create a transcription job, and receive clean text, timestamps, speaker labels, summaries, and export links from one REST API.

Use Cases

  • User interview repositories
  • Call recording workflows
  • Voice note and media ingestion

Latency

Jobs typically process at about 10x real-time audio speed. Actual turnaround depends on file length, queue depth, and requested post-processing.
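As a rough planning aid, the 10x figure can be turned into a turnaround estimate. This is a minimal sketch: the function name and the default speed factor are illustrative assumptions, and it ignores queue time and post-processing, which the text above says also affect latency.

```python
def estimate_processing_seconds(audio_seconds: float, speed_factor: float = 10.0) -> float:
    """Estimate processing time for a file, assuming a constant speed factor.

    speed_factor=10.0 mirrors the "roughly 10x audio speed" figure; real jobs
    will vary with queue depth and requested post-processing.
    """
    if audio_seconds < 0:
        raise ValueError("audio_seconds must be non-negative")
    if speed_factor <= 0:
        raise ValueError("speed_factor must be positive")
    return audio_seconds / speed_factor


if __name__ == "__main__":
    one_hour = 60 * 60
    print(f"Estimated processing time: {estimate_processing_seconds(one_hour):.0f} s")
```

Treat the result as a lower bound when sizing polling timeouts, not as a guarantee.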

Pricing

API usage is measured by the audio duration of completed jobs. On the API Plan, overage is billed at $0.49 per audio hour and can be capped.
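The overage arithmetic can be sketched as follows. The $0.49 rate comes from the text above and the 100 included hours from the Limits section. Everything else is an assumption for illustration: that partial hours are prorated, that the cap is expressed in overage hours, and the function and parameter names themselves.

```python
from decimal import Decimal
from typing import Optional, Union

Number = Union[int, float, str, Decimal]

OVERAGE_RATE_PER_HOUR = Decimal("0.49")  # USD per audio hour, from the Pricing text


def overage_cost(used_hours: Number,
                 included_hours: Number = 100,
                 cap_hours: Optional[Number] = None) -> Decimal:
    """Return the estimated overage charge in USD for one billing month.

    Assumptions (not confirmed by the source): partial hours are prorated,
    and cap_hours limits how many overage hours can be billed.
    """
    used = Decimal(str(used_hours))
    included = Decimal(str(included_hours))
    extra = max(used - included, Decimal("0"))
    if cap_hours is not None:
        extra = min(extra, Decimal(str(cap_hours)))
    return (extra * OVERAGE_RATE_PER_HOUR).quantize(Decimal("0.01"))


if __name__ == "__main__":
    # 110 hours used on a 100-hour plan is 10 overage hours.
    print(overage_cost(110))
```

Decimal is used instead of float so that currency values are not affected by binary rounding.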

Code Example

Start a transcription job

Authenticate with a bearer API key from your account settings. Upload a file directly or import a public URL, then poll the job status or wait for a webhook before fetching exports.

curl -X POST https://api.instanttranscriber.com/v1/transcriptions \
  -H "Authorization: Bearer $INSTANTTRANSCRIBER_API_KEY" \
  -F "[email protected]" \
  -F "speaker_labels=enhanced" \
  -F "timestamps=true"
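If you poll instead of waiting for a webhook, the loop might look like the sketch below. It is deliberately transport-agnostic: fetch_status is a hypothetical callable you supply that returns the job as a dict, because the status endpoint is not shown here. The "completed" status appears in the webhook example; the "failed" status is an assumption.

```python
import time
from typing import Any, Callable, Dict


def wait_for_transcript(fetch_status: Callable[[], Dict[str, Any]],
                        interval_seconds: float = 5.0,
                        timeout_seconds: float = 600.0,
                        sleep: Callable[[float], None] = time.sleep,
                        clock: Callable[[], float] = time.monotonic) -> Dict[str, Any]:
    """Poll fetch_status() until the job completes, fails, or times out."""
    deadline = clock() + timeout_seconds
    while True:
        job = fetch_status()
        status = job.get("status")
        if status == "completed":
            return job
        if status == "failed":  # assumed terminal status, not confirmed by the source
            raise RuntimeError(f"transcription failed: {job}")
        if clock() >= deadline:
            raise TimeoutError("transcription did not complete before the timeout")
        sleep(interval_seconds)


if __name__ == "__main__":
    # Simulated responses stand in for real HTTP calls.
    fake = iter([{"status": "processing"}, {"status": "completed", "id": "demo"}])
    print(wait_for_transcript(lambda: next(fake), interval_seconds=0.0))
```

Injecting sleep and clock keeps the loop easy to test without real delays.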

Limits

Plan and job constraints

Free: 1 API audio hour per UTC month

Premium: 8 API audio hours per billing month

API Plan: 100 API audio hours per billing month

Webhooks

Completion callback example

Webhooks are at-least-once notifications. Use the transcript ID and status as your idempotency key, then fetch the transcript by ID for the authoritative result.

{
  "id": "0a2c9f72-0f0b-42f3-a30b-15dc82619500",
  "status": "completed",
  "download_urls": {
    "txt": "https://api.instanttranscriber.com/export/0a2c9f72.txt",
    "docx": "https://api.instanttranscriber.com/export/0a2c9f72.docx"
  }
}
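Because deliveries are at-least-once, a receiver should tolerate duplicates. The sketch below deduplicates on the (id, status) pair, as suggested above. The handler name and the in-memory set are illustrative assumptions; a production receiver would more likely use a database or cache with a unique constraint.

```python
import json
from typing import Optional, Set, Tuple

# In-memory store of processed (transcript id, status) pairs.
# Assumption for illustration only: it is lost on restart.
_seen: Set[Tuple[str, str]] = set()


def handle_webhook(raw_body: bytes,
                   seen: Optional[Set[Tuple[str, str]]] = None) -> bool:
    """Process a completion callback once; return False for duplicates."""
    store = _seen if seen is None else seen
    event = json.loads(raw_body)
    key = (event["id"], event["status"])
    if key in store:
        return False  # already handled, safe to acknowledge and ignore
    store.add(key)
    # At this point, fetch the transcript by event["id"] for the
    # authoritative result rather than trusting the payload alone.
    return True


if __name__ == "__main__":
    body = json.dumps({"id": "0a2c9f72-0f0b-42f3-a30b-15dc82619500",
                       "status": "completed"}).encode()
    print(handle_webhook(body))  # first delivery
    print(handle_webhook(body))  # repeated delivery
```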

FAQ

Speech-to-Text API questions

Can I use the API for both audio and video?

Yes. The API accepts common audio and video files when the media contains a usable audio stream.

Does the speech-to-text API support webhooks?

Yes. Set callback_url and optionally callback_secret to receive a completion callback.
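One way to assemble the request fields is sketched below. The callback_url and callback_secret names come from the answer above, and speaker_labels and timestamps from the curl example. It is an assumption that the callback fields are sent as form fields on the same job-creation request, and build_job_fields is a hypothetical helper, not part of the API.

```python
from typing import Dict, Optional
from urllib.parse import urlparse


def build_job_fields(callback_url: str,
                     callback_secret: Optional[str] = None,
                     speaker_labels: str = "enhanced",
                     timestamps: bool = True) -> Dict[str, str]:
    """Build form fields for a transcription job with a completion callback."""
    parsed = urlparse(callback_url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("callback_url must be an absolute http(s) URL")
    fields = {
        "speaker_labels": speaker_labels,
        "timestamps": "true" if timestamps else "false",
        "callback_url": callback_url,
    }
    if callback_secret is not None:
        fields["callback_secret"] = callback_secret  # optional per the FAQ
    return fields


if __name__ == "__main__":
    print(build_job_fields("https://example.com/hooks/transcripts"))
```

The resulting dict can be passed as form data by whichever HTTP client you use, alongside the media file.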

Build with the Speech-to-Text API

Create an account, generate an API key, and send your first transcription job.