Speaker separation
WhisperX + diarization identifies who is speaking and splits the transcript per speaker with timestamps — perfect for meetings and interviews.
Whisper-large with speaker separation on our own hardware in Sweden. Upload audio, get timestamped text split by speaker. OpenAI-compatible audio endpoint.
WhisperX + diarization identifies who is speaking and splits the transcript per speaker with timestamps — perfect for meetings and interviews.
Call /v1/audio/transcriptions just like OpenAI Whisper. Swap base_url, keep your code.
All transcription runs on our own GPUs in Sweden. No files are sent to third parties and nothing is stored after delivery.
Common audio formats such as mp3, wav, and m4a via an OpenAI-compatible transcription endpoint.
Yes. With speaker separation (diarization), the transcript is split per speaker with timestamps.
On our own hardware in Sweden. The audio never leaves the country and is not stored after the transcript is delivered.