Question 1

Which formats are supported?

Accepted Answer

Common audio formats such as mp3, wav, and m4a via an OpenAI-compatible transcription endpoint.

Question 2

Can I see who said what?

Accepted Answer

Yes. With speaker separation (diarization), the transcript is split per speaker with timestamps.

Question 3

Where is the audio processed?

Accepted Answer

On our own hardware in Sweden. The audio never leaves the country and is not stored after the transcript is delivered.

Transcribe meetings — with who said what

Speaker separation