Speech to Text

Transcribe audio to text in 13 languages with speaker detection and word-level timestamps. Handles files up to 3 hours long. Pay per minute with Bitcoin.

Powered by Mistral Transcription

13 Languages

English, Chinese, Hindi, Spanish, Arabic, French, Portuguese, Russian, German, Japanese, Korean, Italian, Dutch

Speaker Diarization

Identify and label different speakers in multi-person audio

Timestamps

Segment-level or word-level timestamps for precise timing

Up to 3 hours / 1 GB
Noise robust
Low word error rate
Audio: mp3, wav, flac, ogg, aac, opus, wma
Video: mp4, mkv, avi, mov, webm
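
The limits and formats above can be checked locally before uploading. The following is a minimal sketch, not part of the service: the function name `check_upload` is hypothetical, the caller is assumed to already know the file's size and duration, and 1 GB is assumed to mean 10^9 bytes (the page does not say whether it means GB or GiB).

```python
from pathlib import Path

# Formats listed on the page.
AUDIO_EXTS = {"mp3", "wav", "flac", "ogg", "aac", "opus", "wma"}
VIDEO_EXTS = {"mp4", "mkv", "avi", "mov", "webm"}

MAX_SECONDS = 3 * 60 * 60   # "up to 3 hours"
MAX_BYTES = 1_000_000_000   # assumption: 1 GB = 10^9 bytes


def check_upload(filename: str, size_bytes: int, duration_seconds: float) -> list[str]:
    """Return a list of problems; an empty list means the file fits the stated limits."""
    problems = []

    # Compare the extension case-insensitively, without the leading dot.
    ext = Path(filename).suffix.lower().lstrip(".")
    if ext not in AUDIO_EXTS | VIDEO_EXTS:
        problems.append(f"unsupported format: {ext or '(none)'}")

    if size_bytes > MAX_BYTES:
        problems.append("file is larger than 1 GB")

    if duration_seconds > MAX_SECONDS:
        problems.append("audio is longer than 3 hours")

    return problems
```

For example, `check_upload("talk.mp3", 5_000_000, 600)` returns an empty list, while a two-gigabyte, four-hour `.txt` file would be reported with three problems.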
10 sats per minute of audio (≈ $0.01 USD/min)
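
As a rough illustration of the per-minute pricing, the sketch below estimates the cost of a file from its duration. The function name `estimate_cost_sats` is hypothetical, and the page does not say how partial minutes are billed, so rounding up to the next whole minute is an assumption that can be switched off.

```python
import math

SATS_PER_MINUTE = 10  # rate stated on the page


def estimate_cost_sats(duration_seconds: float, round_up: bool = True) -> float:
    """Estimate the transcription cost in sats for a given audio duration."""
    if duration_seconds < 0:
        raise ValueError("duration must be non-negative")

    minutes = duration_seconds / 60
    if round_up:
        # Assumption: a started minute is billed as a full minute.
        minutes = math.ceil(minutes)

    return minutes * SATS_PER_MINUTE
```

Under these assumptions a maximum-length 3-hour file (180 minutes) comes to 1800 sats, or roughly $1.80 at the approximate rate quoted above. The USD figure moves with the Bitcoin exchange rate.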