Skip to main content

Speech to Text

Transcribe audio to text in 13 languages with speaker detection and word-level timestamps. Handles files up to 3 hours — pay per minute with Bitcoin.

Powered by Mistral Transcription
13 Languages

English, Chinese, Hindi, Spanish, Arabic, French, Portuguese, Russian, German, Japanese, Korean, Italian, Dutch

Speaker Diarization

Identify and label different speakers in multi-person audio

Timestamps

Segment or word-level timestamps for precise timing

Up to 3 hours / 512 MBNoise robustLow word error ratemp3, wav, flac, ogg, m4a
10 sats
per minute
~ $0.01 USD
Disabled
10 sats per minute of audio (≈ $0.01 USD/min)
Empty
No transcription yet.