Transcription converts spoken content in audio or video files into written text. The AI identifies different speakers, labels them, and tags audio events like music, applause, or laughter.

How to transcribe

1. Upload your file

Attach an audio or video file using the + button in the input bar. Supported formats include MP3, WAV, MP4, MOV, and other common audio and video formats.

Example prompt: "Transcribe a meeting recording."

2. Wait for processing

The AI processes the file and generates a text transcript. Longer files take more time.

3. Review the transcript

The agent returns the full text with speaker labels. Each segment is tagged with the speaker who said it.
[Image: Chat showing a transcription result with Speaker 1 and Speaker 2 labels on alternating paragraphs]
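
If you copy the transcript out for further processing, the speaker-labeled segments can be split into structured data. The following is a minimal sketch, not part of Runable. It assumes each segment starts with a label in the form "Speaker N:" on its own line, which may not match the exact layout of the transcript you receive. The function name `parse_transcript` is hypothetical.

```python
import re

# ASSUMPTION: each segment begins with "Speaker N:" followed by its text.
# Adjust the pattern if your copied transcript uses a different layout.
SPEAKER_LINE = re.compile(r"^(Speaker \d+):\s*(.*)$")


def parse_transcript(text):
    """Return a list of (speaker, text) tuples from a copied transcript."""
    segments = []
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue  # skip blank lines between paragraphs
        match = SPEAKER_LINE.match(line)
        if match:
            segments.append((match.group(1), match.group(2)))
        elif segments:
            # Unlabeled line: treat it as a continuation of the previous segment.
            speaker, previous = segments[-1]
            segments[-1] = (speaker, f"{previous} {line}".strip())
    return segments


if __name__ == "__main__":
    sample = "Speaker 1: Hello.\nSpeaker 2: Hi there.\nHow are you?\n"
    for speaker, words in parse_transcript(sample):
        print(speaker, "->", words)
```

Lines that appear before any speaker label are dropped in this sketch, since there is no segment to attach them to.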

What transcription includes

Feature | Description
Speaker diarization | The AI identifies different speakers and labels each segment (Speaker 1, Speaker 2, etc.).
Audio event tagging | Non-speech events like music, applause, laughter, and background noise are tagged in the transcript.
Audio and video support | Works with both audio files (MP3, WAV) and video files (MP4, MOV).
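
If you prepare many files before uploading, a quick local check can sort them into audio and video. This is a rough sketch that covers only the four formats named in this article; other common formats are also supported, so an "unknown" result does not mean a file will be rejected. The function name `classify_media` is hypothetical.

```python
from pathlib import Path

# Only the formats named in this article. This is NOT a complete list of
# what the product accepts.
AUDIO_EXTENSIONS = {".mp3", ".wav"}
VIDEO_EXTENSIONS = {".mp4", ".mov"}


def classify_media(filename):
    """Return 'audio', 'video', or 'unknown' based on the file extension."""
    extension = Path(filename).suffix.lower()
    if extension in AUDIO_EXTENSIONS:
        return "audio"
    if extension in VIDEO_EXTENSIONS:
        return "video"
    return "unknown"
```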

When to use this

  • Transcribing meeting recordings for written notes.
  • Converting podcast episodes to text for blog posts or show notes.
  • Creating subtitles or captions from video content.
  • Extracting dialogue from video files for editing or analysis.

What you cannot do

  • You cannot transcribe in real time. Upload a complete file.
  • You cannot assign custom names to detected speakers. Speakers are labeled numerically (Speaker 1, Speaker 2).
  • You cannot transcribe content in multiple languages within the same file. The AI processes one language at a time.
  • You cannot edit the transcript within Runable. Copy the text and edit it in your preferred text editor.
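
Because speakers are labeled numerically and the transcript is edited outside Runable, one workaround is to replace the labels yourself after copying the text. The sketch below assumes the labels appear literally as "Speaker 1", "Speaker 2", and so on, as described above. The function name `rename_speakers` and the example names are hypothetical.

```python
import re


def rename_speakers(transcript, names):
    """Replace numeric labels such as 'Speaker 1' using a label-to-name dict."""

    def swap(match):
        label = match.group(0)
        # Labels without a mapping are left unchanged.
        return names.get(label, label)

    # \d+ consumes the whole number, so 'Speaker 10' is looked up as
    # 'Speaker 10' and is never rewritten by the mapping for 'Speaker 1'.
    return re.sub(r"\bSpeaker \d+\b", swap, transcript)


if __name__ == "__main__":
    copied = "Speaker 1: Hi.\nSpeaker 2: Bye."
    print(rename_speakers(copied, {"Speaker 1": "Alice", "Speaker 2": "Bob"}))
```

Note that this also replaces the label wherever it appears inside the spoken text itself, so review the result before publishing it.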

Next steps