Audio | podcast | transcribe

Audio | podcast | transcribe

Editor’s note: Before using any transcription tool, think about the security of your data. Read these articles from Politico and the Freedom of the Press Foundation about security issues with these tools and apps. For sensitive interviews, it might be best to transcribe it yourself.

Otter.ai
The Ferrari of transcription tools. Note privacy issues on free version, which give you 600 free minutes of transcription. Paid version has more privacy.

Descript
An AI-powered editor that automatically transcribes your audio and video recordings so that you can edit them just like text. 

Whisper
Audio transcription tool.

Bark – Text to Audio AI Tool
Generates highly realistic, multilingual speech as well as other audio – including music, background noise and simple sound effects – all for free

Resound
Great for removing extra words.

Speechify
Text-to-speech reader. Works with docs and PDFs, etc.

Assembly.ai
Identifies voice and speakers in mixed conversations. 

Murf.ai
Text-to-voice generator

Adobe Enhance Speech
Free AI tool that cleans up your audio.

Spacebar.fm
A voice-to-text AI app that transcribes and  transforms audio content. After recording with Spacebar, a “memory” is generated which provides a thorough recap of the content that’s been captured. Users are then able to explore new perspectives with AI –– chat with a single memory or multiple, recall important details, turn field notes into articles, podcasts, and more.

WhisperTranscribe
Creates podcasts, blogs, and social content

Lovo.ai
AI Voice Generator & Text to Speech platform.

Amazon Polly
Deploy high-quality, natural-sounding human voices in dozens of languages

Wondershare Virbo
A multiple languages video translator that can generate AI voice clones and lip syncs. It automatically generates subtitles, avatars scripts with AI.

Lipdub
A free app from Captions.ai that records short clips in one language and translate

Notta Showcase
Translate videos into 15-plus languages while retaining the original voice.

CloneDub
Convert audio into other languages using the same voices. Only audio files, YouTube, or audio links less than 15 minutes will work. Note: Many ethical concerns with this tool. Fact-check any translations for accuracy.

Poynter: AI Detection Tools for Audio Deepfakes Fall Short
Experts test four tools and show options on what to do.

SynthID
Audio tool from Google Deepmind

Resemble.ai
Deepfake audio detection

Pindrop
Realtime audio deepfake detection

MusicFX
Google’s free text-to-song creation tool

Voxio
Organizes voice recordings into structured text formats.

MetaVoice-1B
Emotional Speech in English. Voice cloning with fine-tuning.

NY Times: ‘A.I. Obama’ and Fake Newscasters: How A.I. Audio Is Swarming TikTok



Peech
Effortlessly transforms any text into incredibly realistic AI-generated audio. Peech supports over 50 languages, including English, French, German, Italian, Spanish, and more. 

Listnr
A free voice AND video generator in multiple languages

OpenVoice
Clone voices with style control and multilingual support

Papercup
AI video and audio dubbing tool

MusicFX
Google tool that lets you describe what you want and it will write a song.

AudioRead
Use AI to listen to articles, PDFs, emails, etc. in your podcast player. Read while walking, driving, cleaning and more.”Read” while walking, driving, cleaning, and more

Superwhisper
Fast and accurate voice to text

Udio Audio Inpainting
Select a portion of an AI-generated music track and regenerate it. Be careful with music rights using this tool. 

Stability AI’s Stable Audio Open
Generates up to 47-second audio samples based on text descriptions. It’s trained on thousands of  royalty-free music samples.

Transcript LOL
Transcribes podcasts, videos and meetings

ElevenLabs Audio Native
Add narration to your blog or news site

PlayAI
Create human-like voice agents

Soundraw
AI music generator

Mubert
Generative AI music for videos and podcasts

ElevenLabs Text to Sound Effects
Generate any sound from a prompt

Pika Labs
Add sound effects to your videos

Synthflow AI
Create conversational AI voice agents without coding.

Jellypod
Transform your inbox into a personalized daily podcast 

izTalk
Translation and speech recognition

OpenVoice
A free tool to clone voices 

Letterly
Record your voice and let AI turn it into well-written text

Ques.ai
Audio-to-text tool

LALAL AI
Remove voices and instrumental audio splitter

Deepgram Aura
A text-to-speech API built for real-time conversations.

CleanVoice
Cleanvoice is an artificial intelligence which removes filler sounds, stuttering and mouth sounds from your podcast or audio recording

CloneDub
Automatically dubs your videos and podcasts

Dubformer
AI-powered dubbing solution with broadcast quality 

Video Insights
Video and audio summarization and transcription

Dubformer
AI audio dubbing tool

Poynter: How Text-to-Speech Technology Can Help Journalists Avoid Copy Errors

Udio New Features
Generate AI music longer than 2 minutes and extend tracks up to 15 minutes

Hive AI Detection
Check photos, video, audio and text. Freemium tool and you can request a demo.

PocketPod
Creates personalized podcasts that match your interests.

Replica Voice AI
Ethical voice AI for creators and business

Chuchotis
Chuchotis (French word for whisper) is a fee-based speech-to-text application based on OpenAI’s whisper. It runs fully locally on any M-silicon-based Mac, to preserve privacy of recordings and transcriptions.

Testing the Best Speech to Text Software in 2024

FlowTunes
Endless AI-generated music for focused work

SpeechTexter
Type with your voice

Beatoven AI
Create royalty-free background music.

SpeakAI
Get transcription, research, data analysis and NLP software.

Udio Audio Prompting
Upload a sound and let AI generate a song


AI Jukebox
In-browser text-to-music generation

 

Soundry AI
AI sound sample VST for music creation and DJing

AIornot.com

Ramble Fix
An audio to text tool. Speak your mind and it will paraphrase/rewrite what you say. Five free uses per month, then $10 monthly for more recordings.

Teachable Machine
Teachable Machine is a web-based Google tool that makes creating machine learning models fast, easy, and accessible to everyone. Train a computer to recognize your own images, sounds, and poses. A fast, easy way to create machine learning models for your sites, apps, and more – no expertise or coding required.

Adobe Speech Enhancer
Speech enhancement makes voice recordings sound as if they were recorded in a professional studio. Be careful with ethics and edits with this tool.

Google AudiopaLM
A large language model for speech understanding and generation. AudioPaLM fuses text-based and speech-based language models, PaLM-2 and AudioLM [Borsos into a unified multimodal architecture that can process and generate text and speech with applications including speech recognition and speech-to-speech translation.

Easy_Peasy.ai
Create images with text, transcribe audio and create audio with this Swiss Army knife of an AI tool

Rizzle AI
Convert text and podcasts into captivating videos

Toasty AI
AI content creation for podcasts

CastMagic
Turn audio into content

Audio Notes AI
Organize thoughts into structured notes with AI

DeepL: Speech to Translation

Article Audio
Convert your article to audio. Free model with paid upgrade.

Wordtune
Rewrite your thoughts in different styles.

UberDuck
Text-to-speech with over  5,000 voices

Castmagic
Turn long-form audio into ready-to-use content assets, instantly. Requires a log-in to PartnerSnack first.

FineVoice
Content recording and editing

Music Maestro GPT
A ChatGPT GPT assistant for music creation

MusicFX DJ
Google has added ‘DJ Mode’ in MusicFX, the generative text-to-music tool powered by Google’s MusicLM

PDFToMP3
Transforms PDFs into MP3 formats

ButterReader
Transforms blogs into audio

SwellAI
AI assistant for podcast producers

Nendo
Open-sourced AI music generation models

Musicgen-remixer
Remix music into other styles

Whisper WebGPU
A real-time in-browser speech recognition with OpenAI Whisper. The model runs fully on-device and supports  transcription across 100 different languages.

MAGNeT
A high-quality text to sound and text-to-music

Gotalk.ai
Text into AI voiceovers

Podsqueeze 2.0
Podcast content repurposing

Hypernatural
Use it to generate audiograms

HearTheWeb
Convert your newsletter or any other text into an engaging podcast

HearTheWeb
Convert newsletters into podcasts.

Readany
Turn any website into a podcast or audiobookMusify
Sing into the software and turn your notes into any instrument you want

KoeApp
Tool transcribes human speeches with AI

Splash Pro
AI music generator. Write what type of song you want and compose a 30-second piece.

Scriber
A transcription tool for 18 languages, developed by independent media

Dub AI
AI-powered voice cloning and translation

Framedrop AI
Automatically converts your videos (podcasts, streams, and vlogs) to TikToks, shorts, and reels

Read This AI
Transform text into high-quality audio effortlessly

Voicenotes
Notetaker that transcribes and provides info on your thoughts

izTalk
Overcome language barriers with instant AI translation 

Text Reader AI
Convert text to speech with free, realistic AI voices

Taped.ai
Convert audio and visuals to text summaries using AI

BlueDotHQ
AI meeting recorder backed by Google

AudioPen
Just hit record. Then start talking. AudioPen will clean things up when you’re done.

TalkNotes
Transcribes your voice notes in more than 50 languages

Lalal
Extract vocal, accompaniment and various instruments from any audio and video. One-time fees start at $15

Zealous
Convert audio files into social media posts. Tool is marketing-driven but has social media desk benefits. Free with reasonable upgrades.

Voicemod
An AI voice creator that lets users make their voices by changing gender, age and tone.

PlayHT
AI-driven text to audio generator. Lets you replicate your voice.

Mubert
Create soundtracks for your projects with AI. Freemium account

Sumly
Audio and podcast summary tool

Voicify
Create AI covers using AI in seconds with Voicify, with hundreds of community-uploaded AI voice models available for creative use now.

Krisp
Noise-canceling app

Altered
Voice-changing software. Be sure to use this ethically and carefully.

VoiceMod
Free real-time voice-changer. Be sure to use this ethically and carefully.

BeyondWords
Text-to-speech tool. Free account with paid upgrades.

Samplab
Edit audio with AI 

Vocal Remover
Separate voice from music out of a song free with powerful AI algorithms

AI Jingle Maker
Create jingles, radio sweepers, podcast intros, audio promos and more.

AssemblyAI | Playground
Production-ready AI models for speech recognition, speaker detection, audio summarization, and more through our API. Quickly test below using any YouTube link, audio file or video file.

RadioInfo Australia: Embracing the AI Wave: How Media Companies Can Successfully Integrate AI Technologies

Narakeet
Text-to-speech tool for creating voiceovers. Pay by the length of the audio.

Cassette.ai
Music generation and text-to-audio tool

CastMagic
Upload and .mp3 and download a transcript.

Wavel
AI video dubbing with realistic voices, subtitles, accents, etc.. 

Flowjin
AI-generated short video clips for audio or video podcasts

Songburst
Mobile AI music generation app with prompt enhancer

Byrdhouse AI 2.0
Multilingual video call AI interpreter

Listnr
Free AI voice and video generator, choose from 900+ voices in 142 languages. Get started for free, download in MP4/MP3/WAV formats.

Listnr
Text-to-voice and text-to-video tool.

Big Speak AI
A free app that generates realistic sounding audio from text. It uses a mix of machine learning algorithms to bring you the best voice generation technology.

Assembly
Exposes AI models for speech recognition, speaker detection, speech summarization, and more.

Poised
AI-powered communication coach that helps you speak with confidence and clarity. Private and secure, an essential tool for digital-first workplaces. Free trial with personalized plans.

Kits.ai
Create AI voices or modify your own with a library of commercial use and officially licensed artist voices.

Brain.fm

BeyondWords
Convert writing into audio. Free for up to 30,000 words, then paid.

Notta
Notta is an AI-based voice-to-text transcription software that supports 104 languages. Notta excels at transcribing and summarizing audio or video files, online meetings, and voice recordings. Additionally, Notta offers a suite of team alignment features, enabling users to schedule and transcribe Zoom, Google Meet, and Teams meetings, among other functionalities.

Wondercraft AI

Boomy – Make Generative Music with Artificial Intelligence

Whisper
Popular AI tool for transcription.

Fathom
Note-taking and transcription tool for Zoom

Media.io
A collection of AI tools ranging from video/image enhancement to audio.

Listen Monster
Free tool that generates subtitles in 97 languages

Wellsaid
Creates texts from voiceovers in seconds

Transistor
Quick and precise speech-to-text conversion for podcasts

Rely.io
AI-enhanced developer portal with voice interaction

PolyAI Pheme
Generate conversational voices for phone-call apps

Nolan Free Script-Writing Software

Decoherence
Make AI music videos

Castmagic
Create podcast notes by uploading an .mp3 file download all your post-production content.

Kaiber
An AI creative lab on a mission to unlock creativity through powerful and intuitive generative audio and video. Upload a song, add a touch of your artistic style, and let its audio analysis technology do the visual work. Monthly plans ranging from $5 to $25 with a free trial.

Audyo AI
Text-to-audio generation tool . Free up to 3 hours a month, then $6 monthly after that.

Suno.ai
Create songs 

Koe Recast
AI-powered voice transformation tool that allows users to transform their voice into different styles such as a narrator, female, or anime. Warning: Ethical issues with using this tool.

AI Phone
Provides live transcription and teal-time translation during phone calls using AI.

TurboScribe
Converts all audio and video files into transcripts

Songburst AI
AI-driven music generator

Papercup
AI dubbing for video

Respeecher
Clones voices for content creators

Bloks
Free desktop note-taking app

Wavel.ai
Dub videos in more than 30 languages. Free with paid upgrades. Be careful with ethics with this tool.

Shownotes
A ChatGPT plugin that converts your lengthy podcasts into brief summaries

Fluxon
Ultrarealistic AI voice generator. Be careful with this tool and be mindful of ethics and transparency with readers.


AI PODCASTING TOOLS

Adobe Podcast
AI audio recording and editing, all on the web. Very similar to Descript.

Podcastle
A cloud recorder and AI-powered editor that lets you record a remote interview, edit and mix

SpeakUp
Generative AI podcasting tool that turns text into podcasts

Auphonic
Upload your file, and Auphonic will automatically optimize your recording and clean it.

Castup
A professional audio and video editing service for podcasters and podcast networks. Castup offers subscriptions starting at $30 per episode, or 40 cents per published minute. Castup also offers a ChatGPT-powered podcast assistant called Castup AI, which can help you record and promote episodes.

Streamyard
A professional live streaming and recording studio in your browser. Record your content, or stream live to Facebook, YouTube, and other platforms. Freemium account with paid models starting at $20 a month.

Audioread: Read. In Audio.
Turns articles, PDFs, etc. into podcasts using this AI-driven tool

Cleanvoice.ai
 AI audio tool that can remove unwanted sounds and speech imperfections.

PodcastDB
Boost podcast production with automated content creation

Listener.fm
Use AI to generate podcast titles, descriptions, and show notes in seconds

Dexa
Search for insights in podcasts.

Wondercraft
Streamline your podcast with only text.

Spotify AI Playlist
Turn any idea into a personalized playlist

Promptcast
Summarizes any podcast with AI

Framedrop
Auto AI highlights from YouTube and Twitch – gaming, podcasts, and more

Listnr
Paste text into the text-to-speech converter, and the app will convert it into one of their 600 voices.

Alitu
AI-powered one-stop shop for podcast production. Record remotely and edit.

StartSpeakup
Turns articles into AI podcasts

Podwise
A learning app for podcast listeners

Dubb
Good tool for recording podcasts.

Newsroom Robots: Podcast About AI

Google’s Music LM
Create your own music in Google’s test kitchen.

Mubert
AI-generated music.

Boomy
AI music generator. Free version with paid models ranging $10 to $30 a month.

Listen Notes
A ChatGPT plug-in that allows you to search for podcasts by person or topic.