Speech AI tools
-
Speechmatics Foundational Speech TechnologySpeechmatics offers enterprise-grade APIs for Automatic Speech Recognition (ASR) and building Conversational AI products, delivering top transcription accuracy and supporting over 50 languages.
- Freemium
-
openvoiceos.com Community-driven open-source voice AI platform for custom voice-controlled interfacesOpen Voice OS is an open-source voice AI platform that enables developers to create custom voice-controlled interfaces with NLP capabilities, focusing on privacy, security, and multi-platform compatibility.
- Free
-
Accent Voice Discover your unique speech profile with advanced AI accent detection.Accent Voice is an AI tool that analyzes speech patterns to identify English accents and provide insights into pronunciation using advanced AI for real-time feedback.
- Free
-
sesame-ai.pro Crossing the Uncanny Valley of VoiceSesame AI offers advanced AI companions, Maya and Miles, powered by a unique Conversational Speech Model (CSM) for natural, emotionally intelligent voice interactions.
- Contact for Pricing
-
Pronunciation Exercises Free Pronunciation Exercises for Worldwide LanguagesImprove your pronunciation in 15 major languages with this free, AI-powered platform offering guided practice and instant feedback.
- Free
-
ClearCypherAI Generative Audio solutions and datasetsUS-based AI startup ClearCypherAI excels in creating advanced multilingual, multimodal, real-time voice intelligence solutions, including text-to-audio, audio-to-text, and audio-to-audio conversions.
- Contact for Pricing
- API
-
Voicetapp Transform Your Workflow with AI-Powered ToolsVoicetapp is a comprehensive AI platform offering speech-to-text transcription, content writing, voiceover generation, and YouTube-to-blog conversion capabilities with multilingual support and up to 99% accuracy.
- Paid
- From 12$
-
Lemonfox.ai Low Cost, Easy-to-Use Speech-to-Text APILemonfox.ai provides a cost-effective, high-quality speech-to-text API with features like speaker recognition and support for over 100 languages. It also offers LLM Chat and SDXL Image APIs.
- Paid
- From 5$
-
Murf AI AI Voice Generator: Versatile Text to Speech SoftwareMurf AI is a versatile and powerful text to speech software ideal for education, marketing, corporate coaching, podcasting, animation, customer support, and more. With over 120+ voices in 20+ languages, users can create studio-quality voice overs in minutes for videos, presentations, podcasts, and other professional uses.
- Freemium
- From 19$
- API
-
Open Voice OS A community-driven open-source voice AI platform for custom voice-controlled interfacesOpen Voice OS is an open-source voice AI platform that enables developers to create custom voice-controlled interfaces with privacy-focused features, NLP capabilities, and a customizable UI. It supports multiple platforms and devices, making it ideal for DIY smart speaker projects.
- Free
-
MetaVoice Reimagining AI voice experiences.MetaVoice is re-architecting AI models (STT, TTS, LLMs) to create natural, reliable conversational voice experiences, aiming for plug-and-play voice AI solutions.
- Contact for Pricing
-
Fluent.ai Voice enabling the world's devicesFluent.ai provides unique speech-to-intent technology offering offline, noise-robust speech recognition that supports any language and accent.
- Contact for Pricing
-
My Speaking Score The Smart Way to Get 26+ in TOEFL® SpeakingMy Speaking Score is an AI-powered platform utilizing ETS's SpeechRater™ engine to provide instant feedback and scoring for TOEFL® Speaking practice.
- Freemium
-
AI4Bharat Advancing AI Technology for Indian Languages Through Open-Source ContributionsAI4Bharat is an IIT Madras research lab developing open-source AI tools and datasets for Indian languages, focusing on translation, speech recognition, TTS, and LLMs.
- Free
-
Spoken Practice speaking real-life English and Spanish with AI feedbackSpoken helps intermediate language learners improve speaking fluency through daily topics, real native speaker recordings, and AI-powered grammar and vocabulary corrections.
- Freemium
-
AssemblyAI Transform speech into meaning with industry-leading Speech AIAssemblyAI is a comprehensive speech-to-text platform offering advanced AI models for voice data processing, including real-time transcription, speaker diarization, and speech understanding capabilities with up to 95% accuracy.
- Freemium
-
Braina AI Virtual Assistant Your AI-powered virtual assistant for natural language tasks and dictation on PCBraina is an AI virtual assistant and dictation software for Windows that performs tasks through natural language commands, serving as a versatile productivity tool for quick operations.
- Other
-
Vocol AI Turn voice into actionable insightsVocol AI empowers individuals and enterprises to efficiently transform voice data into text and actionable insights, enhancing collaboration and productivity.
- Freemium
- API
-
Accent Guesser Discover Your Accent with AI-Powered AnalysisAccent Guesser is a free online tool that uses AI to analyze your speech and identify your accent characteristics. Get instant, accurate results and insights for pronunciation improvement.
- Free
-
SpeechPulse Voice Typing Anywhere - Speed up your typing using Whisper voice recognitionSpeechPulse is a comprehensive voice typing software that uses Whisper voice recognition to enable real-time speech-to-text conversion across all applications, supporting 99 languages and offline processing for enhanced privacy.
- Pay Once
-
WhisperUI Transform Audio Files into Text Using OpenAI WhisperWhisperUI is a web-based speech-to-text conversion tool that leverages OpenAI's Whisper ASR system to transcribe audio files into text and SRT formats with high accuracy across multiple languages.
- Freemium
-
Duzo.ai Use the power of AI to make your content reach a global audienceDuzo.ai is an AI-powered platform for content translation, voice cloning, lip-syncing, and subtitle generation, helping creators reach a global audience across 29+ languages.
- Freemium
- From 22$
-
BlabbyAI AI-Powered Speech to Text on Any WebsiteBlabbyAI is an AI-driven browser extension that converts voice to text in real-time across any website, increasing productivity and providing customizable transcription modes.
- Freemium
-
SpeakAI - Learn Language AI-powered language app for immersive learning and practice.SpeakAI is an AI-driven language learning app offering interactive practice in multiple languages through real-life scenarios, grammar correction, and diverse voice options.
- Free
-
Phonic Build, Evaluate, and Scale Reliable Voice AI AgentsPhonic is an advanced voice AI platform that enables organizations to develop, monitor, and improve high-reliability conversational voice agents designed for dynamic customer interactions.
- Contact for Pricing
-
SpeakAce Intermediate-Advanced English Speaking App for FluencySpeakAce is an intensive English speaking app designed for intermediate to advanced learners aiming to improve fluency and conversational skills quickly.
- Freemium
-
AppTek.ai A Leader in Generative Artificial Intelligence and Machine Learning for Human Language TechnologiesAppTek.ai is a global leader in AI and ML technologies specializing in speech recognition, neural machine translation, and language processing solutions. Their platform delivers enterprise-grade language technologies across multiple industries using advanced neural networks and machine learning.
- Contact for Pricing
-
Speech Intellect Revolutionize your voice solutions with AI-powered STT/TTS.Speech Intellect offers real-time speech-to-text and text-to-speech solutions using a unique AI-focused mathematical theory, "Sense Theory," for enhanced understanding and generation of human-like voice.
- Usage Based
-
LinguaPeak AI-Powered IELTS Speaking PracticeLinguaPeak offers AI-driven IELTS speaking practice with personalized feedback and real-time analysis to help users improve their scores. It provides mock exams, detailed analytics, and multi-accent training.
- Freemium
- From 20$
-
Resemble AI AI Voice Generator with Text to Speech and Speech to SpeechResemble AI offers a powerful voice AI generator that allows users to create realistic human-like voiceovers in seconds. It enables features like text to speech, speech to speech, neural audio editing, and language dubbing.
- Free Trial
- API
-
VoiceGPT Voice-enabled Genius Personal Teammate & Chat Assistant for AndroidVoiceGPT is a specialized Android browser with voice capabilities that enhances accessibility to AI platforms like ChatGPT, Bing AI, and Bard through speech recognition and text-to-speech features, supporting 67+ languages.
- Freemium
-
TTS Voice Wizard A Voice For EveryoneTTS Voice Wizard offers high-quality speech recognition and synthesis with a wide range of voices and language support. It integrates with various services and provides features like VRChat interaction and heart rate sharing.
- Free
-
French Together Become a confident French speaker with 10 minutes of conversation practice a day.French Together helps users master practical French conversations and boost confidence through its Listen, Speak, Repeat (LSR) method and AI-powered feedback.
- Free Trial
-
Languify AI Communication Coach for Fluent English SpeakingLanguify is an AI-powered communication coach designed to help users improve their English speaking skills through real-time feedback on pronunciation, grammar, and fluency. Practice interviews, presentations, and daily conversations.
- Freemium
- From 9$
-
Talkscriber Build Speech AI Into Your AppsTalkscriber is a secure and cost-effective enterprise-grade speech-to-text platform, delivering high accuracy and advanced features like emotion and purchase intent detection.
- Usage Based
-
ToDoIt Speak, Organize, Achieve — ToDoIt Your Way!A voice-powered AI to-do list manager that converts spoken tasks into organized lists in under 10 seconds, supporting 57 languages and offering intelligent task recommendations.
- Freemium
- From 3$
-
Cockatoo Convert audio or video to text in just seconds with blazing speed and incredible accuracyCockatoo is an AI-powered transcription tool that converts audio and video files to text with 99.8% accuracy in over 90 languages, processing 1 hour of content in just 2-3 minutes.
- Freemium
- From 9$
-
InteliConvo AI-Powered Speech Analytics & Automation PlatformInteliConvo is an AI-powered speech analytics platform that analyzes customer conversations to improve sales, collections, customer experience, and compliance.
- Free Trial
-
Vext Caption Anything, InstantlyVext is an AI-powered speech-to-text tool that provides real-time captions and translations for meetings, events, and videos. It ensures seamless communication and accessibility across various platforms.
- Free
-
YouTube Video Transcripts Extract and Analyze YouTube Video Transcripts with AIYouTube Video Transcripts is an online tool that extracts transcripts from YouTube videos and offers advanced AI-powered features for video content analysis.
- Free
-
langai.io Speak a new language in 30 Days!LangAI is an AI-powered language learning app designed to help users achieve conversational fluency in a new language within 30 days by focusing on common vocabulary and interactive practice.
- Freemium
-
LumenVox Transforming customer engagement with AI-driven speech recognition and voice authentication technologyLumenVox provides AI-powered speech recognition and voice authentication solutions for businesses, offering automatic speech recognition, call progress analysis, voice biometrics, and neural text-to-speech capabilities.
- Contact for Pricing
-
Voice.ai Free Real Time Voice ChangerVoice.ai offers a free real-time AI voice changer and a comprehensive ecosystem of AI voice tools for gaming, streaming, and communication.
- Freemium
- API
-
Muchtodo Use your voice to create projects, tasks, and notesMuchtodo is an AI-powered task management platform that converts voice input into projects, tasks, and notes across 57 languages, helping users save time and boost productivity.
- Free Trial
- From 3$
-
Snips Embedded Voice Recognition Platform (Acquired by Sonos)Snips, an AI platform specializing in embedded voice recognition, has been acquired by Sonos.
- Other
-
Ultravox Build AI voice agents that communicate like humans.Ultravox is an open-weight Speech Language Model (SLM) designed for building highly natural and effective AI voice agents by processing speech directly.
- Usage Based
-
SoapBox Labs Equitable Voice AI for Kids' EducationSoapBox Labs provides equitable voice AI technology specifically designed for children, focusing on educational applications like literacy and language learning assessments.
- Contact for Pricing
-
Akkadu Real-Time AI SubtitlesAkkadu provides real-time AI subtitles for videos, live streams, webinars, and video conferences in over 90 languages. An effective tool to make content accessible across various languages.
- Usage Based
-
Picovoice On-device voice AI & local LLMsPicovoice is a platform for building voice-enabled applications with on-device voice AI and local LLMs, ensuring privacy, low latency, and efficiency.
- Freemium
-
Wavify Unlock edge inference: cloud-level performance at your fingertipsWavify is a platform for on-device speech AI, enabling software engineers to embed features like speech recognition and wake word detection into any software.
- Freemium
- From 150$
Showing 1 to 50 of 54 results
More Categories
Didn't find tool you were looking for?