How it works
The Text to Speech tool converts written text to spoken audio directly in your browser using the Web Speech API — no server upload, no API key, and no cost. Choose from the voices available on your device and operating system.
Modern browsers expose the text-to-speech voices built into your operating system at no cost. On Windows 10/11, these include "Microsoft Zira" and "Microsoft David". On macOS, you get "Samantha" and others. iOS and Android provide their own sets of local voices. This tool lists every voice the browser reports for your system.
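As a rough illustration of how a page might organize the voices a browser reports, the sketch below groups voice entries by language tag and puts on-device voices first. It assumes objects shaped like the Web Speech API's SpeechSynthesisVoice (name, lang, localService). The VoiceLike type, the groupVoicesByLanguage helper, and the sample entries are hypothetical and are not taken from this tool's code.

```typescript
// Minimal shape of a SpeechSynthesisVoice; only the fields used here.
interface VoiceLike {
  name: string;
  lang: string;          // BCP 47 language tag, e.g. "en-US"
  localService: boolean; // true when the voice is synthesized on the device
}

// Hypothetical helper: group voice names by language tag, local voices first.
function groupVoicesByLanguage(voices: VoiceLike[]): Map<string, string[]> {
  // Copy before sorting so the caller's array is left untouched.
  const sorted = [...voices].sort(
    (a, b) => Number(b.localService) - Number(a.localService)
  );
  const groups = new Map<string, string[]>();
  for (const voice of sorted) {
    const names = groups.get(voice.lang) ?? [];
    names.push(voice.name);
    groups.set(voice.lang, names);
  }
  return groups;
}

// In a browser, the input would come from speechSynthesis.getVoices().
// Made-up sample data is used here so the sketch runs anywhere.
const sampleVoices: VoiceLike[] = [
  { name: "Remote English", lang: "en-US", localService: false },
  { name: "Microsoft Zira", lang: "en-US", localService: true },
  { name: "Samantha", lang: "en-US", localService: true },
  { name: "Local French", lang: "fr-FR", localService: true },
];
const grouped = groupVoicesByLanguage(sampleVoices);
// grouped.get("en-US") -> ["Microsoft Zira", "Samantha", "Remote English"]
```

Sorting on localService is only one possible ordering. A real dropdown might instead put the browser's default voice first or sort alphabetically.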
How to use it: type or paste your text, select a voice from the dropdown, adjust the speed (0.5× to 2×) and pitch, then click Play. The tool reads the text aloud and highlights the current word as it goes. Use the Pause, Resume, and Stop buttons to control playback. The Download as MP3 option is available when the browser supports the MediaRecorder API.
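Word highlighting of this kind is commonly driven by the "boundary" event of a SpeechSynthesisUtterance, which reports the character index where the next word starts. The sketch below shows one way to turn that index into a highlight range. The wordRangeAt helper and the highlight function mentioned in the comments are hypothetical names, and this is not necessarily how the tool itself does it.

```typescript
// Hypothetical helper: given the full text and the charIndex reported by a
// "boundary" event, return the [start, end) range of the word to highlight.
function wordRangeAt(
  text: string,
  charIndex: number
): { start: number; end: number } {
  // Keep the index inside the text in case the engine reports an odd value.
  const start = Math.max(0, Math.min(charIndex, text.length));
  // The word ends at the next whitespace character, or at the end of the text.
  const offset = text.slice(start).search(/\s/);
  const end = offset === -1 ? text.length : start + offset;
  return { start, end };
}

const sampleText = "Hello brave new world";
const wordRange = wordRangeAt(sampleText, 6);
const highlighted = sampleText.slice(wordRange.start, wordRange.end); // "brave"

// Browser wiring (assumed, not executed here; highlight() is hypothetical):
//   const utterance = new SpeechSynthesisUtterance(sampleText);
//   utterance.onboundary = (event) => {
//     if (event.name === "word") {
//       highlight(wordRangeAt(sampleText, event.charIndex));
//     }
//   };
//   speechSynthesis.speak(utterance);
```

Not every voice fires boundary events, so highlighting may be unavailable with some voices.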
Use cases: listening to your own writing to catch awkward phrasing, generating audio for educational content, checking pronunciation of proper nouns, assistive reading for long documents, language learning (select a native-language voice and listen to correct pronunciation), and accessibility testing for voice-over compatibility.
Limitation: voice availability and quality depend entirely on the voices installed on your operating system. Neural voices (the more natural-sounding ones) are generally limited to newer releases (Windows 10 and later, macOS 12 and later), and whether a given browser exposes them varies. The tool cannot add voices. It only exposes those already on your system.
Frequently Asked Questions
- Available voices depend entirely on what is installed on your operating system. Windows 10/11 ships with its own set of built-in voices, and macOS ships with a different set. To add voices, download them from your OS settings (Windows: Settings → Time & Language → Speech; macOS: System Settings → Accessibility → Spoken Content, or System Preferences → Accessibility → Speech on older releases).
- Audio can be downloaded as an MP3 when the browser supports the MediaRecorder API (available in Chrome and Edge). The audio is recorded directly from the Web Speech API output.
- The tool works without an internet connection once the page has loaded, because the Web Speech API uses voices installed locally on your device. Voices that the browser marks as remote still need a connection.
- Speech rate ranges from 0.5× (half speed, useful for language learning) to 2.0× (double speed). Pitch ranges from 0.5 to 2.0. Both affect real-time playback and any downloaded audio.
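The rate and pitch limits described above can be enforced with a small clamping step before the values are applied to an utterance. This is a minimal sketch that assumes the 0.5 to 2.0 ranges stated in this FAQ. PlaybackSettings, clampSettings, and the fallback of 1 for invalid input are illustrative choices, not the tool's actual code.

```typescript
interface PlaybackSettings {
  rate: number;
  pitch: number;
}

// Ranges taken from the FAQ text: both rate and pitch run from 0.5 to 2.0.
const RATE_RANGE = { min: 0.5, max: 2.0 };
const PITCH_RANGE = { min: 0.5, max: 2.0 };

// Clamp a value into [min, max]. Non-finite input falls back to 1, which is
// the Web Speech API default for both rate and pitch.
function clamp(value: number, min: number, max: number): number {
  if (!Number.isFinite(value)) {
    return 1;
  }
  return Math.min(max, Math.max(min, value));
}

// Hypothetical helper: return settings that are safe to apply to an utterance.
function clampSettings(settings: PlaybackSettings): PlaybackSettings {
  return {
    rate: clamp(settings.rate, RATE_RANGE.min, RATE_RANGE.max),
    pitch: clamp(settings.pitch, PITCH_RANGE.min, PITCH_RANGE.max),
  };
}

const safeSettings = clampSettings({ rate: 3, pitch: 0.1 });
// safeSettings is { rate: 2, pitch: 0.5 }

// In a browser the result would be applied like this (not executed here):
//   utterance.rate = safeSettings.rate;
//   utterance.pitch = safeSettings.pitch;
```

Clamping in one place keeps slider input, typed input, and any saved preferences within the same bounds.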