Does this upload my text to a server?

The text you type is never uploaded by Absolutool. Speech synthesis itself is handled by your browser's built-in engine: on Chrome and Edge the text is sent to Google or Microsoft to render the audio, on Safari it runs entirely on-device. In either case Absolutool never sees or stores your text.

Which languages are supported?

Every language your operating system has a voice installed for. Open the Voice dropdown to see the list. On Windows you can add languages via Settings → Time & Language → Language → Add a language → Speech. macOS and iOS ship many languages by default.

Why can I only see English voices?

Your OS probably only has English voices installed. On Windows install additional language speech packs; on Android install Google's speech services language data; on macOS / iOS most languages are included out of the box.

Does it work on mobile?

Yes, on Chrome Android, Safari iOS, and most mobile browsers that implement the Web Speech API. Firefox does not currently support it, the Speak button will be disabled with a note if you visit in Firefox.

Can I download the audio as a file?

Not with the browser's built-in engine, the Web Speech API doesn't expose the audio stream. If you need a downloadable .wav or .mp3, use a dedicated voice-over app or an API-based service.

Free Text to Speech

Convert any text to speech using your device's built-in voices. Free, no sign-up, no download.

Text 0 characters

Voice

Speed: 1x

Pitch: 1

About This Tool

This Text to Speech tool uses your device's built-in Web Speech API. No downloads, no installation, no account. Voice availability depends on your operating system and browser: on Safari (macOS / iOS) the synthesis happens entirely on-device, while Chrome and Edge send the text to Google or Microsoft's speech service to render the audio. Absolutool itself never sees or stores your text.

To get more voices in your language, install your OS's speech packs, Windows: Settings → Time & Language → Language → Speech. macOS and iOS include dozens of voices out of the box. Android: Settings → Accessibility → Text-to-speech output.

How the Web Speech API Works

Browsers expose a SpeechSynthesis interface (part of the Web Speech API, originally drafted by the W3C Speech API Community Group) that takes text and a chosen voice and produces audible speech via the underlying operating system's TTS engine. The full API surface is small but powerful: speechSynthesis.speak(utterance) starts speech, cancel() / pause() / resume() control playback, and getVoices() lists every voice the OS exposes. Each SpeechSynthesisUtterance carries the text, language tag, voice, rate, pitch, and volume.

The audio itself is generated by the OS, not the browser. macOS and iOS ship with dozens of high-quality voices built into the system. Windows surfaces voices installed via Settings → Time & Language → Speech. Android uses Google's Text-to-Speech engine (or alternatives like Samsung TTS). Linux falls through to whatever speech-dispatcher / espeak setup the distro provides, often robotic-sounding by default unless you've installed a richer engine.

The Cloud-vs-Local Privacy Distinction

Not every "browser" voice runs on your device. Some browsers send the text to a remote server to render the audio for higher-quality voices, then stream the result back. This matters for privacy:

Safari (macOS / iOS): synthesis runs entirely on-device. Apple's voices, including the Siri-style natural ones, are bundled in the OS. No text leaves the device.
Chrome (desktop and Android): for some voices labelled "Google", the text is sent to Google's TTS service to render the audio. Other Chrome voices that mirror local OS voices stay on-device. The SpeechSynthesisVoice.localService property tells you which is which (true = on-device, false = cloud).
Microsoft Edge: similar pattern. Edge's high-quality "Online Natural" voices route text to Microsoft's cloud TTS; the standard OS voices are local.
Firefox: Web Speech API support has historically been limited; on systems where it works, it uses the OS engine.

If your text is sensitive (drafts of confidential documents, internal company memos, anything you wouldn't want copied to a third party) pick a voice marked as local. If you don't see local voices in the dropdown, install OS voice packs and they'll appear there.

Common Use Cases

Accessibility. Screen readers (VoiceOver, NVDA, JAWS, TalkBack) handle the heavy lifting for blind and low-vision users, but a quick TTS tool helps anyone (dyslexia, eye strain, fatigue) get text read aloud occasionally.
Proofreading. Hearing your own writing read back catches awkward sentences, missing words, and rhythm problems that silent reading slides past. Common professional-writer trick.
Language learning pronunciation. Hear words spoken in the target language; helpful when reading a foreign article and unsure how a word sounds.
Reading articles aloud while doing chores. Cooking, cleaning, exercising, commuting, anywhere reading isn't practical but listening is.
Voiceover drafts. Quickly mock up a narration to test pacing before recording with a real voice actor or commissioning a paid TTS service like ElevenLabs.
Education. Generating spoken material for classroom content, vocabulary drills, dictation practice, accessibility for diverse learners.

Quirks and Limitations to Know About

Chrome's long-text cut-off. A long-standing Chromium bug (679437) makes speak() stop after roughly 15 seconds, typically around 200-250 characters. Workarounds split the text into sentence-length chunks and call speak() for each.
The voiceschanged event. The first call to speechSynthesis.getVoices() on Chrome returns an empty array. The voices populate asynchronously; pages need to listen for the voiceschanged event before showing the voice list.
User-gesture requirement. Like autoplay-with-audio, browsers block speech synthesis until the user clicks or taps something. The Speak button satisfies that gesture; programmatic speech on page load won't work.
iOS Low Power Mode. When the iPhone is in Low Power Mode, Safari sometimes refuses to start speech synthesis until the mode is disabled.
Pause / resume bugs on Android Chrome. Pausing and resuming sometimes drops the queue. If reliability matters, restart from speak() rather than relying on pause() / resume().
Out-of-range rate / pitch silently fails. Setting rate above ~3.0 or below 0.1, or pitch above 2.0, causes some engines to produce no audio at all instead of capping the value.

Why Voice Quality Varies So Much

The quality of a TTS voice depends entirely on the underlying engine, which depends on the OS, which depends on what you've installed. The 1990s-era voices (eSpeak, Microsoft Anna, the old Mac "Fred") were synthesised from concatenated phoneme samples and sound robotic and stilted. Modern voices (Apple's Siri voices, Microsoft's Online Natural voices, Google's WaveNet-based voices, ElevenLabs' subscription voices) use deep learning to generate audio that's nearly indistinguishable from a human reader.

If the voices in your dropdown sound robotic, the fix isn't this tool, it's installing better voices in your OS:

Windows: Settings → Time & Language → Speech → Add voices. Microsoft's "Online Natural" voices are dramatically better than the offline defaults.
macOS: System Settings → Accessibility → Spoken Content → System Voice → Manage Voices. Look for "Premium" / "Enhanced" voices; they download in the background and significantly improve quality.
iOS: Settings → Accessibility → Spoken Content → Voices. Same naming convention as macOS.
Android: Settings → Accessibility → Text-to-speech output → Google → Install voice data.
Linux: install festival or mbrola for better-than-eSpeak quality, or use a cloud TTS via API.

Common Mistakes

Expecting Firefox to support it. Firefox's Web Speech API support has lagged. The Speak button will be disabled when you visit in Firefox; use a Chromium-based browser or Safari for reliable TTS.
Pasting confidential text into a Chrome session and assuming it's local. The default Chrome "Google" voices send text to Google's TTS service. Pick a local voice or use Safari for sensitive content.
Long blocks of text in Chrome. The 15-second / ~250-character cut-off catches anyone who pastes a paragraph and expects it to read all the way through. Either split the text or use Safari (no cut-off).
Setting rate or pitch too far out of range. The engine doesn't clamp; it silently produces no audio. Stay within rate 0.5-2.5 and pitch 0.5-1.5 for predictable results.
Treating browser TTS as production-quality voiceover. Even the best browser voices are good enough for proofreading, accessibility, and rough drafts, not for published podcasts or commercial voiceover. For that, look at ElevenLabs, Murf, or similar paid services.
Forgetting that voices download asynchronously. First page visit on Chrome may show no voices; refresh after a moment and they'll appear.

Free Text to Speech

About This Tool

How the Web Speech API Works

The Cloud-vs-Local Privacy Distinction

Common Use Cases

Quirks and Limitations to Know About

Why Voice Quality Varies So Much

Common Mistakes

More Frequently Asked Questions

Related Tools

Speech to Text

Word & Character Counter

Case Converter

Text to Handwriting