Text-to-Speech component that connects Home Assistant to OpenAI's TTS API and any OpenAI-compatible backend.
OpenAI TTS turns text into speech inside Home Assistant. It works with the official OpenAI Audio Speech API and any compatible self-hosted backend (Chatterbox, pocket-tts, LocalAI, TTS Web UI, and others). Configure one or more TTS agents per OpenAI account, target announcements at any media player, optionally prepend a chime, normalise loudness for small speakers, and have the original volume and music restored after the announcement.
- 2-step config flow: profile setup is split into model first, then voice and audio. The voice picker filters by the chosen model so incompatible voices (e.g. `marin` on `tts-1`) cannot be saved.
- Audio format selector: pick `mp3`, `opus`, `aac`, `flac`, `wav` or `pcm` per profile. The chosen format is requested from OpenAI (or any compatible backend) and delivered end-to-end without a forced mp3 round-trip.
- New voice catalog: `marin`, `cedar`, `ballad`, `verse` (gpt-4o-mini-tts only) with model-compatibility validation.
- Format-aware audio pipeline: chimes are transcoded on demand to match the TTS codec; chime-only requests skip the TTS decode/encode round-trip via `-c copy`.
- Volume-restore overhold fix: blocking TTS targets (Music Assistant, Sonos) no longer hold extra time after the audio finishes.
- Cached audio reliability (issue #64): stale failure sentinels no longer block cached audio playback after a recovered API error.
- Tag-triggered auto-release: pushing a `v*` tag creates the GitHub release.
- Text-to-Speech via OpenAI's Audio Speech API or any compatible backend.
- Multiple TTS agents under one or more OpenAI accounts. Each agent has its own voice, model, speed, audio format and audio-processing settings.
- Models: `tts-1`, `tts-1-hd`, `gpt-4o-mini-tts` (with custom speaking-style instructions).
- Voices: the full OpenAI catalog including `alloy`, `ash`, `coral`, `echo`, `fable`, `nova`, `onyx`, `sage`, `shimmer`, plus the gpt-4o-mini-tts-only voices `ballad`, `cedar`, `marin`, `verse`.
- Audio formats: `mp3`, `opus`, `aac`, `flac`, `wav`, `pcm` per profile.
- Streaming playback with HA 2025.7+ for low first-audio latency. Falls back automatically when post-processing is needed.
- Chime prefix with a user-configurable library (drop your own mp3 in `config/custom_components/openai_tts/chime`).
- Loudness normalisation for small speakers and mobile playback.
- Volume restoration to the original speaker level after the announcement.
- Media pause and resume during the announcement on supported platforms.
- Sonos announcement feature with native group handling.
- Multi-target playback with cast warm-up sync to keep multiple speakers aligned.
- API health sensor that surfaces auth, quota, rate-limit and connectivity errors.
- Custom-endpoint support with optional API key, custom voice text input, and `extra_payload` for backend-specific JSON parameters.
- 54 languages available through the HA Assist pipeline.
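As a sketch of the multi-target playback feature above (the entity and agent IDs are placeholders, not part of the integration), a single call can list several media players and the integration keeps them aligned:

```yaml
# Hypothetical entity IDs; replace with your own speakers and TTS agent.
action: openai_tts.say
target:
  entity_id:
    - media_player.kitchen_speaker
    - media_player.bedroom_cast
data:
  tts_entity: tts.openai_tts_living_room
  message: "The laundry is done"
  chime: true
```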
- Open HACS in the sidebar.
- Search for OpenAI TTS in Integrations.
- Download the integration and restart Home Assistant.
- Add the integration via Settings → Devices & Services → Add Integration → OpenAI TTS. Enter the API key (or leave empty for a custom endpoint without auth).
- Add one or more TTS agents (sub-entries) for the voice and audio configurations you want.
- Copy the contents of `custom_components/openai_tts/` into `<config>/custom_components/openai_tts/`.
- Restart Home Assistant.
- Add the integration via Settings → Devices & Services as above.
Each integration entry stores the API credentials and endpoint. Each sub-entry (TTS agent) stores the per-profile settings:
- Model and voice (filtered by model compatibility).
- Speed (0.25 - 4.0).
- Audio format (mp3 default, others on demand).
- Custom instructions (gpt-4o-mini-tts only) for speaking style.
- Extra JSON payload for custom backends.
- Chime, chime sound and normalise audio as defaults that the service call can override.
Enabling chime or normalise audio disables streaming for that profile, since the audio has to be assembled in full before playback.
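The default-plus-override behaviour can be sketched as a service call that turns off a profile's default chime and normalisation for one announcement (entity IDs are placeholders; whether streaming resumes for such a call depends on the integration's handling):

```yaml
action: openai_tts.say
target:
  entity_id: media_player.bathroom_speaker
data:
  tts_entity: tts.openai_tts_living_room
  message: "Water is boiling"
  chime: false           # override the profile's default chime for this call
  normalize_audio: false # likewise skip loudness normalisation this time
```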
Targets media players directly, with per-call overrides for voice, speed, instructions, chime, normalise, volume and pause behaviour.
```yaml
action: openai_tts.say
target:
  entity_id: media_player.living_room_speaker
  # area_id: living_room
  # device_id: 12345abcde
data:
  tts_entity: tts.openai_tts_living_room
  message: "Dinner is ready"
  volume: 0.6           # snapshot and restore the speaker volume
  pause_playback: true  # pause music during the announcement
  chime: true           # prepend the configured chime
  normalize_audio: true # loudness-normalise for small speakers
  voice: nova
  speed: 1.0
  instructions: "Say it warmly"
  extra_payload: '{"temperature": 0.8}'
```

The integration works with any OpenAI-compatible TTS endpoint. When the URL is not api.openai.com:
- The API key field becomes optional.
- The voice field accepts any backend-specific name.
- Use the audio format selector to work around backends that reject mp3 (for example `pocket-tts` returning PCM).
- The extra payload field forwards backend-specific JSON parameters with the request.
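As an illustration of the extra payload field against a self-hosted backend (the voice name and JSON parameters below are hypothetical backend-specific examples, not options defined by this integration):

```yaml
action: openai_tts.say
target:
  entity_id: media_player.office_speaker
data:
  tts_entity: tts.openai_tts_selfhosted
  message: "Build finished"
  voice: my_custom_voice  # any backend-specific voice name is accepted
  extra_payload: '{"exaggeration": 0.5, "cfg_weight": 0.4}'  # forwarded as-is to the backend
```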
For OpenAI, an API key with available balance is required. Pricing: https://platform.openai.com/docs/pricing