The ElevenLabs adapter provides realtime conversational voice AI for TanStack AI. Unlike text-focused adapters, the ElevenLabs adapter is voice-focused: it integrates with TanStack AI's realtime system to enable voice-to-voice conversations. It does not support chat(), embedding(), or summarize().
ElevenLabs uses an agent-based architecture where you configure your conversational AI agent in the ElevenLabs dashboard (voice, personality, knowledge base, tools) and then connect to it at runtime. The adapter wraps the @11labs/client SDK for seamless integration with useRealtimeChat and RealtimeClient.

    npm install @tanstack/ai-elevenlabs
Peer dependencies:

    npm install @tanstack/ai @tanstack/ai-client
The server generates a signed WebSocket URL so your API key never reaches the client. The signed URL is valid for 30 minutes.

    import { realtimeToken } from '@tanstack/ai'
    import { elevenlabsRealtimeToken } from '@tanstack/ai-elevenlabs'

    // In your API route (Express, Hono, TanStack Start, etc.)
    export async function POST() {
      const token = await realtimeToken({
        adapter: elevenlabsRealtimeToken({
          agentId: process.env.ELEVENLABS_AGENT_ID!,
        }),
      })
      return Response.json(token)
    }
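Because the signed URL stays valid for 30 minutes, the client does not have to hit the token route on every reconnect. One option is a small cache around the token fetcher; the helper below is an illustrative sketch, not part of the adapter (the TTL and the injectable clock are assumptions, the clock existing mainly to make the behavior testable):

```typescript
// Illustrative sketch (not part of @tanstack/ai-elevenlabs): cache the
// fetched token and reuse it until shortly before the 30-minute
// signed-URL lifetime runs out, so reconnects skip the token route.
function cachedTokenFetcher<T>(
  fetchToken: () => Promise<T>,
  ttlMs: number = 25 * 60 * 1000, // refresh a little before the 30-minute expiry
  now: () => number = Date.now, // injectable clock, mainly for testing
): () => Promise<T> {
  let cached: { token: T; at: number } | undefined
  return async () => {
    if (cached && now() - cached.at < ttlMs) return cached.token
    cached = { token: await fetchToken(), at: now() }
    return cached.token
  }
}

// Usage sketch with useRealtimeChat or RealtimeClient:
// getToken: cachedTokenFetcher(() =>
//   fetch('/api/realtime-token', { method: 'POST' }).then((r) => r.json()),
// )
```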
You can override agent settings at token generation time without changing your dashboard configuration:

    const token = await realtimeToken({
      adapter: elevenlabsRealtimeToken({
        agentId: process.env.ELEVENLABS_AGENT_ID!,
        overrides: {
          voiceId: 'custom-voice-id',
          systemPrompt: 'You are a helpful voice assistant.',
          firstMessage: 'Hello! How can I help you today?',
          language: 'en',
        },
      }),
    })

    import { useRealtimeChat } from '@tanstack/ai-react'
    import { elevenlabsRealtime } from '@tanstack/ai-elevenlabs'

    function VoiceChat() {
      const {
        status,
        mode,
        messages,
        connect,
        disconnect,
        pendingUserTranscript,
        pendingAssistantTranscript,
        inputLevel,
        outputLevel,
      } = useRealtimeChat({
        getToken: () =>
          fetch('/api/realtime-token', { method: 'POST' }).then((r) => r.json()),
        adapter: elevenlabsRealtime(),
      })

      return (
        <div>
          <p>Status: {status}</p>
          <p>Mode: {mode}</p>
          <button onClick={status === 'idle' ? connect : disconnect}>
            {status === 'idle' ? 'Start Conversation' : 'End Conversation'}
          </button>
          {pendingUserTranscript && <p>You: {pendingUserTranscript}...</p>}
          {pendingAssistantTranscript && (
            <p>AI: {pendingAssistantTranscript}...</p>
          )}
          {messages.map((msg) => (
            <div key={msg.id}>
              <strong>{msg.role}:</strong>
              {msg.parts.map((part, i) => (
                <span key={i}>
                  {part.type === 'text' ? part.content : null}
                  {part.type === 'audio' ? part.transcript : null}
                </span>
              ))}
            </div>
          ))}
        </div>
      )
    }

    import { RealtimeClient } from '@tanstack/ai-client'
    import { elevenlabsRealtime } from '@tanstack/ai-elevenlabs'

    const client = new RealtimeClient({
      getToken: () =>
        fetch('/api/realtime-token', { method: 'POST' }).then((r) => r.json()),
      adapter: elevenlabsRealtime(),
      onMessage: (message) => {
        console.log(`${message.role}:`, message.parts)
      },
      onStatusChange: (status) => {
        console.log('Status:', status)
      },
      onModeChange: (mode) => {
        console.log('Mode:', mode)
      },
    })

    await client.connect()
ElevenLabs supports client-side tools that execute in the browser. Define tools using the standard toolDefinition() API:

    import { toolDefinition } from '@tanstack/ai'
    import { z } from 'zod'

    const getWeatherDef = toolDefinition({
      name: 'getWeather',
      description: 'Get weather for a location',
      inputSchema: z.object({
        location: z.string(),
      }),
      outputSchema: z.object({
        temperature: z.number(),
        conditions: z.string(),
      }),
    })

    const getWeather = getWeatherDef.client(async ({ location }) => {
      const res = await fetch(
        `/api/weather?location=${encodeURIComponent(location)}`,
      )
      return res.json()
    })

    // Pass tools to the hook
    const chat = useRealtimeChat({
      getToken: () =>
        fetch('/api/realtime-token', { method: 'POST' }).then((r) => r.json()),
      adapter: elevenlabsRealtime(),
      tools: [getWeather],
    })
Tool results are automatically serialized to strings and returned to the ElevenLabs agent. The adapter converts TanStack tool definitions into the @11labs/client clientTools format internally.
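The serialization step can be pictured with a small standalone sketch. The SimpleTool shape and toClientTools helper below are illustrative assumptions, not the adapter's actual internals; they only demonstrate the name-keyed, string-returning handler map that @11labs/client expects for client tools:

```typescript
// Illustrative sketch of the conversion (not the adapter's real code):
// each tool becomes a handler keyed by its name, and its result is
// coerced to a string before being handed back to the ElevenLabs agent.
interface SimpleTool {
  name: string
  handler: (params: Record<string, unknown>) => Promise<unknown>
}

type ClientToolMap = Record<
  string,
  (params: Record<string, unknown>) => Promise<string>
>

function toClientTools(tools: SimpleTool[]): ClientToolMap {
  const clientTools: ClientToolMap = {}
  for (const tool of tools) {
    clientTools[tool.name] = async (params) => {
      const result = await tool.handler(params)
      // Structured results are JSON-stringified; strings pass through as-is.
      return typeof result === 'string' ? result : JSON.stringify(result)
    }
  }
  return clientTools
}
```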
Used on the server to generate a signed WebSocket URL.
| Option | Type | Required | Description |
|---|---|---|---|
| agentId | string | Yes | Agent ID configured in the ElevenLabs dashboard |
| overrides.voiceId | string | No | Custom voice ID to override the agent's default voice |
| overrides.systemPrompt | string | No | Custom system prompt to override the agent's default |
| overrides.firstMessage | string | No | First message the agent speaks when the session starts |
| overrides.language | string | No | Language code (e.g., 'en', 'es', 'fr') |
Used on the client to establish the connection.
| Option | Type | Default | Description |
|---|---|---|---|
| connectionMode | 'websocket' \| 'webrtc' | auto-detect | Transport protocol for the connection |
| debug | boolean | false | Enable debug logging |
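As a usage sketch (option names taken from the table above), forcing the WebSocket transport with debug logging enabled would look like:

```typescript
import { elevenlabsRealtime } from '@tanstack/ai-elevenlabs'

// Pin the transport instead of auto-detecting, and enable debug logging.
const adapter = elevenlabsRealtime({
  connectionMode: 'websocket',
  debug: true,
})
```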
ElevenLabs and OpenAI take different approaches to realtime voice:
| | ElevenLabs | OpenAI |
|---|---|---|
| Configuration | Agent-based. Configure voice, personality, and knowledge in the ElevenLabs dashboard or via overrides at token time. | Session-based. Configure instructions, voice, temperature, etc. per session via useRealtimeChat options. |
| Token type | Signed WebSocket URL (valid 30 minutes) | Ephemeral API token (valid ~10 minutes) |
| Transport | WebSocket (default) or WebRTC | WebRTC |
| Audio handling | @11labs/client SDK manages audio capture and playback automatically | TanStack AI manages WebRTC peer connection and audio tracks |
| VAD | Handled by ElevenLabs server-side | Supports server, semantic, and manual modes |
| Runtime updates | Session config is set at creation time and cannot be changed mid-session | Supports updateSession() for mid-session config changes |
| Image input | Not supported | Supported via sendImage() |
| Time domain data | Not available from the SDK | Available for waveform visualizations |
The ElevenLabs adapter provides audio visualization data through the same interface as other realtime adapters:

    const {
      inputLevel, // 0-1 normalized microphone volume
      outputLevel, // 0-1 normalized speaker volume
      getInputFrequencyData, // Uint8Array frequency spectrum
      getOutputFrequencyData,
    } = useRealtimeChat({
      getToken: () =>
        fetch('/api/realtime-token', { method: 'POST' }).then((r) => r.json()),
      adapter: elevenlabsRealtime(),
    })
Note: ElevenLabs provides volume levels and frequency data but does not expose time-domain data. The getInputTimeDomainData() and getOutputTimeDomainData() methods return static placeholder arrays. The default audio sample rate is 16kHz.
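Since frequency data is available but time-domain data is not, custom visualizations should be built from frequency frames. The helper below is an illustrative sketch (not part of the adapter) that collapses a Uint8Array frame from getInputFrequencyData()/getOutputFrequencyData() into a single 0-1 value, on the same scale as inputLevel/outputLevel:

```typescript
// Illustrative helper (not part of the adapter): average a frequency
// frame (each bin is 0-255) into a normalized 0-1 level for a meter.
function levelFromFrequencyData(frame: Uint8Array): number {
  if (frame.length === 0) return 0
  let sum = 0
  for (let i = 0; i < frame.length; i++) sum += frame[i]
  return sum / (frame.length * 255)
}

// In a requestAnimationFrame loop you might poll the adapter each frame:
// const level = levelFromFrequencyData(getOutputFrequencyData())
// meterEl.style.width = `${level * 100}%`
```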
Set these in your server environment:

    ELEVENLABS_API_KEY=your-elevenlabs-api-key
    ELEVENLABS_AGENT_ID=your-agent-id
| Variable | Required | Description |
|---|---|---|
| ELEVENLABS_API_KEY | Yes | Your ElevenLabs API key, used server-side for generating signed URLs |
| ELEVENLABS_AGENT_ID | No | Default agent ID. Can also be passed directly to elevenlabsRealtimeToken() |
Get your API key from the ElevenLabs dashboard. Create and configure agents in the Conversational AI section of the dashboard.
Creates an ElevenLabs realtime token adapter for server-side use with realtimeToken().
Parameters:
Returns: A RealtimeTokenAdapter for use with realtimeToken().
Creates an ElevenLabs realtime client adapter for use with useRealtimeChat or RealtimeClient.
Parameters:
Returns: A RealtimeAdapter for use with useRealtimeChat() or RealtimeClient.