All in one backend for creating realtime AI. No deployment required.

Gabber is a hosted end-to-end backend for designing and deploying realtime AI apps that can talk, text, and see. AI voice and video input, output, and orchestration. Simplified.

Building Blocks with SDKs for all platforms and use cases

Build Your AI

High Definition Text-to-Speech

Emotive. Low-latency. High Quality. $1/hr.

Text-to-Speech

$1/hr

No hidden fees

Low-Latency, High-Quality TTS

Real-time text-to-speech with minimal delay for instant voice generation

Try it Live

Experience our low-latency TTS in real-time. Type a message and hear it come to life instantly.

Give Your AI Eyes

Realtime video input into your AI to create multi-modal AI experiences that can see the world and your users

AI that can see

Sub-100ms frame processing latency
Object detection and classification
Facial expression recognition
Gesture and movement tracking
Scene understanding
Works for every use case
👁️‍🗨️

Interactive Video Demo

Book a demo

Use Cases

Mutli-modal Companions

Mutli-modal Companions

Create experiences where the AI can talk, text, and watch the user, a movie, videogame, or something else. Let your AI respond to the world and your users

Trainers and Coaches

Trainers and Coaches

AI trainers that visually track your form, movements, and progress - providing real-time guidance and motivation through voice coaching and visual analysis

Kids and Toys

Kids and Toys

Interactive AI toys that combine voice, vision, and play - watching children's activities and responding with adaptive games and conversations

Tour Guides & Storytellers

Tour Guides & Storytellers

AI tour guides that see you and your environment to create personalized narratives and stories about your surroundings through engaging speech

Gaming NPCs

Gaming NPCs

AI characters that can see and speak to players in your game, providing a deeper level of immersion and interaction

The Age Of Backend Is Over

Wrap your API key in a user-linked token to easily track usage and safely make API calls. Integrate SDKs directly on the frontend.

Generate User Token

const token = await generateToken({
  user_id: "[email protected]",
  limits: {
    voice_seconds: 100,
    tokens: 10000
  }
});

SDK and API Integration

const { api } = useApi();

api.realtime.startRealtimeSession({
  usageToken: token
});

<RealtimeSessionEngineProvider>
  <VoiceChat />
</RealtimeSessionEngineProvider>

Track Usage

const usage = await getUsage();
console.log(usage);
// {
//   voice_minutes: 45,
//   messages: 328
// }

Integrate in Minutes

Build voice-enabled applications across any platform

Coming soon: Unity and C/ESP32 SDKs

Web/React SDK Integration

"use client"

import { RealtimeSessionEngineProvider } from "gabber-client-react"

import { useCallback, useState } from "react";


// Voice and LLM configuration

const GABBER_VOICE = "626c3b02-2d2a-4a93-b3e7-be35fd2b95cd"

const GABBER_LLM = "21892bb9-9809-4b6f-8c3e-e40093069f04"


export default function Home() {

  const [details, setDetails] = useState(undefined);


  const generateConnectionDetails = useCallback(async () => {

    const { token, context: contextId } = await createTokenAndContext()

    setDetails({ token, context: contextId })

  }, [])


  if (!details) {

    return (

      <button onClick={() => generateConnectionDetails()}>Start Session</button>

    );

  }


  return (

    <RealtimeSessionEngineProvider connectionOpts={{

      token: details.token,

      config: {

        generative: {

          llm: GABBER_LLM,

          voice_override: GABBER_VOICE,

          context: details.context,

        },

        output: {

          speech_synthesis_enabled: true

        }

      }

    }}>

      <App />

    </RealtimeSessionEngineProvider>

  );

}

Ready to Get Your Realtime AI App Live?

Get started with Gabber and have your backend done in minutes.