@pipecat-ai/openai-realtime-webrtc-transport

Package Overview

Dependencies

Maintainers

Versions

Alerts

File Explorer

Advanced tools

License

Install Socket

Detect and block malicious and high-risk dependencies

Install

@pipecat-ai/openai-realtime-webrtc-transport

Pipecat OpenAI RealTime Transport Package

latest

Source

npm

Version: 1.2.0

Version published: 2 weeks ago

Maintainers: 7

Created: 9 months ago

Source

OpenAI RealTime WebRTC Transport

NPM Version

A real-time websocket transport implementation for interacting with Google's Gemini Multimodal Live API, supporting bidirectional audio and unidirectional text communication.

Installation

npm install \
@pipecat-ai/client-js \
@pipecat-ai/openai-realtime-webrtc-transport

Overview

The OpenAIRealTimeWebRTCTransport is a fully functional Pipecat Transport. It provides a framework for implementing real-time communication directly with the OpenAI Realtime API using WebRTC voice-to-voice service. It handles media device management, audio/video streams, and state management for the connection.

Features

Real-time bidirectional communication with OpenAI Realtime API
Input device management
Audio streaming support
Text message support
Automatic reconnection handling
Configurable generation parameters
Support for initial conversation context

Usage

Basic Setup

import { OpenAIRealTimeWebRTCTransport, OpenAIServiceOptions } from '@pipecat-ai/openai-realtime-webrtc-transport';

const options: OpenAIServiceOptions = {
  api_key: 'YOUR_API_KEY',
  session_config: {
    instructions: 'you are a confused jellyfish',
  }
};

let PipecatConfig: PipecatClientOptions = {
  transport: new OpenAIRealTimeWebRTCTransport(options),
  ...
};

Configuration Options

/**********************************
 * OpenAI-specific types
 *   types and comments below are based on:
 *     gpt-4o-realtime-preview-2024-12-17
 **********************************/
type JSONSchema = { [key: string]: any };
export type OpenAIFunctionTool = {
  type: "function";
  name: string;
  description: string;
  parameters: JSONSchema;
};

export type OpenAIServerVad = {
  type: "server_vad";
  create_response?: boolean; // defaults to true
  interrupt_response?: boolean; // defaults to true
  prefix_padding_ms?: number; // defaults to 300ms
  silence_duration_ms?: number; // defaults to 500ms
  threshold?: number; // range (0.0, 1.0); defaults to 0.5
};

export type OpenAISemanticVAD = {
  type: "semantic_vad";
  eagerness?: "low" | "medium" | "high" | "auto"; // defaults to "auto", equivalent to "medium"
  create_response?: boolean; // defaults to true
  interrupt_response?: boolean; // defaults to true
};

export type OpenAISessionConfig = Partial<{
  modalities?: string;
  instructions?: string;
  voice?:
    | "alloy"
    | "ash"
    | "ballad"
    | "coral"
    | "echo"
    | "sage"
    | "shimmer"
    | "verse";
  input_audio_noise_reduction?: {
    type: "near_field" | "far_field";
  } | null; // defaults to null/off
  input_audio_transcription?: {
    model: "whisper-1" | "gpt-4o-transcribe" | "gpt-4o-mini-transcribe";
    language?: string;
    prompt?: string[] | string; // gpt-4o models take a string
  } | null; // we default this to gpt-4o-transcribe
  turn_detection?: OpenAIServerVad | OpenAISemanticVAD | null; // defaults to server_vad
  temperature?: number;
  max_tokens?: number | "inf";
  tools?: Array<OpenAIFunctionTool>;
}>;

export interface OpenAIServiceOptions {
  api_key: string;
  model?: string;
  initial_messages?: LLMContextMessage[];
  settings?: OpenAISessionConfig;
}

Sending Messages

// at setup time...
pcClient.appendToContext({ role: "user", content: 'Hello OpenAI!' });

Handling Events

The transport implements the various Pipecat event handlers. Check out the docs or samples for more info.

Updating Session Configuration

pcClient.transport.updateSessionConfig({
  instructions: 'you are a an over-sharing neighbor',
  input_audio_noise_reduction: {
    type: 'near_field'
  }
});

API Reference

Methods

initialize(): Set up the transport and establish connection
sendMessage(message): Send a text message
handleUserAudioStream(data): Stream audio data to the model
disconnectLLM(): Close the connection
sendReadyMessage(): Signal ready state

States

The transport can be in one of the following states:

"disconnected"
"initializing"
"initialized"
"connecting"
"connected"
"ready"
"disconnecting
"error"

Error Handling

The transport includes comprehensive error handling for:

Connection failures
WebRTC connection errors
API key validation
Message transmission errors

License

BSD-2 Clause

FAQs

What is @pipecat-ai/openai-realtime-webrtc-transport?

Is @pipecat-ai/openai-realtime-webrtc-transport well maintained?

Package last updated on 12 Sep 2025

Did you know?

Socket for GitHub automatically highlights issues in each pull request and monitors the health of all your open source dependencies. Discover the contents of your packages and block harmful activity before you install or update your dependencies.

Install

@pipecat-ai/openai-realtime-webrtc-transport

OpenAI RealTime WebRTC Transport

Installation

Overview

Features

Usage

Basic Setup

Configuration Options

Sending Messages

Handling Events

Updating Session Configuration

API Reference

Methods

States

Error Handling

License

Related posts

Malicious fezbox npm Package Steals Browser Passwords from Cookies via Innovative QR Code Steganographic Technique

Identifying and Preventing Fraudulent Engineering Candidates: An Investigation into 80 Confirmed Cases