Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

open-gitagent/voice

Folders and files

NameName
Last commit message
Last commit date

Latest commit

History

1 Commit

Repository files navigation

@open-gitagent/voice

Voice mode + web UI for @open-gitagent/gitagent.

This package was split out of @open-gitagent/gitagent in core v2.0.0 so the CLI/SDK tarball stays slim and supply-chain scanners stop blocking it. The voice runtime (OpenAI Realtime + Gemini Live adapters, the file browser, the Composio toolkit bridge, the scheduler UI, the 200 KB single-file web UI) all live here.

Install

# Both packages — voice mode works out of the box
npm install -g @open-gitagent/gitagent @open-gitagent/voice
# Then launch with voice:
gitagent --voice -d ~/assistant

The gitagent CLI ships in @open-gitagent/gitagent. --voice dynamically loads this package; without it installed, the CLI prints an install hint and exits.

What's inside

  • startVoiceServer(opts) — HTTP + WebSocket server (default :3333) with the web UI, file viewer, settings tab, skills/integrations/scheduler tabs.
  • OpenAI Realtime adapter — GA endpoint, GA session.update shape, GA event names.
  • Gemini Live adapter — Google's Live API.
  • Composio bridge — toolkit browser and action execution from the UI.

Programmatic use (SDK)

import { startVoiceServer } from "@open-gitagent/voice";
const stop = await startVoiceServer({
 adapter: "openai-realtime",
 adapterConfig: { apiKey: process.env.OPENAI_API_KEY! },
 agentDir: "/path/to/agent",
});
// ...later
await stop();

startVoiceServer uses @open-gitagent/gitagent (peer dep) for the agent loader, SDK query(), the message-type protocol, and the standalone scheduler. Versions are decoupled — voice declares ^2.0.0 on core and follows core's semver.

Why it's split

The published @open-gitagent/gitagent@1.5.x tarball was being blocked with HTTP 403 by supply-chain scanners. The trigger was the bundled dist/voice/ui.html (~3,860 LOC of inline HTML/JS/CSS) plus the unused baileys dependency. Splitting voice out:

  • core tarball: 179.5 kB → 85.7 kB packed (−52%), 310 → 211 deps
  • voice tarball: ~150 kB packed (separate install, you opt in)

License

MIT.

About

Voice mode + web UI for @open-gitagent/gitagent. OpenAI Realtime + Gemini Live + Composio + the bundled :3333 UI. Optional install split from gitagent core in 2.0.0 so the slim core isn't supply-chain-blocked.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

Contributors

AltStyle によって変換されたページ (->オリジナル) /