llm-cost-optimization

Here are 9 public repositories matching this topic...

Language: All

Filter by language

All 9 Python 4 HTML 2 Go 1 JavaScript 1 Rust 1

isartor-ai / Isartor

Open-source Prompt Firewall — deflect up to 95% of redundant LLM traffic before it leaves your infrastructure. Documents: https://isartor-ai.github.io/Isartor/index.html

rust open-source inference self-hosted openai candle huggingface air-gapped llm anthropic ai-gateway agentic-ai prompt-firewall llm-cost-optimization sematic-cache

Updated Jun 3, 2026
Rust

kirder24-code / ai-agent-manager

Star 7

Free local CLI that estimates, hard-caps, and losslessly compresses the cost of AI coding agents. Delta-encodes re-read files (37.9% proven on a real OpenAI call). MIT, 100% local.

openai developer-tools cost-control ai-agents claude cost-optimization agent-management llm llm-observability llm-cost developer-tools-ai-agent token-budget cost-optimization-cloud-devops llm-cost-optimization cost-control-management-system

Updated Jun 10, 2026
JavaScript

Ismail-2001 / mcp-token-auditor

Star 5

A high-performance, multi-agent observability engine designed for the Model Context Protocol (MCP). It provides a non-blocking, transparent proxy layer that implements deterministic token attribution, real-time context-window alerting, and heuristic-driven static analysis to optimize LLM metadata overhead at scale.

python mcp proxy-server observability fastapi agentic-ai token-counting llm-cost-optimization

Updated Mar 25, 2026
Python

joshua-burnell-1 / prompt-squeeze

Star 1

Builds prompt-cost discipline into Claude Code: measure every prompt, surface avoidable spend, report org-wide. Skill + UserPromptSubmit hook + MCP rollup with cited dollar/Wh/CO2e receipts.

python developer-tools green-software ai-observability sustainable-ai prompt-engineering anthropic prompt-optimization mcp-server claude-code token-counting claude-code-plugin llm-cost-control llm-cost-optimization

Updated May 5, 2026
Python

brainsparker / frugal

Star 1

Cost-optimized MCP server and proxy router for Claude, Cursor, and any agent. Picks the cheapest model + toolchain per use case. Drop-in, Go binary, BUSL-1.1.