one router · every AI tool
Keep your tools. Cut the bill.
hibrid slides underneath Claude Code, Gemini CLI, aider and Copilot and quietly sends each call to the cheapest model that can do the job — Opus when it matters, Haiku when it's plenty, a model on your own machine for everything in between. Your tools don't change. Your token bill does.
# one local-first router, OpenAI- and Anthropic-compatible pip install git+https://github.com/vfalbor/hibrid.git hibrid serve curl localhost:8095/v1/node # what it knows about your machine
Drop it under the tools you already use
hibrid speaks both the OpenAI and the Anthropic dialect. Point a tool's base URL at it and nothing else in your setup moves — it just starts spending less.
Claude Code Anthropic API
export ANTHROPIC_BASE_URL=\ https://hibrid.tokenstree.eu claude
aider · Copilot-style OpenAI API
export OPENAI_BASE_URL=\ https://hibrid.tokenstree.eu/v1 aider
Gemini CLI & others OpenAI mode
any tool with an OpenAI-compatible mode points here too
This node, live
The instance serving this page profiled its own hardware and benchmarked its models on boot. Measured numbers, not a brochure.
How it decides
Three moves, all automatic. A tool can declare the task type; if it doesn't, hibrid reads it. Every response tells you where it actually ran.
It measures your machine
Detects RAM, VRAM and chip, then micro-benchmarks the real tokens/sec of your local models. It doesn't guess from a spec sheet — it times your hardware.
It keeps loops local
A refine-and-retest loop is hundreds of cheap calls. hibrid runs them on a local model and spends a paid token only on the one final check that earns it.
It guards your data
Spot an email, a key, an ID, and the request is pinned to local — a rule, not a checkbox. Your text never reaches a third party.
Why nothing else does this
Cloud routers can't see your hardware. Local apps won't reach the cloud on their own. hibrid is the part in the middle.
| Cloud routers | Local apps | hibrid | |
|---|---|---|---|
| Routes by task | yes | manual | yes |
| Knows your hardware | no | hints | measures it |
| Local + cloud, automatic | no | by hand | yes |
| Private data stays local | no | partial | enforced |
| Sits under your existing tools | some | no | yes |