Agent Router · Cost & Token Visibility

See exactly what AI costs you.

Agent Router gives finance, platform, and engineering one dashboard to attribute every token to a person, team, or application, across your full agent network. Drive adoption. Prove ROI. Stop the surprise bill.

  • Attribute every token to the authenticated user, team, app, or agent chain. Not just an API key.
  • Enforce budgets inline at the gateway. Over-budget calls are blocked before the bill. Not flagged after.
  • Live in 30 minutes on the Envoy-based gateway your platform team already trusts.

Two questions your CFO is already asking

If you can't answer these, AI spend gets cut next quarter.

The adoption mandate

We need AI usage metrics to show the board the investment is working.

Headcount and budget were approved on the assumption that AI makes teams more productive. If teams aren't using AI, they aren't more productive. See who's using AI, then push the ones who aren't.

The ROI mandate

We need AI spend tied to real outcomes, not just activity.

Knowing who is spending tokens isn't enough. Understanding why tokens are being spent, and which workflows generate business value, takes data your billing report doesn't have.

Attribution, four ways

Segment spend by person, team, application, or any combination.

Most cost tools show you spend by provider. That's a billing report, not an operational signal. Agent Router attributes every token to the caller. Because agents call other agents, attribution follows the full chain.

By person

Every request carries the authenticated identity of the developer or service account. See exactly which engineers are using AI, and who hasn't made a single request this month.

By team

Roll usage up to the team level for showback and chargeback. See which teams are on track for adoption targets and which are heading toward a budget overrun, before the bill arrives.

By application

Attribute spend to the workflow that consumed it. When three teams share one agent platform, see which application drove which spend, not just which API key was used.

Any combination

"Show me all spend by the risk team on the compliance-check app, attributed to individual engineers." Every request carries user, team, application, and agent-chain context. Filter from the dashboard, no custom reporting required.
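
As a rough illustration of a combined filter like the one quoted above, the sketch below groups spend by one dimension while filtering on others. The UsageRecord fields and the spend_by helper are hypothetical names chosen for this example, not Agent Router's actual schema or API.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class UsageRecord:
    # Hypothetical per-request record; field names are assumptions.
    user: str
    team: str
    app: str
    cost_usd: float


def spend_by(records, group_key, **filters):
    """Sum cost per `group_key` over records that match every filter."""
    totals = defaultdict(float)
    for record in records:
        if all(getattr(record, k) == v for k, v in filters.items()):
            totals[getattr(record, group_key)] += record.cost_usd
    return dict(totals)


# Made-up sample data for the illustration.
records = [
    UsageRecord("alice", "risk", "compliance-check", 0.50),
    UsageRecord("alice", "risk", "compliance-check", 0.25),
    UsageRecord("bob", "risk", "compliance-check", 0.25),
    UsageRecord("bob", "risk", "fraud-triage", 1.00),
    UsageRecord("carol", "platform", "compliance-check", 2.00),
]

# Spend by the risk team on the compliance-check app, per engineer.
per_engineer = spend_by(records, "user", team="risk", app="compliance-check")
```

Because each record carries every dimension, the same helper answers "by team" or "by app" questions by changing group_key and the filters.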

The dashboard

One pane of glass for every token.

  • Drill from organization → team → application → individual user in two clicks.
  • Per-team token budgets that block calls at the gateway rather than flagging them in a report.
  • Compare cost per workflow across models & providers. Same prompt, two prices.
  • Export to your data warehouse (Snowflake, BigQuery, S3) for finance reconciliation.
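
To make "block at the gateway, not flag in a report" concrete, here is a minimal sketch of an inline budget check that runs before a call is forwarded. TeamBudget, BudgetExceeded, and forward are hypothetical names for this illustration, and the token estimate is assumed to be known up front; this is not a description of how Agent Router implements enforcement.

```python
class BudgetExceeded(Exception):
    """Raised when a call would push a team past its token budget."""


class TeamBudget:
    def __init__(self, limits):
        # limits: mapping of team name -> token budget for the period.
        self.limits = dict(limits)
        self.used = {team: 0 for team in self.limits}

    def authorize(self, team, estimated_tokens):
        """Reserve tokens for a call, or raise before anything is sent."""
        if self.used[team] + estimated_tokens > self.limits[team]:
            raise BudgetExceeded(f"{team} would exceed {self.limits[team]} tokens")
        self.used[team] += estimated_tokens


def forward(budget, team, estimated_tokens, send):
    """Check the budget first; only call the provider if it passes."""
    budget.authorize(team, estimated_tokens)
    return send()
```

The point of the design is ordering: the check happens before send() runs, so an over-budget request never reaches the provider and never appears on the bill.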

Out of the box

The numbers you bring to leadership.

Adoption. Spend. ROI. Three boards, ready the day you turn on Agent Router. No custom dashboards to build, no analytics team to staff.

Adoption

  • % of developers with active token usage this month
  • Teams below adoption target, with count and token gap
  • Week-over-week usage trend by team
  • Model diversity: are teams picking capable models or defaulting to the cheapest?

Spend

  • Total spend by team, attributed (not just by API key)
  • Budget overruns flagged before the bill arrives
  • Cost per workflow or application, not per model call
  • Over-budget incidents blocked inline at the gateway

ROI signal

  • Cost per agent workflow, tied to a unit of business output
  • Provider comparison: same model, different provider price
  • Wasteful patterns: retry loops, oversized prompts, dead chains
  • Efficiency trend: are teams getting smarter about tokens over time?
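
The arithmetic behind the first two ROI signals can be sketched as follows. The per-million-token prices, token counts, and outcome count are made-up numbers for illustration, and both function names are hypothetical.

```python
def cost_usd(input_tokens, output_tokens, price_in_per_m, price_out_per_m):
    """Cost of a workload given per-million-token input and output prices."""
    return (input_tokens * price_in_per_m + output_tokens * price_out_per_m) / 1_000_000


def cost_per_outcome(total_cost_usd, outcomes):
    """Cost per unit of business output; None when nothing was produced."""
    if outcomes == 0:
        return None
    return total_cost_usd / outcomes


# Same workload priced at two hypothetical providers.
workload = {"input_tokens": 2_000_000, "output_tokens": 500_000}
provider_a = cost_usd(**workload, price_in_per_m=3.0, price_out_per_m=15.0)
provider_b = cost_usd(**workload, price_in_per_m=2.0, price_out_per_m=10.0)

# Tie spend to output, e.g. 450 completed compliance checks (assumed figure).
per_check = cost_per_outcome(provider_a, 450)
```

Dividing by a business unit (checks completed, tickets resolved) rather than by model call is what turns a spend figure into an ROI signal.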

How it works

Three steps from “we don't know” to “we know exactly.”

Route traffic through Agent Router.

A drop-in LLM gateway on the Envoy proxy your platform team already runs. One configuration change at the SDK or load-balancer level. No app rewrites.

Attribute every token.

Each call carries identity, team, app, and agent-chain context. Attribution follows agent-to-agent calls so the workflow that started the spend is the workflow that owns it.
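
One way to picture chain-following attribution: the originating context travels with every downstream agent call, and tokens are charged to the workflow at the root. The sketch below assumes a hypothetical CallContext type and record helper; the names and fields are illustrative, not Agent Router's data model.

```python
from collections import Counter
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class CallContext:
    user: str
    team: str
    app: str             # workflow that originated the request
    chain: tuple = ()    # agents traversed so far

    def hop(self, agent):
        """Context for a downstream agent call; origin fields stay unchanged."""
        return replace(self, chain=self.chain + (agent,))


def record(ledger, ctx, tokens):
    # Charge tokens to the originating app, however deep the chain is.
    ledger[ctx.app] += tokens


ledger = Counter()
root = CallContext("alice", "risk", "compliance-check")
planner = root.hop("planner")
retriever = planner.hop("retriever")

record(ledger, planner, 1200)
record(ledger, retriever, 300)
```

Here the retriever's 300 tokens land on compliance-check even though the retriever was invoked by another agent, which matches the idea that the workflow that started the spend owns it.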

Govern, report, repeat.

Set per-team budgets that enforce inline. Stream the attribution to your data warehouse. Bring real numbers to your next board review, without the analytics backlog.

Get a personal tour. Bring your own workload.

30 minutes with a Tetrate engineer. We'll wire Agent Router to a sample of your traffic and show you the attribution, budgets, and ROI dashboards on your own data.

  • 30 minutes, no sales pitch
  • Bring your stack, any LLM provider
  • NDAs available on request