Live Market Data

LLM Pricing — Live Rates

Token prices tracked across major providers, updated every 6 hours from OpenRouter and LiteLLM. The rates Tally uses when computing cost estimates.

Active Models

Per million tokens (MTok)
Loading live rates…

Where do these prices come from?

Tally fetches prices every 6 hours from two sources: OpenRouter's /models API and LiteLLM's community-maintained price file. The displayed rate is the blended average.

How does Tally use them?

Every routing decision and telemetry event uses the live rates to compute a cost estimate. The bandit's reward function weighs cost against quality — cheaper models win when quality holds up.

What does "agreement" mean?

When both sources return a price, Tally compares them. A green check means they agree within 10%. An amber warning means a significant discrepancy — worth double-checking before budgeting.

Route to the cheapest model that meets your quality bar.

Tally handles the selection. You handle the work.

Next up

MCP Market

Public MCP servers tracked in the Tally ecosystem — usage data, reliability, and more.