Model usage and scheduled jobs, with 7-day and 30-day views.
Generated 2026-05-27
Figures and costs on this page are best-effort estimates from the machine that built it.
Per-day token totals for sessions (agent JSONL) and cron job runs (runs store), aligned on UTC calendar days. The chart is inline SVG (no network requests); its Y-axis is linear from 0 to the largest single-day total (model or cron). The table has exact counts.
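As a rough illustration of the bucketing and axis scaling described above, the sketch below groups token counts by UTC calendar day and maps a value onto a linear axis that starts at 0. The function names (`utc_day`, `daily_totals`, `y_pixel`), the `(timestamp, tokens)` record shape, and the sample numbers are assumptions made for this example, not the page generator's actual code.

```python
from collections import defaultdict
from datetime import datetime, timezone


def utc_day(ts: str) -> str:
    """Return the UTC calendar day (YYYY-MM-DD) for an ISO 8601 timestamp."""
    dt = datetime.fromisoformat(ts.replace("Z", "+00:00"))
    if dt.tzinfo is None:
        # Assumption: timestamps without an offset are already in UTC.
        dt = dt.replace(tzinfo=timezone.utc)
    return dt.astimezone(timezone.utc).date().isoformat()


def daily_totals(records):
    """Sum token counts per UTC day from (timestamp, tokens) pairs."""
    totals = defaultdict(int)
    for ts, tokens in records:
        totals[utc_day(ts)] += tokens
    return dict(totals)


def y_pixel(value, y_max, height):
    """Map a value to an SVG y coordinate on a linear axis from 0 to y_max.

    SVG y grows downward, so 0 maps to the bottom edge (y == height).
    """
    if y_max <= 0:
        return float(height)
    return height - (value / y_max) * height


# Hypothetical sample data, not taken from the report.
model_days = daily_totals([
    ("2026-05-26T23:30:00-02:00", 100),  # falls on 2026-05-27 in UTC
    ("2026-05-27T08:00:00Z", 50),
    ("2026-05-26T10:00:00Z", 30),
])
cron_days = daily_totals([("2026-05-27T03:00:00Z", 200)])

# One shared axis: the largest single-day total across both series.
y_max = max(list(model_days.values()) + list(cron_days.values()))
```

Sharing one `y_max` across both series keeps the model and cron bars visually comparable, at the cost of flattening whichever series is much smaller.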
Source: [redacted path]
Non-xAI token columns come from session logs whose timestamps parse and fall inside the window. xAI tokens and $/M rates come from the invoice preview (billing cycle); xAI cost comes from POST …/usage. Configure credentials locally: [REDACTED_AUTH] / XAI_TEAM_ID.
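The windowing rule for the non-xAI columns could look roughly like the sketch below: records whose timestamp does not parse are skipped, and the rest are counted only if they fall inside the window. The JSONL field names (`timestamp`, `model`, `prompt_tokens`, `completion_tokens`) and the helper names are hypothetical; the real session log schema is not shown on this page.

```python
import json
from datetime import datetime, timedelta, timezone


def parse_ts(value):
    """Return an aware UTC datetime, or None when the value cannot be parsed."""
    try:
        dt = datetime.fromisoformat(str(value).replace("Z", "+00:00"))
    except ValueError:
        return None
    if dt.tzinfo is None:
        # Assumption: timestamps without an offset are already in UTC.
        dt = dt.replace(tzinfo=timezone.utc)
    return dt.astimezone(timezone.utc)


def window_usage(jsonl_lines, now, days):
    """Aggregate per-model requests and tokens for the last `days` days."""
    start = now - timedelta(days=days)
    usage = {}
    for line in jsonl_lines:
        try:
            rec = json.loads(line)
        except json.JSONDecodeError:
            continue  # not valid JSON: skip the line
        if not isinstance(rec, dict):
            continue
        ts = parse_ts(rec.get("timestamp"))
        if ts is None or not (start <= ts <= now):
            continue  # unparseable timestamp or outside the window
        row = usage.setdefault(
            rec.get("model", "unknown"),
            {"requests": 0, "prompt": 0, "completion": 0},
        )
        row["requests"] += 1
        row["prompt"] += int(rec.get("prompt_tokens", 0))
        row["completion"] += int(rec.get("completion_tokens", 0))
    return usage
```

Calling `window_usage(lines, now, 7)` and `window_usage(lines, now, 30)` on the same log lines would give the two views; skipping unparseable records silently means the totals can undercount, which fits the page's "best-effort" caveat.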
7-day view

| Model | Provider | Input ($/M tokens) | Output ($/M tokens) | Requests | Prompt Tokens | Completion Tokens | Total Tokens | Est. Cost (USD) |
|---|---|---|---|---|---|---|---|---|
| deepseek/deepseek-chat deepseek-chat (V3.2). Cache hit: $0.07/M. Max 8K output. | DeepSeek | $0.27 | $1.10 | 90 | 2,134,623 | 18,646 | 2,153,269 | $0.1444 |
| deepseek/deepseek-reasoner deepseek-reasoner (R1). Thinking tokens count toward output cost. | DeepSeek | $0.55 | $2.19 | 11 | 449,689 | 6,279 | 455,968 | $0.0288 |
| google/gemini-2.5-flash Standard context (<200k). Thinking-mode output is $3.50/M. | Google AI Studio | - | - | 136 | 1,035,001 | 5,768 | 814,779 | $0.0000 |
| xai/grok-4-1-fast-reasoning Grok 4.1 Fast (reasoning + non-reasoning). 2M token context. | xAI | $0.20 | $0.50 | 10 | 0 | 0 | 0 | $0.0000 |
| nvidia/minimax-m2.7 MiniMax official pay-as-you-go pricing for M2.7 standard. | MiniMax via NVIDIA NIM | - | - | 14 | 0 | 0 | 0 | $0.0000 |
| nvidia/deepseek-v4-pro DeepSeek V4-Pro: 1.6T params, 49B active. NVIDIA NIM endpoint. | NVIDIA NIM (DeepSeek V4 Pro) | - | - | 0 | 0 | 0 | 0 | $0.0000 |
| nvidia/deepseek-v4-flash DeepSeek published pricing for V4-Flash. NVIDIA NIM endpoint. | NVIDIA NIM (DeepSeek V4 Flash) | - | - | 14 | 0 | 0 | 0 | $0.0000 |
| groq-main/llama-3.3-70b-versatile Groq LPU inference. 128K context, up to 33K output. | Groq | - | - | 40 | 0 | 0 | 0 | $0.0000 |
| cerebras-8b/llama-3.1-8b Cerebras Llama 3.1 8B. Very fast inference on Cerebras hardware. | Cerebras | - | - | 0 | 0 | 0 | 0 | $0.0000 |
| TOTAL | | | | 315 | 3,619,313 | 30,693 | 3,424,016 | $0.1732 |

30-day view

| Model | Provider | Input ($/M tokens) | Output ($/M tokens) | Requests | Prompt Tokens | Completion Tokens | Total Tokens | Est. Cost (USD) |
|---|---|---|---|---|---|---|---|---|
| deepseek/deepseek-chat deepseek-chat (V3.2). Cache hit: $0.07/M. Max 8K output. | DeepSeek | $0.27 | $1.10 | 797 | 24,197,973 | 192,982 | 24,390,955 | $1.1791 |
| deepseek/deepseek-reasoner deepseek-reasoner (R1). Thinking tokens count toward output cost. | DeepSeek | $0.55 | $2.19 | 47 | 1,975,851 | 23,766 | 1,999,617 | $0.1189 |
| google/gemini-2.5-flash Standard context (<200k). Thinking-mode output is $3.50/M. | Google AI Studio | - | - | 335 | 2,607,172 | 14,626 | 2,026,795 | $0.0000 |
| xai/grok-4-1-fast-reasoning Grok 4.1 Fast (reasoning + non-reasoning). 2M token context. | xAI | $0.20 | $0.50 | 128 | 0 | 0 | 0 | $0.0000 |
| nvidia/minimax-m2.7 MiniMax official pay-as-you-go pricing for M2.7 standard. | MiniMax via NVIDIA NIM | - | - | 43 | 0 | 0 | 0 | $0.0000 |
| nvidia/deepseek-v4-pro DeepSeek V4-Pro: 1.6T params, 49B active. NVIDIA NIM endpoint. | NVIDIA NIM (DeepSeek V4 Pro) | - | - | 0 | 0 | 0 | 0 | $0.0000 |
| nvidia/deepseek-v4-flash DeepSeek published pricing for V4-Flash. NVIDIA NIM endpoint. | NVIDIA NIM (DeepSeek V4 Flash) | - | - | 44 | 0 | 0 | 0 | $0.0000 |
| groq-main/llama-3.3-70b-versatile Groq LPU inference. 128K context, up to 33K output. | Groq | - | - | 109 | 0 | 0 | 0 | $0.0000 |
| cerebras-8b/llama-3.1-8b Cerebras Llama 3.1 8B. Very fast inference on Cerebras hardware. | Cerebras | - | - | 4 | 0 | 0 | 0 | $0.0000 |
| TOTAL | | | | 1,507 | 28,780,996 | 231,374 | 28,417,367 | $1.2980 |