Token Usage & Costs

instatollm tracks token consumption and estimated cost for every analysis call.

How Gemini charges for video

Gemini tokenizes video content at approximately the following rates:

Content type	Tokens per second
Video (visual frames)	~263 tokens/sec
Audio	~32 tokens/sec
Total (video + audio)	~295 tokens/sec

Each call also includes the text prompt (~400 tokens) as input and the model's JSON response (~800–1,500 tokens) as output.

Example: 30-second Reel

Component	Tokens
Video content (30s × 295)	8,850
Text prompt	~400
Total input (prompt_tokens)	~9,250
JSON response (output_tokens)	~1,000
Grand total	~10,250
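
The arithmetic in the table above can be sketched in a few lines of Python. This is a rough estimator built only on the approximate rates quoted on this page. The function name `estimate_tokens` and its default prompt and response sizes are illustrative assumptions, not part of instatollm.

```python
# Approximate rates from the table above; real tokenization may differ.
VIDEO_TOKENS_PER_SEC = 263
AUDIO_TOKENS_PER_SEC = 32


def estimate_tokens(duration_sec, prompt_tokens=400, output_tokens=1000):
    """Return a rough token estimate for analyzing one video.

    prompt_tokens and output_tokens default to the illustrative values
    used in the 30-second example; adjust them for your own prompts.
    """
    media_tokens = duration_sec * (VIDEO_TOKENS_PER_SEC + AUDIO_TOKENS_PER_SEC)
    input_tokens = media_tokens + prompt_tokens
    return {
        "media_tokens": media_tokens,
        "input_tokens": input_tokens,
        "output_tokens": output_tokens,
        "total_tokens": input_tokens + output_tokens,
    }


# 30-second Reel: 8,850 media tokens, 9,250 input tokens, 10,250 in total.
print(estimate_tokens(30))
```

Because the media term scales linearly with duration while the prompt and response sizes stay roughly fixed, longer videos are dominated by the video and audio tokens.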

Model pricing

Model	Input (per 1M tokens)	Output (per 1M tokens)
`gemini-2.5-pro`	$1.25	$10.00
`gemini-2.5-flash`	$0.30	$2.50
`gemini-2.5-flash-lite`	$0.10	$0.40
`gemini-3.5-flash`	$1.50	$9.00

Cost per reel (30s video)

Model	Input cost	Output cost	Total
`gemini-2.5-pro`	$0.0116	$0.0100	~$0.022
`gemini-2.5-flash`	$0.0028	$0.0025	~$0.005
`gemini-2.5-flash-lite`	$0.0009	$0.0004	~$0.001

Estimates only

These are estimates based on published pricing. Actual charges from Google may vary. Always verify with Google's pricing page.
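
The per-reel figures in the table above can be reproduced from the per-million-token prices. The sketch below is a minimal illustration, assuming the prices listed on this page are still current. The names `PRICING_PER_MILLION` and `estimate_cost_usd` are hypothetical helpers, not part of instatollm or any Google SDK.

```python
# USD per 1M tokens as (input, output), copied from the pricing table on this page.
# Published prices change; treat these values as a snapshot, not a source of truth.
PRICING_PER_MILLION = {
    "gemini-2.5-pro": (1.25, 10.00),
    "gemini-2.5-flash": (0.30, 2.50),
    "gemini-2.5-flash-lite": (0.10, 0.40),
}


def estimate_cost_usd(model, prompt_tokens, output_tokens):
    """Estimate the cost of one call in USD from its token counts."""
    input_price, output_price = PRICING_PER_MILLION[model]
    input_cost = prompt_tokens * input_price / 1_000_000
    output_cost = output_tokens * output_price / 1_000_000
    return input_cost + output_cost


# 30-second Reel: ~9,250 input tokens and ~1,000 output tokens.
for model in PRICING_PER_MILLION:
    cost = estimate_cost_usd(model, 9_250, 1_000)
    print(f"{model}: ~${cost:.4f}")
```

For the 30-second example this yields about $0.0216, $0.0053, and $0.0013, which match the table's totals after rounding.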

Viewing your usage

Your token usage and cost data is available in the dashboard at app.instatollm.com/usage.

The page shows:

- Total spend (USD)
- Total API calls
- Token breakdown (input vs. output)
- Per-model breakdown with cost percentage

API usage endpoint

GET /api/secure/reels/usage
Authorization: Bearer ist_your_token

{
  "total_cost_usd": 0.6765,
  "total_prompt_tokens": 312400,
  "total_output_tokens": 28600,
  "total_tokens": 341000,
  "total_calls": 34,
  "by_model": [
    {
      "model": "gemini-2.5-pro",
      "calls": 34,
      "prompt_tokens": 312400,
      "output_tokens": 28600,
      "total_tokens": 341000,
      "cost_usd": 0.6765
    }
  ]
}
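
A minimal Python sketch of calling this endpoint and deriving the per-model cost percentage is shown below. It uses only the standard library. The base URL is an assumption (the dashboard host is reused here, and the API may be served elsewhere), and the helper names `build_usage_request`, `fetch_usage`, and `cost_share_by_model` are hypothetical.

```python
import json
import urllib.request

# Assumption: the API is served from the same host as the dashboard.
BASE_URL = "https://app.instatollm.com"


def build_usage_request(token, base_url=BASE_URL):
    """Build the GET request for the usage endpoint (nothing is sent yet)."""
    return urllib.request.Request(
        f"{base_url}/api/secure/reels/usage",
        headers={"Authorization": f"Bearer {token}"},
        method="GET",
    )


def fetch_usage(token, base_url=BASE_URL):
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(build_usage_request(token, base_url)) as resp:
        return json.load(resp)


def cost_share_by_model(usage):
    """Return each model's share of the total cost as a percentage."""
    total = usage["total_cost_usd"]
    if total == 0:
        return {entry["model"]: 0.0 for entry in usage["by_model"]}
    return {
        entry["model"]: 100 * entry["cost_usd"] / total
        for entry in usage["by_model"]
    }
```

With a valid token, `cost_share_by_model(fetch_usage("ist_your_token"))` would return a mapping from model name to percentage. For a response with a single model, as in the example above, that model accounts for 100% of the cost.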

Optimizing costs

Use gemini-2.5-flash instead of gemini-2.5-pro

For most use cases, Flash gives comparable quality at roughly a quarter of the cost of Pro (about $0.005 versus $0.022 per 30-second Reel). Switch the model in the dashboard settings.

Shorter videos cost less

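A 15-second Reel uses roughly half the input tokens of a 30-second Reel (about 4,825 versus 9,250), because video and audio tokens scale linearly with duration while the ~400-token prompt stays fixed.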

Re-query the same Reel

Once analyzed, a Reel stays available for follow-up questions for 48 hours. A planned re-query feature will charge only for the new question, not for a repeat of the full video analysis.