Analytics

The Analytics dashboard shows how your team uses ZenSearch — query volume across every surface, agent behavior and latency, AI model spend, search-widget engagement, and agent/automation reliability. It is organized into four tabs.

Tab	What it covers
Search Analytics	Query volume, surface breakdown, agent performance, satisfaction, and queries by API key
Model Usage	Requests, tokens, cost, latency, prompt-cache hit rate, and a per-model / per-operation breakdown
Search Widget	Searches, clicks, click-through rate, zero-result rate, and top queries for the embeddable docs widget
Reliability	Agent run cost (actual vs estimate), early-termination reasons, automation run status, and verification failures

The Search, Search Widget, and Reliability tabs offer 7-day / 30-day / 90-day windows. Model Usage additionally offers 24h and 1 year. Date ranges are computed in UTC so period boundaries line up with how the backend buckets events.

Search Analytics

Summary stats

A row of headline counts for the selected period:

Total Queries — every query event across all surfaces
Searches — keyword/semantic search queries
Auto — agentic chat turns on the default tier (a real agent loop ran)
Research — multi-step research-tier chat turns
Automations — scheduled or triggered automation runs
Satisfaction — thumbs-up rate over rated answers (shows -- until there are ratings)

Query volume over time

A stacked area chart of Searches, Chat, and Agents over the period (daily for 7-/30-day, weekly for 90-day).

Surface breakdown

A pie chart that splits every query into exactly one cell by surface: Web Search, Chat, Auto, Research, Automations, Widget, API, Slack, Teams, and Extension. Web traffic is further split by type (search vs chat vs auto vs research) so a single surface doesn't dominate the chart.

Agent performance

This card spans every tier where an agent loop actually ran — default auto, research, and automations — not just research-tier chat. The headline number is Total Agent Turns, with these metrics underneath:

Metric	Meaning
Avg Iterations	Mean iteration count over multi-step turns (more than one iteration)
P95 Iterations	95th-percentile reasoning depth across agentic tiers
Avg Tool Calls	Mean tool invocations per agent turn
P50 Latency	Median per-turn latency
P95 Latency	95th-percentile per-turn latency
Auto / Research	Web turn counts split by tier

:::note Latency is full server wall-clock, not agent-loop-only The P50 / P95 Latency figures measure the full server turn — from request authentication through to the response (or stream) completing — which is the total user-perceived latency, not the time spent inside the agent loop alone. The percentiles are taken over agentic chat turns (auto + research) that recorded a duration; automation runs and legacy rows without a stamped duration are excluded so they can't skew the numbers. A dash (--) means no agentic turn recorded a duration in the window. :::

User satisfaction

Thumbs-up rate, with a breakdown of positive ratings, negative ratings, and source clicks.

AI model usage (managed vs BYO)

Splits queries by whether they ran on ZenSearch AI (managed models) or your own BYO models.

Queries by API key

A bar chart of query counts per API key, covering requests authenticated with a key (the embeddable widget or direct API calls).

Model Usage

This tab reports AI model consumption and cost across all operations (chat, search, embeddings, reranking, agents) — not only agent runs.

Summary cards

Total Requests
Total Tokens
Input / Output token split
Total Cost (USD)

Performance

Average and P95 latency, error rate, and error count for model calls in the period.

Prompt cache

A cache hit rate over input tokens, plus cached-read and cache-write token counts. Long prompt prefixes are cached automatically by supported providers (Anthropic, OpenAI, Groq), so savings appear once traffic builds up.

Usage timeline, by model, and by operation

A timeline bar chart of token usage, a per-model breakdown (cost, tokens, latency, cache hit rate, error rate, requests) with expand/collapse, and a per-operation view.

Engagement analytics for the embeddable search widget, aggregated across all of a team's API keys:

Total searches, total clicks, click-through rate
Average response time
Unique queries and zero-result rate
Top queries, top clicked results, search trends over time, and the zero-result queries worth adding content for

Reliability

Resilience signals for agent runs and automations. The tab is empty until chat agents or automations have executed.

Cost: actual vs estimate

Total actual spend vs the pre-flight estimate, an estimate-accuracy percentage (100% means actual matched the estimate; above 100% means runs cost more than projected), plus counts of pre-flight blocks, mid-run cuts, and total runs. See Cost & Auto-Resume for how these ceilings are enforced.

Why runs stopped early

A breakdown of early-termination reasons (for example budget exhaustion by iterations, tool calls, tokens, wall-clock, or cost).

Automation run status

Run counts by lifecycle status, with a callout when runs breached their SLA targets.

Verification failures

A breakdown of automation acceptance-criteria failure modes — for example insufficient sources or low confidence — for automations that define verification contracts.

Cost & Auto-Resume — the per-run and per-team dollar ceilings behind the Reliability tab
AI Agents — agent execution, the live cost meter, and the in-chat context-window meter
Evaluation — offline retrieval-quality scoring (NDCG, MRR, MAP)

Search Analytics​

Summary stats​

Query volume over time​

Surface breakdown​

Agent performance​

User satisfaction​

AI model usage (managed vs BYO)​

Queries by API key​

Model Usage​

Summary cards​

Performance​

Prompt cache​

Usage timeline, by model, and by operation​

Search Widget​

Reliability​

Cost: actual vs estimate​

Why runs stopped early​

Automation run status​

Verification failures​

Related​