Plans

Plan        Queries/day  Concurrent  Price
Free        25           2           $0
Pro         200          5           $29/mo
Team        1,000        10          $99/mo
Enterprise  Unlimited    20          Custom
All plans include full access to every feature: twins, KB management, evals, Slack/Telegram integrations, and the macOS app. Plans differ only in query volume: the daily quota and the number of concurrent requests.

Rate Limits

Daily Quota

Each plan has a daily query limit that resets at midnight. Every response shows your remaining count:
⚡ cached · 187 queries remaining today
When you hit the limit:
⏳ Daily query limit reached (25/day on free plan). Resets at midnight.
   Upgrade to Pro for 200 queries/day

Concurrent Requests

Each plan has a concurrent request limit. If you exceed it:
⏳ Rate limited — too many requests. Wait a moment and try again.
The limit prevents overload during stress tests or batch operations. Wait a few seconds, then retry.
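For batch jobs, one way to stay under the concurrent limit is to cap the number of in-flight requests on the client side and retry with a growing delay when a request is rate limited. The following is a minimal sketch, not part of bestmate itself: the ask callable, the RateLimited exception, and the retry settings are all hypothetical stand-ins for however your own script sends a query and detects the rate-limit message.

```python
import time
from concurrent.futures import ThreadPoolExecutor

# Concurrent limits copied from the Plans section.
CONCURRENT_LIMIT = {"free": 2, "pro": 5, "team": 10, "enterprise": 20}


class RateLimited(Exception):
    """Hypothetical error your own wrapper raises on a rate-limit reply."""


def with_backoff(call, retries=4, base_delay=1.0, sleep=time.sleep):
    """Run call(); on RateLimited, wait base_delay * 2**attempt and retry."""
    for attempt in range(retries):
        try:
            return call()
        except RateLimited:
            if attempt == retries - 1:
                raise  # out of retries, surface the error to the caller
            sleep(base_delay * 2 ** attempt)


def run_batch(questions, ask, plan="free"):
    """Send questions with at most CONCURRENT_LIMIT[plan] in flight at once."""
    with ThreadPoolExecutor(max_workers=CONCURRENT_LIMIT[plan]) as pool:
        # pool.map returns results in the same order as the input questions.
        return list(pool.map(lambda q: with_backoff(lambda: ask(q)), questions))
```

Sizing the thread pool to the plan's limit means the script should rarely trigger the rate-limit message at all; the backoff only covers cases where other clients on the same account are also sending requests.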

Caching

Identical queries (same question, same twin, same tenant) are cached for 5 minutes. Cached responses:
  • Don’t count against your daily quota
  • Return instantly (no Firestore reads, no LLM calls)
  • Show ⚡ cached in the CLI
Rapid-fire testing or repeated questions are effectively free.
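The behavior described above matches a time-to-live (TTL) cache keyed on tenant, twin, and question. The sketch below illustrates that idea only; it is not bestmate's actual implementation, and the QueryCache class, its method names, and the exact-string key matching are assumptions made for the example.

```python
import time

CACHE_TTL_SECONDS = 5 * 60  # the 5-minute window described above


class QueryCache:
    """Illustrative TTL cache keyed on (tenant, twin, question)."""

    def __init__(self, ttl=CACHE_TTL_SECONDS, clock=time.monotonic):
        self.ttl = ttl
        self.clock = clock
        self._entries = {}  # key -> (expires_at, answer)

    def get_or_compute(self, tenant, twin, question, compute):
        """Return (answer, cached); compute() runs only on a miss."""
        key = (tenant, twin, question)
        now = self.clock()
        entry = self._entries.get(key)
        if entry is not None and entry[0] > now:
            return entry[1], True  # hit: no recomputation, no quota cost
        answer = compute()  # miss or expired: do the expensive work
        self._entries[key] = (now + self.ttl, answer)
        return answer, False
```

Under this model, changing any one of the three key parts (a different twin, for example) produces a miss, which is consistent with the "same question, same twin, same tenant" rule.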

Monitoring Usage

CLI

Every bestmate ask response shows cache status and remaining queries:
⚡ cached · 187 queries remaining today
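If you script around the CLI, the remaining count can be pulled out of that status line. This is a small sketch that assumes the line keeps exactly the wording shown above; the parse_status helper is hypothetical, and the format of a non-cached status line is not documented here, so the cached flag is a best guess based on the word "cached" appearing before the "·" separator.

```python
import re

# Matches counts such as "187" or "1,000" followed by the documented wording.
STATUS_RE = re.compile(r"(?P<remaining>\d[\d,]*) queries remaining today")


def parse_status(line):
    """Return (cached, remaining) from a status line, or None if no match."""
    match = STATUS_RE.search(line)
    if match is None:
        return None
    remaining = int(match.group("remaining").replace(",", ""))
    # Assumption: a cached response carries "cached" before the separator.
    cached = "cached" in line.split("·")[0]
    return cached, remaining
```

A script could call this on each response and pause a batch job once the remaining count drops below a threshold you choose.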

macOS App

Check your usage in Analytics — query volume, confidence trends, and escalation rates.

Upgrading

Contact kaya@forever22.com to upgrade your plan, or visit bestmate.io.

Tips to Optimize Usage

  1. Cache works for you: asking the same question within 5 minutes is free
  2. Use evals wisely: bestmate eval run counts as N queries (one per eval question). Run evals after batch content updates, not continuously
  3. Deploy doesn’t cost queries: bestmate deploy and bestmate ingest don’t consume your daily quota
  4. Export instead of re-query: use bestmate export-chats to save conversation logs
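Because an eval run costs one query per eval question, it can help to work out how many full runs fit in a day before scheduling them. The arithmetic below is a rough sketch: the eval_runs_per_day helper and its reserved_for_asks parameter are hypothetical, and it assumes the worst case where no eval question is served from cache.

```python
# Daily quotas copied from the Plans section (Enterprise is unlimited).
DAILY_QUOTA = {"free": 25, "pro": 200, "team": 1000}


def eval_runs_per_day(plan, questions_per_eval, reserved_for_asks=0):
    """Full eval runs that fit in the quota left after reserving ad hoc asks."""
    available = max(DAILY_QUOTA[plan] - reserved_for_asks, 0)
    return available // questions_per_eval


# A 40-question eval suite on Pro: 200 // 40 full runs per day.
print(eval_runs_per_day("pro", 40))                        # 5
# Keeping 50 queries back for interactive use: 150 // 40.
print(eval_runs_per_day("pro", 40, reserved_for_asks=50))  # 3
```

On the Free plan, a suite with more than 25 questions cannot complete a single full run in one day, which is a good reason to trim the suite or upgrade before relying on evals.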