Gemini provider
Setting up and using Google Gemini (3.1 Pro, 3 Flash, 2.5 family) in Cerevisor.
Setup You need an API key from aistudio.google.com/apikey . Settings → Providers → + Add provider → Gemini. Paste your API key into the API key field. Click Test connection. Cerevisor sends a tiny OK round-trip via the @google/genai SDK to confirm auth and model accessibility (default smoke-test timeout: 30 seconds). Click Save. The key is stored in your OS keychain. It never lives in plain text on disk or in any .cerevisor file. Dependency note: Gemini requires the optional @google/genai package. Cerevisor ships it bundled in v1.2.0; if a future build doesn't, the setup modal shows an install hint instead of the API-key field. Models Cerevisor reads the live model list from the SDK's models.list() and surfaces it everywhere a model picker appears (cached for 60 minutes per provider lifetime). If the list call fails (network blip, auth issue), Cerevisor falls back to a known set so the picker is never empty. The most-used Gemini models you'll see: Model Best for Notes gemini-3.1-pro-preview Highest quality, deepest reasoning. Per-million pricing doubles above 200K input tokens. Use for senior agents, large-context synthesis. gemini-3.1-flash-lite Fast and cheap. Strong workhorse for reviewers, classifiers, and high-volume transforms. gemini-3-flash-preview Balanced quality at moderate cost. The default if no model is set. gemini-2.5-pro Still current. Cheaper than 3.1-pro for similar quality on many tasks. Also has the >200K tier pricing bump. gemini-2.5-flash / 2.5-flash-lite Lower-cost alternatives if you don't need 3.x reasoning. If you type a custom model id (e.g. a dated variant like gemini-3.1-pro-preview-2026-04 ), Cerevisor uses prefix matching to find the correct pricing row. Unknown ids bill at $0 and emit a one-shot console warning. Cost Cerevisor reads token usage from each response's usageMetadata and multiplies by the per-model pricing table in gemini-pricing.ts (rates verified against ai.google.dev/gemini-api/docs/pricing on 2026-05-14). The status bar shows running cost during a run; the audit log saves the breakdown. Two Gemini-specific quirks Cerevisor handles automatically: Context-tier pricing. On gemini-3.1-pro-preview and gemini-2.5-pro , input rates DOUBLE and output rates jump roughly 50% when the prompt crosses 200,000 tokens. Cerevisor picks the right tier per request based on actual promptTokenCount , not a static guess. Thinking tokens bill as output. Per Google's pricing page, "thinking tokens are included in output pricing." Cerevisor folds thoughtsTokenCount into the output total it reports so cost numbers match Google's billing console. Audio-input pricing is tracked but not yet billed correctly because ContentBlock doesn't carry audio parts in v1.2.0. Audio inputs bill at the text rate until that lands; this only under-reports, never over-reports. Default model In the Library entry, you can set a default model for this provider. When an agent's model preference is (auto) , it falls back to this default. If no d