Model Selection

Every agent needs a brain. On Abundly, you can choose which large language model (LLM) powers your agent—or simply let the platform pick the best one for you.

Default behavior

If you don’t have a preference, leave the model set to (no preference). This is the default, and it means Abundly will use whatever model we think works best for general agentic behavior. We continuously evaluate models and update this default as better options become available. This is the right choice for most users—you get great performance without having to think about model selection at all.

Model aliases vs specific versions

When selecting a model, you can choose between:

Aliases like “Claude Sonnet Latest” — automatically points to the latest version that we have verified works well
Specific versions like “Claude Sonnet 4.5” — locked to that exact version

We recommend using aliases in most cases. Models improve constantly, and when a new version of Claude Sonnet arrives, it usually makes sense to upgrade. By selecting an alias, you won’t need to remember to do that—Abundly will point the alias to the latest version when we’ve verified it works well (typically after some internal testing). However, if you want full control and want to minimize the risk of surprises, pick a specific model version. Although newer models usually perform better, agent behavior can sometimes change subtly. Locking to a specific version lets you manually decide when to upgrade.

Available models

Abundly integrates with the leading AI providers. You can switch models at any time—even mid-conversation.

Claude

Anthropic’s models, known for nuanced reasoning and safety

GPT

OpenAI’s models, versatile and widely capable

Gemini

Google’s models, strong at knowledge tasks and multimodal understanding

Comparing models

Not sure which model to choose? We tend to default to the Claude models (Sonnet or Opus). We find they work very well for agentic behaviour. But the other models have improved a lot lately, and some have specific strenghts that you may want to leverage. This is a changing landscape, so you’re probably best off researching online yourself (or asking an Agent to do it for you…). But here’s a high level overview of how the models from each provider compare against each other.

Anthropic Claude models

Model	Best for	Speed	Cost	Notes
Claude Opus 4.5	Complex reasoning, high-stakes tasks	Moderate	$$$	Most capable Claude model. Best for difficult problems that defeat Sonnet.
Claude Sonnet 4.5	Everyday work, coding, analysis	Fast	$$	Recommended default. Excellent balance of capability and speed.
Claude Haiku 4.5	High-volume, speed-critical tasks	Fastest	$	~3x faster than Sonnet at 1/3 the cost. Near-Sonnet quality for most tasks.

Start with Claude Sonnet for most use cases. Use Opus for your hardest problems and Haiku when you need speed at scale.

Google Gemini models

Model	Best for	Speed	Cost	Notes
Gemini Pro 3	Maximum reasoning depth	Moderate	$$$	Strongest reasoning. 1M token context for huge documents.
Gemini Flash 2.5	Balanced performance	Fast	$$	Good multimodal capabilities. Native audio support.
Gemini Flash Lite 2.5	High-volume processing	Fastest	$	Most cost-effective. Great for bulk summarization and routing.

Gemini Flash is the workhorse—use it for most tasks. Choose Pro for complex reasoning and Flash Lite for high-volume, simple operations.

OpenAI GPT models

Model	Best for	Speed	Cost	Notes
GPT 5.2 Pro	Mission-critical, complex reasoning	Moderate	$$$$	Extended reasoning for highest accuracy. Best for high-stakes decisions.
GPT 5.2	Latest improvements, reduced hallucinations	Fast	$$$	30% fewer hallucinations. Strong on coding (55.6% SWE-Bench Pro) and reasoning.
GPT 5 Mini	Well-defined tasks, interactive apps	Faster	$$	Great balance of speed and capability. Ideal for chat and coding assistants.
GPT 5 Nano	High-volume, simple tasks	Fastest	$	Ultra-cheap. Best for summarization, classification, and bulk processing.

GPT 5.2 is the recommended default for most tasks. Use Mini for speed-sensitive applications and Nano for high-volume simple operations.

Per-context model selection

Different contexts in your agent’s life may benefit from different models. You can configure a model and thinking preference per context:

Chat — Conversations you start with the agent
Scheduled tasks — When the agent runs on a schedule
Email — When the agent responds to incoming emails
Slack — When the agent responds to Slack messages
Agent messages — When other agents send your agent a message

If a context has no override, it inherits the agent’s default model. This lets you use a fast, cheap model for routine scheduled work while keeping a more capable model for interactive chat.

Per-task model overrides

Individual scheduled tasks can have their own model and thinking setting on top of the per-context default. Open a task’s settings card to configure it. This is useful when one particular task needs heavier reasoning than your other scheduled tasks.

Thinking tokens

For Claude models, you can enable thinking—an advanced reasoning mode where the model thinks through problems step by step before responding.

Thinking settings showing toggles for triggers and new chats, plus thinking budget configuration

Thinking makes your agent smarter at complex reasoning tasks, but it also increases processing time and credit usage. You can configure thinking per context (chat, triggers, etc.) and also set a thinking budget—the maximum number of tokens the model can use for thinking (1,024–32,000).

Thinking is currently only available for Claude Sonnet and Opus models.

When to change models

Scenario	Recommendation
Just getting started	Leave it on “(no preference)“
Complex reasoning, planning	Claude Opus, Gemini Pro, or GPT 5.2 Pro
Everyday tasks, coding	Claude Sonnet, GPT 5.2, or GPT 5 Mini
High-volume, simple tasks	Claude Haiku, Gemini Flash Lite, or GPT 5 Nano
Speed is critical	Claude Haiku, Gemini Flash, or GPT 5 Nano
Lowest hallucination rates	GPT 5.2 or GPT 5.2 Pro
Working with long documents	Gemini Pro (1M token context)

Unified interface

Regardless of which model you choose, the platform provides a consistent experience:

Switch models even mid-conversation
Same capabilities work across all models
Consistent behavior and tool usage
No need to learn different APIs

What if I pick a model that doesn't work well for my use case?

You can switch models at any time. If you’re not happy with the results, try a different model—your agent’s instructions and capabilities stay the same.

How does pricing work for different models?

Each model has different credit costs per token. More capable models like Opus and GPT 5.2 cost more per request, while Haiku and Flash Lite are very cost-effective. Check your usage dashboard to monitor credit consumption.

Can I use different models for different agents?

Yes. Each agent can have its own model preference. You might use a fast, cheap model for a high-volume support agent and a powerful reasoning model for a research agent.

Can I use different models for different contexts within the same agent?

Yes. You can set a model per context (chat, scheduled tasks, email, Slack, agent messages) and even override the model on individual scheduled tasks. This lets you optimize cost and capability for each type of work the agent does.

Learn more

Anthropic (Claude)

Learn about Claude models and capabilities

OpenAI (GPT)

Learn about GPT models and capabilities

Google Gemini

Learn about Gemini models and capabilities

Core Features

Administration

Advanced

Model Selection

Default behavior

Model aliases vs specific versions

Available models

Claude

GPT

Gemini

Comparing models

Anthropic Claude models

Google Gemini models

OpenAI GPT models

Per-context model selection

Per-task model overrides

Thinking tokens

When to change models

Unified interface

Learn more

Anthropic (Claude)

OpenAI (GPT)

Google Gemini

Core Features

Administration

Advanced

​Default behavior

​Model aliases vs specific versions

​Available models

Claude

GPT

Gemini

​Comparing models

​Anthropic Claude models

​Google Gemini models

​OpenAI GPT models

​Per-context model selection

​Per-task model overrides

​Thinking tokens

​When to change models

​Unified interface

​Learn more

Anthropic (Claude)

OpenAI (GPT)

Google Gemini

Default behavior

Model aliases vs specific versions

Available models

Comparing models

Anthropic Claude models

Google Gemini models

OpenAI GPT models

Per-context model selection

Per-task model overrides

Thinking tokens

When to change models

Unified interface

Learn more