AI Model Selection
🧠 Choose Your AI Engine
Provider Connection Requirements
⚠️ Important: To use models from any provider (except AgenticFlow), you must first add a connection for that provider in your project's Connection Settings.
AgenticFlow Provider: Uses your AgenticFlow credits directly - no connection setup required
All Other Providers: Require a valid connection configured in Connection Settings
Automatic Selection: The server automatically uses the first available connection for the selected model's provider
How to add a provider connection:
Navigate to your project's Connection Settings
Add a new connection for your desired provider (OpenAI, Anthropic, Google, etc.)
Configure the required API keys and credentials
Once connected, you can select models from that provider in your agent configuration
Quick Provider Overview
PixelML
50+
Unified access to latest models (GPT-5, Claude 4.5, Gemini 3)
$0.05-15 input
OpenAI
17
Reasoning (O1/O3), Coding (GPT-5.1 Codex), General use
$0.08-15 input
Anthropic
8
Analysis, structured outputs, long context
$1-15 input
Google GenAI
5
Vision, video, audio, 1M+ token context
$0.075-2.5 input
Groq
14
Ultra-fast inference (< 1 sec), open-source models
$0.03-1 input
DeepSeek
2
Cost-efficient reasoning and chat
$0.27-0.55 input
AgenticFlow
11
Budget-friendly, 50K input token limit
$0.05-0.2 input
📊 Complete Model Comparison Table
OpenAI Models
GPT-5 Pro
$15.00
$120.00
400K
Vision, Tools, Structured
Premium reasoning, critical decisions
GPT-5.1
$1.25
$10.00
400K
Vision, Tools, Structured
Advanced general use
GPT-5.1 Codex
$1.25
$10.00
400K
Tools, Structured
Code generation & debugging
GPT-5.1 Codex Mini
$1.50
$6.00
400K
Tools, Structured
Efficient coding tasks
GPT-5
$1.25
$10.00
272K
Vision, Tools, Structured
General premium use
GPT-5 Mini
$0.25
$2.00
272K
Vision, Tools, Structured
Cost-effective quality
GPT-5 Nano
$0.05
$0.40
272K
Vision, Tools, Structured
High-volume budget tasks
GPT-4.1
$2.00
$8.00
1M
Tools, Structured
Large context needs
GPT-4.1 Mini
$0.40
$1.60
1M
Tools, Structured
Cost-effective large context
GPT-4.1 Nano
$0.08
$0.32
1M
Tools, Structured
Ultra-budget large context
GPT-4o
$0.50
$1.50
128K
Tools, Structured
Standard general use
O3
$1.10
$4.40
128K
Reasoning, Tools, Structured
Complex reasoning
O3 Mini
$0.60
$2.40
200K
Reasoning, Tools, Structured
Efficient reasoning
O1
$1.10
$4.40
200K
Reasoning, Tools, Structured
Advanced reasoning
O1 Mini
$0.40
$1.60
128K
Tools, Structured
Budget reasoning
GPT OSS 120B
$0.15
$0.75
131K
Tools, Structured
Open-source compatible
GPT OSS 20B
$0.10
$0.50
131K
Tools, Structured
Efficient open-source
Anthropic (Claude) Models
Claude 4.5 Opus
$5.00
$25.00
200K
Vision, Reasoning, Tools, Structured
Premium analysis & reasoning
Claude 4.5 Sonnet
$3.00
$15.00
200K
Tools, Structured
Balanced quality & cost
Claude 4.5 Haiku
$1.00
$5.00
200K
Tools, Structured
Fast, cost-effective
Claude 4 Opus
$15.00
$75.00
50K
Tools, Structured
Highest quality reasoning
Claude 4 Sonnet
$3.00
$15.00
50K
Tools, Structured
Quality analysis
Claude 3.7 Sonnet
$3.00
$15.00
50K
Tools, Structured
Advanced analysis
Claude 3.7 Sonnet Thinking
$3.00
$15.00
200K
Reasoning, Structured
Thinking mode enabled
Claude 3.5 Opus
$15.00
$75.00
50K
Tools, Structured
Legacy premium
Claude 3.5 Sonnet
$3.00
$15.00
50K
Tools, Structured
Legacy balanced
Claude 3.5 Haiku
$3.00
$4.00
50K
Tools, Structured
Legacy fast
Google Gemini Models
Gemini 3 Pro Preview
$2.00
$12.00
1M
Vision, Tools, Structured
Latest Gemini capabilities
Gemini 2.5 Pro
$2.50
$15.00
1M
Vision, Video, Tools, Structured
Multi-modal premium
Gemini 2.5 Flash
$0.15
$0.60
1M
Tools, Structured
Fast, cost-effective
Gemini 2.5 Flash Lite
$0.075
$0.30
1M
Vision, Audio, Video, Tools, Structured
Ultra-budget multi-modal
Gemini 2.0 Flash
$0.10
$0.40
1M
Vision, Tools, Structured
Balanced speed & cost
Gemini 2.0 Flash Lite
$0.075
$0.30
1M
Vision, Tools, Structured
Budget vision support
Gemini 1.5 Pro
$2.50
$10.00
2M
Tools, Structured
Largest context window
Gemini 1.5 Flash
$0.15
$0.60
1M
Tools, Structured
Fast & efficient
DeepSeek Models
DeepSeek Reasoner
$0.55
$2.19
64K
Reasoning, Tools, Structured
Cost-effective reasoning
DeepSeek Chat
$0.27
$1.10
64K
Tools
Budget conversational AI
Groq Models (Ultra-Fast Inference)
Llama 3.3 70B Versatile
$0.59
$0.79
131K
Tools
Fast large model
DeepSeek R1 Distill Llama 70B
$0.75
$0.99
131K
Tools
Fast reasoning
Llama 3.1 8B Instant
$0.05
$0.08
131K
Tools
Ultra-fast, ultra-cheap
Llama 4 Maverick 17B 128E
$0.24
$0.24
131K
Tools
Fast balanced model
Llama 4 Scout 17B 16E
$0.11
$0.34
131K
Tools
Fast efficient model
Kimi K2 Instruct
$1.00
$3.00
131K
Tools, Structured
Fast advanced chat
Qwen 3 32B
$0.29
$0.54
131K
Tools
Fast Chinese + English
Mistral Saba 24B
$0.79
$0.79
32K
Tools
Fast European model
Gemma 2 9B
$0.20
$0.20
8K
Tools
Fast small model
GPT OSS 120B
$0.15
$0.75
131K
Tools
Fast open-source large
GPT OSS 20B
$0.10
$0.40
131K
Tools
Fast open-source small
Llama Guard 4 12B
$0.20
$0.20
131K
Streaming
Safety moderation
Llama Prompt Guard 2 86M
$0.04
$0.04
512
Streaming
Prompt injection detection
Llama Prompt Guard 2 22M
$0.03
$0.03
512
Streaming
Fast prompt filtering
Additional Models (PixelML)
Grok 4
$3.00
$15.00
64K
Tools, Structured
xAI model with real-time data
Kimi K2
$0.57
$2.30
64K
Tools, Structured
Chinese language specialist
Kimi K2 Thinking
$0.60
$2.50
262K
Reasoning, Tools, Structured
Long-context reasoning
GLM-4.5
$0.39
$1.55
131K
Tools, Structured
Chinese bilingual model
GLM-4.5 Air
$0.14
$0.86
131K
Tools, Structured
Efficient Chinese model
AgenticFlow Provider (Budget Models)
All AgenticFlow models have 50K input token limit for cost control:
Gemini 2.5 Flash Lite
$0.075
$0.30
1M
Vision, Audio, Video, Tools, Structured
Best budget multi-modal
Gemini 2.0 Flash Lite
$0.07
$0.30
50K
Tools, Structured
Budget vision
Gemini 2.0 Flash
$0.10
$0.40
50K
Vision, Tools, Structured
Budget balanced
Gemini 1.5 Flash
$0.07
$0.30
50K
Tools, Structured
Budget efficient
GPT-5 Nano
$0.05
$0.40
50K
Vision, Tools, Structured
Budget OpenAI vision
GPT-4o Mini
$0.10
$0.40
50K
Tools, Structured
Budget OpenAI
GPT OSS 120B
$0.15
$0.75
131K
Tools, Structured
Budget large model
GPT OSS 20B
$0.10
$0.50
131K
Tools, Structured
Budget small model
DeepSeek V3
$0.20
$1.10
50K
Tools, Structured
Budget reasoning
Claude 3.5 Haiku
$1.00
$5.00
50K
Tools, Structured
Budget Claude
GLM-4.5 Air
$0.14
$0.86
131K
Tools, Structured
Budget Chinese
Notes:
All costs are per million tokens
Context shown is max input tokens
Features: Tools = Function calling, Structured = JSON schema, Vision/Audio/Video = Multi-modal
🎯 Model Selection by Use Case
General Business Use
Recommended: GPT-4.1 ($2/M), Claude 4.5 Sonnet ($3/M), Claude 3.5 Sonnet ($3/M)
Best overall performance for business tasks
Excellent reasoning and communication
Good balance of capability and cost
Customer Support (High-Volume)
Recommended: Claude 4.5 Haiku ($1/M), GPT-4.1 Nano ($0.08/M), Gemini 2.5 Flash Lite ($0.075/M)
Fast response times
Cost-effective for high-volume
Tool calling and structured output support
Content Creation
Recommended: Claude 4.5 Opus ($5/M), GPT-5 ($1.25/M), GPT-4.1 ($2/M)
Superior writing capabilities
Vision support for image-based content
Advanced formatting
Data Analysis
Recommended: Claude 4.5 Sonnet ($3/M), Gemini 2.5 Pro ($2.5/M), DeepSeek Reasoner ($0.55/M)
Excellent analytical reasoning
Strong structured data handling
Large context windows (200K-1M+)
Technical Support & Coding
Recommended: GPT-5.1 Codex ($1.25/M), Claude 4.5 Opus ($5/M), DeepSeek Reasoner ($0.55/M)
Code understanding and generation
Complex problem-solving
Reasoning features for debugging
Budget-Conscious / High-Volume
Recommended: Gemini 2.5 Flash Lite ($0.075/M), GPT OSS 20B ($0.10/M), Llama 3.1 8B ($0.05/M)
Ultra-low costs
Still capable for most tasks
Fast inference times
Advanced Reasoning
Recommended: O1 ($1.10/M), O3 ($1.10/M), DeepSeek Reasoner ($0.55/M), Claude 4.5 Opus ($5/M)
Step-by-step reasoning
Complex problem-solving
Mathematical and logical tasks
Multi-Modal (Vision/Video/Audio)
Recommended: Gemini 2.5 Flash Lite ($0.075/M), Gemini 2.5 Pro ($2.5/M), GPT-5 ($1.25/M)
Image understanding
Video analysis (Gemini 2.5 Pro/Lite)
Audio processing (Gemini 2.5 Lite)
⚙️ Configuration Settings
Temperature Control
0.0-0.3 (Focused): Consistent, predictable responses for support, analysis, documentation
0.4-0.7 (Balanced): Natural variation for general communication
0.8-1.0 (Creative): High creativity for marketing, writing, brainstorming
Default: 0.1 (backend standard)
Token Limits
Max Input Tokens: Controls context window (50K-2M depending on model)
Max Output Tokens: Controls response length (1K-128K depending on model)
AgenticFlow Provider: Capped at 50K input for cost control
Advanced Settings
Streaming: Enabled by default for all models
Tool Calling: Function calling for external integrations
Structured Output: JSON schema enforcement
Response Format: Plain text, JSON, or custom structures
💰 Cost Optimization Tips
Choose the Right Provider
PixelML: Latest models, highest limits.
AgenticFlow: Use your Agenticflow Credit, no extra cost.
Direct Providers: Full features, varying costs
Token Management
Set appropriate max_input_tokens to control costs
Configure max_output_tokens based on actual needs
Lower temperature (0.1-0.3) = shorter, focused responses
Feature Selection
Only use vision-enabled models when processing images
Use reasoning models only for complex logic
Choose standard models for routine tasks
📋 Quick Reference
Fastest Models (< 1 second)
All Groq models: Llama 3.1 8B, Llama 3.3 70B, Llama 4 family, Gemma 2 9B
Largest Context (1M+ tokens)
GPT-4.1 (1M), All Gemini models (1M-2M)
Best Value
Gemini 2.5 Flash Lite ($0.075/M), GPT-4.1 Nano ($0.08/M), Llama 3.1 8B ($0.05/M)
Most Capable
Claude 4.5 Opus, GPT-5 Pro, O1/O3, Claude 4 Opus
Best for Coding
GPT-5.1 Codex, Claude 4.5 Opus, DeepSeek Reasoner
Multi-Modal
Gemini 2.5 family (vision + video + audio), GPT-5 family (vision), Claude 4.5 Opus (vision)
Last updated
Was this helpful?