Models

Browse 10 canonical LLMs across all providers

10 models

Gemini 3 Flash

2 providers, 1.0M ctx

Google's balanced model combining Gemini 3 Pro's reasoning capabilities with the Flash line's latency, efficiency, and cost. Features configurable thinking levels, multimodal function responses, and streaming function calling for complex agentic workflows.

$0.50 / $3.00 per 1M tokens

chat, completion, function-calling, vision, +3
text, image, audio, video, code

Gemini 3.1 Flash-Lite

2 providers, 1.0M ctx

Google's most cost-efficient Gemini model optimized for high-volume, low-latency use cases. Delivers 2.5x faster time to first token versus Gemini 2.5 Flash with full multimodal support. Ideal for agentic tasks, data extraction, translation, and classification.

$0.25 / $1.50 per 1M tokens

chat, completion, function-calling, vision, +2
text, image, audio, video, code

GPT-5.5

1.0M ctx

OpenAI's most capable model designed for complex real-world work including coding, online research, information analysis, and document creation. Features advanced agentic capabilities with tool search and multi-step task execution.

$12.00 / $48.00 per 1M tokens

chat, completion, function-calling, vision, +3
text, image, audio, code

Muse Spark

256K ctx

Meta Superintelligence Labs' first model, featuring advanced reasoning, multimodal understanding, and agentic capabilities. Processes voice, text, and image inputs with tool use and multi-agent orchestration. Powers Meta AI across its product ecosystem.

$5.00 / $25.00 per 1M tokens

chat, completion, function-calling, vision, +3
text, image, audio, code

Gemini 3.1 Pro

2 providers, 2.0M ctx

Google's latest flagship multimodal model with state-of-the-art performance on reasoning, coding, and multimodal understanding. Features native tool use, grounding, and a million-token context window.

$7.00 / $21.00 per 1M tokens

chat, completion, function-calling, vision, +3
text, image, audio, video, code

GPT-5.5 Pro

256K ctx

OpenAI's premium-tier model with extended reasoning capabilities, higher accuracy on complex tasks, and priority access. Optimized for professional and enterprise workloads requiring maximum quality.

$30.00 / $120.00 per 1M tokens

chat, completion, function-calling, vision, +3
text, image, audio, code

Gemini 2.5 Flash

2 providers, 1.0M ctx

Google's cost-effective model optimized for high-throughput tasks. Balances speed and intelligence with strong multimodal capabilities and a 1M-token context window.

$0.15 / $0.60 per 1M tokens

chat, completion, function-calling, vision, +3
text, image, audio, video, code

Gemini 2.5 Pro

2 providers, 1.0M ctx

Google's high-capability reasoning model with adaptive thinking for complex agentic and multimodal challenges. Features a 1M-token context window and strong performance on coding and scientific tasks.

$2.50 / $15.00 per 1M tokens

chat, completion, function-calling, vision, +3
text, image, audio, video, code

GPT-5

2 providers, 256K ctx

OpenAI's fifth-generation flagship model with significant improvements in reasoning, multimodal understanding, and code generation. Features enhanced instruction following and an expanded context window.

$10.00 / $40.00 per 1M tokens

chat, completion, function-calling, vision, +3
text, image, audio, code

Whisper

448 ctx

OpenAI's Whisper automatic speech recognition model capable of multilingual audio transcription and translation, trained on a large dataset of diverse audio for robust real-world performance.

audio
audio, text
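
The per-1M-token prices in the listings above can be turned into a rough per-request estimate. The sketch below assumes that the first figure in each price pair is the input-token price and the second is the output-token price, which the listing itself does not state. The function name `estimate_cost` and the token counts are hypothetical, chosen only for illustration.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, input_price, output_price):
    """Estimate the USD cost of one request.

    Prices are USD per 1M tokens, passed as strings so Decimal keeps
    them exact. Treating the first listed price as "input" and the
    second as "output" is an assumption, not something the listing says.
    """
    input_cost = Decimal(input_tokens) * Decimal(input_price)
    output_cost = Decimal(output_tokens) * Decimal(output_price)
    return (input_cost + output_cost) / TOKENS_PER_MILLION


# Gemini 3 Flash figures from the listing: 0.50 and 3.00 per 1M tokens.
# The token counts are made-up example values.
cost = estimate_cost(200_000, 10_000, "0.50", "3.00")
print(f"Estimated cost: ${cost}")
```

`Decimal` is used instead of floats so that small per-token amounts do not pick up rounding noise when summed across many requests.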