Models

Browse 11 canonical LLM models across all providers

Sort by

Compare Models

Showing 1–11 of 11 models

Gemma 4 12B

USA3 providers262K ctx

Google's medium-size open-weight model with 12 billion parameters from the Gemma 4 family. Encoder-free unified multimodal architecture that natively processes text, image, audio, and video inputs without dedicated encoders. Features a 256K context window and supports 140+ languages. First medium-sized model capable of natively ingesting audio. Suitable for local deployment on GPUs with 16GB VRAM.

$0.00 – $0.10 / 1M tokens

Gemini 3.1 Flash-Lite

USA2 providers1.0M ctx

Google's most cost-efficient Gemini model optimized for high-volume, low-latency use cases. Delivers 2.5x faster time to first token versus Gemini 2.5 Flash with full multimodal support. Ideal for agentic tasks, data extraction, translation, and classification.

$0.25 – $1.50 / 1M tokens

Gemini 3 Flash

USA2 providers1.0M ctx

Google's balanced model combining Gemini 3 Pro's reasoning capabilities with the Flash line's latency, efficiency, and cost. Features configurable thinking levels, multimodal function responses, and streaming function calling for complex agentic workflows.

$0.50 – $3.00 / 1M tokens

GPT-5.5

USA1.0M ctx

OpenAI's most capable model designed for complex real-world work including coding, online research, information analysis, and document creation. Features advanced agentic capabilities with tool search and multi-step task execution.

$12.00 – $48.00 / 1M tokens

Muse Spark

USA256K ctx

Meta Superintelligence Labs' first model, featuring advanced reasoning, multimodal understanding, and agentic capabilities. Processes voice, text, and image inputs with tool use and multi-agent orchestration. Powers Meta AI across its product ecosystem.

$5.00 – $25.00 / 1M tokens

Gemini 3.1 Pro

USA2 providers2.0M ctx

Google's latest flagship multimodal model with state-of-the-art performance on reasoning, coding, and multimodal understanding. Features native tool use, grounding, and million-token context window.

$7.00 – $21.00 / 1M tokens

GPT-5.5 Pro

USA256K ctx

OpenAI's premium tier model with extended reasoning capabilities, higher accuracy on complex tasks, and priority access. Optimized for professional and enterprise workloads requiring maximum quality.

$30.00 – $120.00 / 1M tokens

Gemini 2.5 Flash

USA2 providers1.0M ctx

Google's cost-effective model optimized for high throughput tasks. Balances speed and intelligence with strong multimodal capabilities and 1M token context window.

$0.15 – $0.60 / 1M tokens

Gemini 2.5 Pro

USA2 providers1.0M ctx

Google's high-capability reasoning model with adaptive thinking for complex agentic and multimodal challenges. Features 1M token context window and strong performance on coding and scientific tasks.

$2.50 – $15.00 / 1M tokens

GPT-5

USA2 providers256K ctx

OpenAI's fifth-generation flagship model with significant improvements in reasoning, multimodal understanding, and code generation. Features enhanced instruction following and expanded context window.

$10.00 – $40.00 / 1M tokens

Whisper

USA448 ctx

OpenAI's Whisper automatic speech recognition model capable of multilingual audio transcription and translation, trained on a large dataset of diverse audio for robust real-world performance.