Featherless

HealthyTelemetry updated 24m ago

Serverless LLM inference platform hosting 20,000+ open-source models from Hugging Face with flat-rate subscription pricing and unlimited token usage. OpenAI-compatible API with no per-token billing — access any model up to a given size based on subscription tier. Largest Hugging Face inference provider by model count.

API Base URL

https://api.featherless.ai/v1

Authentication

bearer

Uptime (24h)

100.0%

Uptime (7d)

100.0%

Supported Regions

global

Latency (TTFT)

Time to first token percentiles

No latency data available

Health History

Uptime over the last 7 days

7-Day Uptime100.00% — Excellent

24-Hour Uptime100.00% — Excellent

Current Status

healthy

Last Checked

24m ago

Supported Models (3)

Models available through this provider. Click a model to view details.

Model	Pricing (per 1M)	Rate Limits	Regions
Kumru 7B kumru-7b	In: $0.00 Out: $0.00	60 RPM / 300K TPM	global
Trendyol LLM 8B T1 trendyol-llm-8b	In: $0.00 Out: $0.00	60 RPM / 300K TPM	global
WiroAI Turkish LLM 9B wiroai-turkish-llm-9b	In: $0.00 Out: $0.00	60 RPM / 300K TPM	global