Back to providers

Featherless

HealthyTelemetry updated 24m ago

Serverless LLM inference platform hosting 20,000+ open-source models from Hugging Face with flat-rate subscription pricing and unlimited token usage. OpenAI-compatible API with no per-token billing — access any model up to a given size based on subscription tier. Largest Hugging Face inference provider by model count.

Authentication

bearer

Uptime (24h)

100.0%

Uptime (7d)

100.0%

Supported Regions

global

Latency (TTFT)

Time to first token percentiles

No latency data available

Health History

Uptime over the last 7 days

7-Day Uptime100.00% — Excellent
24-Hour Uptime100.00% — Excellent

Current Status

healthy

Last Checked

24m ago

Supported Models (3)

Models available through this provider. Click a model to view details.

ModelPricing (per 1M)Rate LimitsRegions

Kumru 7B

kumru-7b

In: $0.00
Out: $0.00
60 RPM / 300K TPM
global

Trendyol LLM 8B T1

trendyol-llm-8b

In: $0.00
Out: $0.00
60 RPM / 300K TPM
global

WiroAI Turkish LLM 9B

wiroai-turkish-llm-9b

In: $0.00
Out: $0.00
60 RPM / 300K TPM
global
Featherless - OpenModels