Alloma 8B Instruct4K ctx
Uzbek LLM Lab's 8B parameter instruction-tuned model optimized for the Uzbek language. Built on Llama architecture with a custom tokenizer averaging 1.7 tokens per Uzbek word versus 3.5 in original Llama, enabling 2x faster inference. Trained on 3.6B tokens with 4096 context length.