K2 Think131K ctx
A 32 billion parameter open-weights reasoning model by LLM360/MBZUAI, built on Qwen2.5-32B. Trained with reinforcement learning and verifiable rewards for long chain-of-thought reasoning, agentic planning, and complex problem solving in math, science, and code.