Qwen 3.6-27B Released — Dense 27B Leads Agentic Coding, 40 tok/s on RTX 3090 [April 2026]
Qwen 3.6-27B Dense from Alibaba's Qwen Team, released April 22, 2026: 77.2 on SWE-bench Verified, 59.3 on Terminal-Bench 2.0 (matching Claude 4.5 Opus), 262K-to-1M context, Apache 2.0 license, and 40 tok/s on an RTX 3090 with Q4_K_M — summarized from official sources.
Qwen 3.6-27B at a glance
Alibaba's Qwen Team released Qwen 3.6-27B on April 22, 2026 — the first Dense (non-MoE) open-weight model in the Qwen 3.6 family. It's published on Hugging Face and ModelScope under Apache 2.0, allowing commercial use and fine-tuning, with substantial improvements aimed at agentic coding.
Reported benchmarks
Headline numbers from the official announcement:
| Benchmark | Qwen 3.6-27B |
|---|---|
| SWE-bench Verified | 77.2 |
| Terminal-Bench 2.0 | 59.3 (matches Claude 4.5 Opus) |
| QwenWebBench | 1487 |
The model reportedly beats both its predecessor Qwen 3.5-27B and the much larger Qwen 3.5-397B-A17B MoE on multiple tasks. Matching Claude 4.5 Opus on Terminal-Bench 2.0 with a 27B open model is the headline.
Architecture highlights
Despite being 27B Dense, Qwen 3.6-27B introduces a "Thinking Preservation" mechanism (per the official blog) and combines Gated DeltaNet linear attention with traditional self-attention in a hybrid architecture. Native context is 262,144 tokens, extensible to roughly 1,010,000. The model also accepts text, image, and video inputs.
Qwen 3.5-9B vs 3.6-27B — Which to pick
Our existing posts on Qwen 3.5-9B (Complete Guide / Claude replacement migration guide) cover the lighter sibling, which is still a strong choice. Here's how to decide between the two:
| Aspect | Qwen 3.5-9B (light, default) | Qwen 3.6-27B (coding-strong, upper) |
|---|---|---|
| Architecture | 9B Dense + Gated DeltaNet + Sparse MoE | 27B Dense + Thinking Preservation + Gated DeltaNet hybrid |
| Strengths | Japanese fluency, low ops cost, runs from 5GB | Agentic coding, SWE-bench / Terminal-Bench top tier among 27B-class |
| License | Apache 2.0 | Apache 2.0 |
| Native context | 262K (extensible to 1M) | 262K (extensible to 1M) |
| VRAM @ Q4 | ~6 GB | ~16–18 GB |
| Minimum hardware | M1 Mac 8GB / RTX 3060 12GB | M4 Pro 32GB / RTX 3090 24GB / RTX 4090 24GB |
| Speed (quantized, reported) | 40–80 tok/s on RTX 3060 class | ~40 tok/s on RTX 3090 (Q4_K_M) |
| Multimodal | Text-focused | Text + image + video |
| Best for | Business chat, summarization, FAQ, internal RAG | Agentic dev, coding assistance, real DX builds |
| As Claude/GPT replacement | Stand-in for Haiku / Mini class | Reportedly head-to-head with Claude 4.5 Opus on coding benches |
Picking guidance: - Cost-first, run on a Mac mini → Qwen 3.5-9B - Agentic coding / dev productivity is the core need → Qwen 3.6-27B - Already have a 24GB+ GPU server / serious DX investment → Qwen 3.6-27B - Need it on a laptop / 8GB Mac → Qwen 3.5-9B - Need multimodal → Qwen 3.6-27B (3.5-9B is text-focused) A hybrid setup — Qwen 3.5-9B for routine queries and Qwen 3.6-27B only when coding judgment is required — is often the most cost-balanced choice.
Hardware — 40 tok/s on RTX 3090
Reported runs show ~40 tokens/sec on an RTX 3090 24GB with Q4_K_M quantization, passing 10 of 10 functional tests. Hitting practical throughput on a workstation-class consumer GPU makes this 27B Dense open model genuinely deployable for SMBs. Apple Silicon (M3 Max 64GB / M4 Max-class) is also viable for quantized inference.
How Oflight uses it
Qwen 3.6-27B is now a candidate local backend for OpenClaw (see OpenClaw 2026.4.23 release notes). It pairs well with internal-DX projects where confidential data shouldn't leave a private network but agentic coding is still required, and Apache 2.0 fine-tuning is a practical bonus. For deployment guidance, see AI Consulting.
FAQ
Q1: Worth upgrading from Qwen 3.5-9B? A: For coding-heavy workloads, likely yes. Note that 9B → 27B raises VRAM requirements significantly. Q2: Does it run on Mac mini? A: M4 Pro 32GB+ can run quantized Qwen 3.6-27B usefully. For M2 8GB-class, prefer Qwen 3.5-9B or Gemma 4 E4B. Q3: Is commercial use allowed? A: Apache 2.0 — commercial use, modification, and redistribution are all permitted.
References
Feel free to contact us
Contact Us