Discussion about this post

User's avatar
Dan Cucolea's avatar

I use Qwen 3.6 Plus via Opencode, not sure how quantized it is. Have you tried using that one? I'm curious if it's much better or kind of the same with running Qwen 3.6 locally.

The Parrot And The Dreamer's avatar

I need to get more timings but my sense is “good enough, wish I had more, love the OTHER factors.” It’s a trade off. Your rig is definitely way faster. We’ve mostly been using it so far for the rest of the harness and scaffolding not as the primary AI. Prefill is the killer, but the much larger kvcache has its charms.

18 more comments...

No posts

Ready for more?