Rendered at 06:59:05 GMT+0000 (Coordinated Universal Time) with Cloudflare Workers.
matt_daemon 12 minutes ago [-]
> Hardware (minimum): 1× H100 @ FP8
Cool to see this but seems like it would be pretty expensive to run
moojacob 3 hours ago [-]
I was a fan of coheres general purpose LLM. Command A I think? Before they came out with their reasoning model.
More competition is better.
SubiculumCode 3 hours ago [-]
I always forget the VRAM requirements on these MOE things
sipjca 3 hours ago [-]
fwiw because of the relatively few activated params offloading to system RAM is quite feasible, you can see the endless amount of people doing this on r/localllama with qwen3.6 35a3b
bitwize 14 minutes ago [-]
I ran Gemma4 26B A4B on an 8yo PC with a fucking GTX and it did rather well.
tonyrice 4 hours ago [-]
I'm excited to see more OSS models
4 hours ago [-]
zuzululu 3 hours ago [-]
Wasn't aware that Cohere was still around but this release doesn't exactly instill confidence.
greyb 2 hours ago [-]
>Wasn't aware that Cohere was still around but this release doesn't exactly instill confidence.
It's being kept alive because the Canadian government is desperate to have a local frontier lab and is willing to inject funding and force its adoption in government services, but leadership at Cohere is known to be weak in Canadian tech circles, and they pivoting to an enterprise-first market around production RAG rather than anything close to frontier work.
I'm glad they're doing open weight releases but they're not viable in the long-run. It is embarrassing sharing similar spaces with them, but I'll try this release out in OpenCode and re-think afterwards.
kadoban 2 hours ago [-]
Really? Why not. From the benchmarks at least it's a pretty decent small model.
cyanydeez 2 days ago [-]
looks like it's just qwen 3.6 coder.
lumost 5 hours ago [-]
its worse at code compared to qwen 3.6 coder.
stymaar 59 minutes ago [-]
How can it be worse than something that doesn't exist?
SubiculumCode 4 hours ago [-]
Do you mean it's based on qwen 3.6 coder?
daemonologist 3 hours ago [-]
There is no "coder" version of Qwen 3.6; I think they just mean it's a coding-focused model of similar size and performance (to Qwen 3.6 35B-A3B).
Regular Qwen 3.6 benchmarks slightly better and has much wider software support though, so this is probably of interest only to organizations which disallow models trained in China.
kadoban 3 hours ago [-]
I mean, Qwen 3.6 kicks ass. I don't know who these people are, but if their first outing is "not quite as good as Qwen 3.6", that's not a bad start by any means.
30B vs 35B isn't nothing either.
If it ends up just being some tweaks to someone else's weights, then meh.
mtone 2 hours ago [-]
It was trained from scratch by Cohere. They're the only Canadian AI lab - I'm glad they're releasing open weights and I wish them luck catching up!
moralestapia 5 hours ago [-]
>Our plan to being profitable is to give mediocre stuff for free
Cool to see this but seems like it would be pretty expensive to run
More competition is better.
It's being kept alive because the Canadian government is desperate to have a local frontier lab and is willing to inject funding and force its adoption in government services, but leadership at Cohere is known to be weak in Canadian tech circles, and they pivoting to an enterprise-first market around production RAG rather than anything close to frontier work.
I'm glad they're doing open weight releases but they're not viable in the long-run. It is embarrassing sharing similar spaces with them, but I'll try this release out in OpenCode and re-think afterwards.
Regular Qwen 3.6 benchmarks slightly better and has much wider software support though, so this is probably of interest only to organizations which disallow models trained in China.
30B vs 35B isn't nothing either.
If it ends up just being some tweaks to someone else's weights, then meh.