On paper. There's huge financial incentive to quantize the crap out of a good mo...

armchairhacker · 2026-03-27T10:17:23 1774606643

And there’s an incentive to publish evidence of this to discourage it, do you have any?

TeMPOraL · 2026-03-27T10:54:11 1774608851

Models aren't just big bags of floats you imagine them to be. Those bags are there, but there's a whole layer of runtimes, caches, timers, load balancers, classifiers/sanitizers, etc. around them, all of which have tunable parameters that affect the user-perceptible output.

natebc · 2026-03-27T11:12:46 1774609966

There really always is a man behind the curtain eh?

coldtea · 2026-03-27T12:12:35 1774613555

Often it's literally just that:

https://www.msn.com/en-us/money/other/ai-startup-backed-by-m...

TeMPOraL · 2026-03-27T11:35:12 1774611312

It's still engineering. Even magic alien tech from outer space would end up with an interface layer to manage it :).

ETA: reminds me of biology, too. In life, it turns out the more simple some functional component looks like, the more stupidly overcomplicated it is if you look at it under microscope.

woadwarrior01 · 2026-03-27T11:23:53 1774610633

There's this[1]. Model providers have a strong incentive to switch (a part of) their inference fleet to quantized models during peak loads. From a systems perspective, it's just another lever. Better to have slightly nerfed models than complete downtime.

[1]: https://marginlab.ai/trackers/claude-code/

nl · 2026-03-27T11:31:52 1774611112

So - as the charts say - no statistical difference?

Isn't this link am argument against the point you are making?

withinboredom · 2026-03-27T12:47:48 1774615668

The chart doesn't cover the 4.6 release which was in the end of December/early January time frame. So, it's hard to tell from existing data.

nl · 2026-03-29T06:42:23 1774766543

That isn't true. The whole point it to quickly pick up statistically significant variations quickly, and with the volume of tests they are doing there is plenty of data.

If you turn on the 95% CI bands you can see there is plenty of statistical significance.

withinboredom · 2026-03-29T09:04:24 1774775064

Unless you and I are looking at different web pages… it only goes back to February, not December or January.

coldtea · 2026-03-27T12:11:18 1774613478

Anybody with more than five years in the tech industry has seen this done in all domains time and again. What evidence you have AI is different, which is the extraordinary claim in this case...

seunosewa · 2026-03-27T17:53:36 1774634016

Or just change the reasoning levels.