Hacker News | new | past | comments | ask | show | jobs | submit | anon373839's comments | login

Hm, I don't think this looks like Anthropic's design style. Anthropic has been doing a sort of Chobani-core + Corporate Memphis design system that I personally find a bit creepy. The website here, by contrast, just feels fresh and pleasant.

Agreed; that's a beautiful site. Apart from the minimalism, the main design style I notice is glassmorphism. Well, that and a very well chosen Monet to set the tone.

Well, neither one is "more important"; that framing is illogical. I think recent strides in high-performance small LLMs have shown that the tasks LLMs are useful for may not require the level of representational capacity that trillion-parameter models offer.

However: the labs releasing these high-intelligence-density models are getting them by first training much larger models and then distilling down. So the most interesting question to me is, how can we accelerate learning in small networks to avoid the necessity of training huge teacher networks?
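For anyone unfamiliar with how the distillation step works: the small student model is trained to match the large teacher's softened output distribution rather than just hard labels. A toy sketch of the standard Hinton-style objective (the temperature value and KL form here are illustrative, not any particular lab's recipe, and real training adds a cross-entropy term on ground-truth tokens):

```python
import math

def softmax(logits, temperature=1.0):
    # Temperature-scaled softmax; higher T softens the distribution,
    # exposing the teacher's "dark knowledge" about near-miss classes.
    exps = [math.exp(x / temperature) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    # KL(teacher || student) on softened distributions, scaled by T^2
    # so gradient magnitudes stay comparable across temperatures.
    p = softmax(teacher_logits, temperature)  # soft teacher targets
    q = softmax(student_logits, temperature)
    return temperature ** 2 * sum(
        pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0
    )
```

The loss is zero when the student exactly reproduces the teacher's logits and grows as the two distributions diverge.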


This is just blind belief. The model discussed in this thread already outperforms "well made" frontier LLMs of 12-18 months ago. If what you wrote were true, that wouldn't have been possible.

It's amazing that we can run models better than state of the art ~36 months ago on local consumer devices!

Absolutely. Plus, as these companies grow hungrier for revenue and more desperate to escape the commodity market they're in, they're only going to get more aggressive in their (ab)use of customer data.

I would recommend trying oMLX, which is much more performant and efficient than LM Studio. It has block-level KV context caching that makes long chats and agentic/tool calling scenarios MUCH faster.

That's not what consumes the most memory at scale. The KV caches are per-user.
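For a sense of scale, a back-of-the-envelope calculation (the config numbers below are a hypothetical 8B-class GQA model, not any specific one):

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, seq_len, bytes_per_elem=2):
    # Keys + values (factor of 2) stored for every layer, KV head,
    # and token position; bytes_per_elem=2 assumes fp16/bf16 cache.
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_elem

# Hypothetical config: 32 layers, 8 KV heads (GQA), head_dim 128.
per_user = kv_cache_bytes(32, 8, 128, 32_768)
print(per_user / 2**30)  # 4.0 GiB per user at a full 32k context
```

So while the weights are shared across all users, every concurrent long-context session adds gigabytes of cache on top, which is why the per-user memory dominates at scale.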

It was always possible to store it in the browser’s localStorage, so…


It wasn't even the local-ness so much. Even if they stored it remotely, that would be fine, like ChatGPT or Claude. But unlike the others, for a long time the only way to let it store history on their servers was to also allow them to train on it. I haven't checked whether that's changed.


That amount of RAM won’t be necessary. Gemma 4 and comparably sized Qwen 3.5 models are already better than the very best, biggest frontier models were just 12-18 months ago. Now in an 18-36GB footprint, depending on quantization.
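The footprint math is simple, roughly params x bits per weight, plus runtime overhead for KV cache and activations. A quick sketch with illustrative numbers (not the actual sizes of the models above):

```python
def model_footprint_gb(n_params, bits_per_weight):
    # Weight storage only; the KV cache and activations add more at runtime.
    return n_params * bits_per_weight / 8 / 1e9

# A hypothetical 30B-parameter model:
print(model_footprint_gb(30e9, 4))  # 15.0 GB at 4-bit quantization
print(model_footprint_gb(30e9, 8))  # 30.0 GB at 8-bit
```

That's how a model in this class fits on a 32-64GB consumer machine once quantized.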


That is Anthropic’s shtick to a tee.



