Has anyone ever done a proper security audit of the VLC builds people actually download? I don't trust them, and the fact that the releases on GitHub don't include binaries makes me trust them even less. Nobody is compiling VLC from source, and they don't provide any sort of provenance from a GitHub Actions pipeline.
Look at the list of supported formats. It includes so many parsers, mostly written in C, which means there are probably a few dozen ways to exploit the player.
Thinking time is not the issue. The issue is that Claude does not actually complete tasks. I don't care if it takes longer to think; what I care about is getting partial implementations scattered throughout my codebase while Claude pretends it finished everything. You REALLY need to fix this. It's atrocious.
Do you guys realize that everyone is switching to Codex because Claude Code is practically unusable now, even on a Max subscription? You ask it to do tasks, and it does 1/10th of them. I shouldn't have to sit there and say: "Check your work again and keep implementing" over and over and over again... Such a garbage experience.
Does Anthropic actually care? Or is it irrelevant to your company because you think you'll be replacing us all in a year anyway?
Or, ask it to make a plan, and it makes a good plan! It explicitly notes how validation is to take place at each stage!
And then it does every stage without running any of the validation. It's your agent's plan; it should probably be generated in a way your own agent can actually follow.
Whenever Anthropic has an opportunity to do the right thing, they go the opposite way. For example, when their source leaked, instead of open-sourcing it (something people have been asking for for years, so they could contribute the fixes Anthropic doesn't care to make itself), they tightened the noose further.
If it isn't obvious by now, this problem is only going to get worse. The only reason subscriptions still exist is that they're waiting to pull off the biggest bait-and-switch in history. Don't get sunk into this ecosystem, or you're in for a world of pain down the road. As has always been the case, competition and open source are our only hope.
They're just removing it from public access and selling it to big money instead: think large advertising companies, government agencies, Coca-Cola, Hollywood, etc. The scary part is that now that it's been removed from public access, it's going to be harder to keep a pulse on what is real and what is fake. We can't trust any video, audio, or text content now.
If a model was trained on <|begin_text|> <|end_text|> delimiters and you change the tokens passed in to <|start_text|> <|end_text|>, it loses several 'IQ points', if it can even answer at all anymore.
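To make this concrete, here's a toy sketch (a made-up five-entry vocab, not any real tokenizer) of why the swapped delimiter hurts: the trained delimiter maps to a single special-token id, while the unseen one gets shredded into ordinary subword pieces the model never saw in that position.

```python
# Hypothetical toy vocab: one trained special token plus a few subword pieces.
VOCAB = {"<|begin_text|>": 0, "<|": 1, "start": 2, "_text": 3, "|>": 4, "hello": 5}

def tokenize(text: str) -> list[int]:
    """Greedy longest-match tokenization over the toy vocab."""
    ids, i = [], 0
    while i < len(text):
        for j in range(len(text), i, -1):  # try the longest piece first
            if text[i:j] in VOCAB:
                ids.append(VOCAB[text[i:j]])
                i = j
                break
        else:
            i += 1  # no piece matches; skip the character
    return ids

print(tokenize("<|begin_text|>hello"))  # trained delimiter  -> [0, 5]
print(tokenize("<|start_text|>hello"))  # swapped delimiter  -> [1, 2, 3, 4, 5]
```

The second sequence is what the model actually receives, and it has never been trained to treat that fragment soup as a text boundary, hence the lost 'IQ points'.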
Synthetic data is fine. Synthetic data generated from a task description, covering very similar questions, is usually fine too. But once the shape of what you're training on gets too close to the actual holdout questions, you're getting an uplift that won't carry over to genuinely unseen tasks.
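One common heuristic for catching this is n-gram overlap between training examples and the holdout set; a minimal sketch (the 8-token threshold is an illustrative choice, not a standard):

```python
def ngrams(text: str, n: int = 8) -> set:
    """All word-level n-grams of a lowercased text."""
    toks = text.lower().split()
    return {tuple(toks[i:i + n]) for i in range(len(toks) - n + 1)}

def is_contaminated(train_example: str, holdout: list[str], n: int = 8) -> bool:
    """Flag a training example that shares any n-gram with a holdout item."""
    train = ngrams(train_example, n)
    return any(train & ngrams(q, n) for q in holdout)

holdout = ["what is the capital of france and why did it become the capital"]
paraphrase = "name the capital city of france"                        # different shape
near_copy = "what is the capital of france and why did it become so"  # too close

print(is_contaminated(paraphrase, holdout))  # False
print(is_contaminated(near_copy, holdout))   # True
```

It's a blunt instrument (it misses paraphrases, which is exactly the "shape" problem the parent describes), but it at least catches near-verbatim leakage.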
Does it matter, though? If it accomplishes the task, it accomplishes the task. Everyone uses a harness anyway, so finding the best harness is what's relevant. Perhaps this also hints at something bigger: we're wasting our time focusing on the model when we could be focusing on the harness.
I feel like they should be legally required to provide scanning infrastructure for this sort of thing. The potential economic damage can be catastrophic. I don't think this is the end of the litellm story either, given that 47k+ people were infected.
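Even without registry-side scanning, one building block on the consumer side is refusing to run anything whose digest doesn't match a pinned value. A minimal sketch (the file name and digest in the usage comment are placeholders, not real litellm artifacts):

```python
import hashlib

def verify_artifact(path: str, expected_sha256: str) -> bool:
    """Stream a downloaded artifact and compare its SHA-256 to a pinned digest."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(8192), b""):
            h.update(chunk)
    return h.hexdigest() == expected_sha256

# Hypothetical usage: the expected digest should come from a lockfile or a
# signed release manifest, never from the same server as the artifact itself.
# verify_artifact("some-package-1.0.tar.gz", "ab12...")  # placeholders
```

pip's `--require-hashes` mode does essentially this for Python dependencies, which would have limited the blast radius of a poisoned release.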