Please don't slander the most open AI company in the world. Even open than some non-profit labs from universities. DeepSeek is famous for publishing everything. They might take a bit to publish source code but it's almost always there. And their papers are extremely pro-social to help the broader open AI community. This is why they struggle getting money because investors hate openness. And in China they struggle against the political and hiring power of the big tech companies.
The models from Chinese Big Tech are open weights only. (and allegedly benchmaxxed)
Ah, sorry. I suppose it is only fair to mention using the wrong metrics and getting the exact metric wrong myself. Today it is radar-observed wind speed and historically there were other less efficient means to test or at least estimate wind speed.
The original point still stands that Hurricanes are defined by only the one metric and other metrics have room to grow bigger as the category stays the same:
> The Saffir-Simpson Hurricane Wind Scale is a 1 to 5 rating based only on a hurricane's maximum sustained wind speed. This scale does not take into account other potentially deadly hazards such as storm surge, rainfall flooding, and tornadoes.
Is that true for evolution? If that math works, it seems that any one bacterium's mutation must become universal across the globe? Evolution works but not for every instance of a beneficial mutation. I wonder what the odds are for bacteria.
To me, the important thing isn't that I can run it, it's that I can pay someone else to run it. I'm finding Opus 4.7 seems to be weirdly broken compared to 4.6, it just doesn't understand my code, breaks it whenever I ask it to do anything.
Now, at the moment, i can still use 4.6 but eventually Anthropic are going to remove it, and when it's gone it will be gone forever. I'm planning on trying Deepseek v4, because even if it's not quite as good, I know that it will be available forever, I'll always be able to find someone to run it.
No! I had this machine, the skeleton is made of iron (the black colored parts), it completely rusted away for me in a couple of years. Very bad design decision on Rancilio.
if you use Kubernetes, shelling into an instance from k9s cli is pressing "s" with the instance highlighted in the TUI. it's great. haven't found a shrink-wrapped tool like that for ECS thats as good/easy as k9s for Kubernetes.
The most frustrating part about this is that evidently there's a database of notifications, so that notification that you dismissed and didn't see is totally a thing iOS could have like Android does but they just don't wanna do that for some reason.
I just overhauled a la marzocco sitting in my kitchen. People often inquire about whether it's worth buying an espresso machine for the home, or if it's a good investment as a coffee connoisseur.
My reply is always that it was the best money I've ever spent and the worst investment I've ever made. It's a lifestyle choice, and a questionable one at that. But one I'd make again every time given the opportunity.
I have not much followed the science of gut microbiome and psychology. Is this really going where this article is pointing? That we can tease out causation in foods and habits via gut microbiome towards behavior and psychology? Pretty rad.
doesn't it get tiring after a while? using the same (perceived) gotcha, over and over again, for three years now?
no one is ever going to release their training data because it's full of copyrighted stuff. everyone, even the hecking-wholesome safety-first Anthropic uses copyrighted data without permission to train their models. there you go.
Wow, yes I should have looked more closely at the image - which was obviously generated by Gemini Nano Banana (it usually does a better job with the text rendering). I checked the article text carefully but not the image...
And that's a great question. I'll answer it in two parts. First, I think the inner and outer are actually quite complimentary. The outer part is largely things that Claude can't provide - such as your own instructions, skills, and ways of validating things specific to your project. I have memory as part of the outer harness but Claude of course does have a memory system and it would not surprise me if they drastically improve it.
The other thing, mentioned in the article, is that harnesses (and specifically Claude) are shrinking: "As I was going through these iteration cycles, we also released Opus 4.6, which provided further motivation to reduce harness complexity. There was good reason to expect 4.6 would need less scaffolding than 4.5 did. ". (from https://www.anthropic.com/engineering/harness-design-long-ru...).
So my* core argument is you still need that "scaffolding", but it belongs in the outer harness.
"These findings reveal previously unrecognised effects of coffee on the microbiota–gut–brain axis, suggesting that microbiome profiles could potentially predict coffee consumption patterns", or, perhaps, just ask the patient?
Overhired has nothing to do with the talent pool and just means they hired more than they actually needed or wanted, if the talent pool is large enough then everyone can overhire
More like he wants to ban accelerator chip sales to China, which may be about “national security” or self preservation against a different model for AI development which also happens to be an existential threat to Anthropic. Maybe those alternatives are actually one and the same to him.