There’s a version of local LLMs that lives in my head, based on how they were a couple of years ago: slow, clunky, dependent on expensive hardware to run anything worth using, and producing outputs that feel like a worse version of what you already have in your browser. That mental model made sense at the time, because local models really were like that for a while, and the barrier to entry was high enough that you could write them off unless you were a serious tinkerer.