For the past year, I’ve been running my own local LLM setup, hoping it would make my work faster and more efficient. And in many ways, it did; but not for the reasons I expected. I went in thinking better hardware would unlock better results. More VRAM, faster inference, bigger models.