For a long time, running an AI model locally felt like a gimmick, rather than something actually useful. You could generate a paragraph of text, edit or generate an image if you were patient, all while putting up with subpar results compared to the cloud-based behemoths operated by the likes of OpenAI, Google, and Anthropic. They did everything better, faster, and with fewer compromises, and if you wanted something that actually worked, you had to send your data off to someone else’s servers… all while accepting the trade-offs.