I’ve been running local LLMs for a while now on all kinds of devices. I have Ollama and Open WebUI on my home server, with various models running on my AMD Radeon RX 7900 XTX. It’s always been functional, but never quite good enough to replace cloud-based coding assistants for real work. The models are typically too dumb to handle anything beyond basic autocomplete, so I’ve played around with Claude Code a fair amount instead.