I have been using Ollama to run local LLMs on my Mac, and it has been working just fine. However, my Mac’s overall performance took a hit because local LLMs are resource-hungry. I have a MacBook Air M5 with 16GB of RAM. It’s probably not the most powerful machine for this kind of workload, but it’s been good enough to run models with fewer than 7 billion parameters.