Running an LLM locally is a pain you probably don’t want to deal with unless you have a real use case. I tried self-hosting OpenAI’s Whisper model on my laptop, and while the tool itself worked well, getting it up and running took a long time. You usually have to stitch together multiple tools just to get started, downloading model files from different sources, setting up Python environments, installing frameworks like PyTorch or Transformers, configuring GPU drivers, and then fixing whatever breaks next.