Why self-hosting LLMs fails in production