Check out this local AI model manager similar to Ollama, but better.
https://www.kdnuggets.com/run-local-llms-with-cortex