There are open source AI models that can be downloaded and self-hosted by anyone, such as Ollama, an open source project that allows models to run. What is open source AI? In simple terms, it refers to AI models that are publicly available, allowing users to modify and deploy them without restrictions. The idea is to buy or rent an infrastructure and install the model you want to use. There are several models to choose from, such as Llama 3.3, DeepSeek-R1, Phi-4, Mistral, and Gemma 2.
In this case, we have to pay for the infrastructure and the operation fee and within that, we use as many as we want. It's a bit like renting a car: you can drive it as much as you like each month, but you have to pay the rental fee even if you don't go anywhere (the analogy is a bit flawed in that you have to fill up the car, but let's ignore that for now). This solution is worthwhile if you use the LLM on a large scale.
There are also vendors - such as together.ai - that specialise in running open source language models and pricing them on an API basis. This is typically cheaper than closed source commercial APIs and is the solution of choice for low usage.