https://erickleppen.medium.com/run-large-language-models-on-a-budget-model-quantization-and-gguf-for-efficient-gpu-free-operation-9206d447508a