Run Large Language Models On A Budget: Model Quantization And GGUF For Efficient GPU-Free Operation
Run Large Language Models On A Budget: Model Quantization And GGUF For Efficient GPU-Free Operation
Thu Jan 4, 11:19pm UTC
https://erickleppen.medium.com/run-large-language-models-on-a-budget-model-quantization-and-gguf-for-efficient-gpu-free-operation-9206d447508a