GPTQ or bitsandbytes: Which Quantization Method to Use for LLMs — Examples with Llama 2

GPTQ or bitsandbytes: Which Quantization Method to Use for LLMs — Examples with Llama 2

a year ago
Anonymous $pUsIN4hzN9

GPTQ or bitsandbytes: Which Quantization Method to Use for LLMs — Examples with Llama 2

Aug 25, 2023, 5:20am UTC
https://towardsdatascience.com/gptq-or-bitsandbytes-which-quantization-method-to-use-for-llms-examples-with-llama-2-f79bc03046dc