GPTQ: Archaic Quantization Succeeded by EXL2

GPTQ is the original widely used quantization method for text-generation models and has been around since LLaMA-1. It is still used very broadly, mainly because it is the format people are most familiar with.