Lightning AI Studios: Never set up a local environment again →

← Back to glossary

QLoRA

QLoRA is a highly effective method for fine-tuning, which significantly minimizes memory requirements, enabling the fine-tuning of a 65 billion parameter model on a single 48GB GPU without compromising the performance of the 16-bit fine-tuning tasks.

Related content

The NeurIPS 2023 LLM Efficiency Challenge Starter Guide
Finetuning LLMs with LoRA and QLoRA: Insights from Hundreds of Experiments