Pelayo Arbués

Recent Notes

I am cooking again
Mar 25, 2026
The 10x Manager
Feb 17, 2026
2025 Reading Wrapped
Jan 08, 2026

See 99 more →

❯

Literature Notes

❯

❯

3. Fine Tune LLaMA 13B With QLoRA on Amazon SageMaker

3. Fine-Tune LLaMA 13B With QLoRA on Amazon SageMaker

Apr 16, 20251 min read

articles
literature-note

Metadata

Author: Philipp Schmid
Full Title: 3. Fine-Tune LLaMA 13B With QLoRA on Amazon SageMaker
URL: https://www.philschmid.de/sagemaker-llama2-qlora

Highlights

Parameter Efficient Fine-tuning, is a new open-source library from Hugging Face to enable efficient adaptation of pre-trained language models (PLMs) to various downstream applications without fine-tuning all the model’s parameters (View Highlight)
QLoRA is a new technique to reduce the memory footprint of large language models during finetuning, without sacrificing performance. The TL;DR; of how QLoRA works is: • Quantize the pretrained model to 4 bits and freezing it. • Attach small, trainable adapter layers. (LoRA) • Finetune only the adapter layers, while using the frozen quantized model for context. (View Highlight)

Graph View

Metadata
Highlights

Now Reading

Rightmove Launches Next Phase of AI-powered Property Search
Mar 25, 2026

See 1712 more →

Created with Quartz, © 2026

Bluesky
Linkedin
Mastodon
Twitter
Unsplash
GitHub
RSS