Blogs · Draft Notes · LLM · Fine-tuning

Fine-tuning in LLM

Draft notes on LoRA, prompt tuning, adapter tuning, RLHF, and DPO.

2024.02.19 · 1 min read · by Zhenlin Wang

LoRA Soft Prompt Tuning Prefix Tuning Adapter RLHF DPO