Draft notes on LoRA, prompt tuning, adapter tuning, RLHF, and DPO.
LoRA Soft Prompt Tuning Prefix Tuning Adapter RLHF DPO