Instruction Tuning and Alignment
课程大纲
推荐阅读材料
- [论文]Training language models to follow instructions with human feedback
- [论文]LIMA: Less Is More for Alignment
- [论文]Multitask Prompted Training Enables Zero-Shot Task Generalization
- [论文]Self-Instruct: Aligning Language Models with Self-Generated Instructions
致谢
- 感谢 Runze Fan, Yixiu Liu, Zengzhi Wang 协助一起完成指令学习的课件。