LLM-driven Instruction Following:

Progresses & Concerns

(Dec. 6, 2023; EMNLP'23 Tutorial)

Contact: wenpeng@psu.edu

Penn State

USC

SJTU

USC

LMU Munich

Abstract:

The progress of classic NLP is primarily driven by machine learning that optimizes a system on a large-scale set of task-specific labeled examples. This learning paradigm limits the ability of machines to have the same capabilities as humans in handling new tasks since humans can often solve unseen tasks with a couple of examples accompanied by task instructions. In addition, we may not have a chance to prepare task-specific examples of large-volume for new tasks because we cannot foresee what task needs to be addressed next and how complex to annotate for it. Therefore, task instructions act as a novel and promising resource for supervision. This tutorial presents a diverse thread of instruction-driven NLP studies that try to answer the following questions: (i) What is task instruction? (ii) How is the process of creating datasets and evaluating systems conducted? (iii) How to encode task instructions? (iv) When and why do some instructions work better? (v) What concerns remain in LLM-driven instruction following? We will discuss several lines of frontier research that tackle those challenges and will conclude the tutorial by outlining directions for further investigation.

Hinrich Schütze

Introduction (Slides)

Why does "learning from instructions" matter?
What are "instructions"? (LLM-oriented instructions vs. Human-oriented instructions)
Representative instruction-following researches

Wenpeng Yin

Datasets & Evaluations for Instruction Following (Slides)

Human-generated datasets
LLM-generated datasets
Automatic evaluations & Human evaluations

Qinyuan Ye

Xiang Ren

Methodology---Prompting (Slides)

Human-inspired prompting techniques (Reasoning, Augmentation, Verification, Feedback and Refinement)
LLM-powered Automatic Prompt Engineering

Pengfei Liu

Methodology---Alignment in LLMs (Slides)

Human values manifest in diverse forms
Alignment Techniques in different LLM stages (Pretraining, Supervised Fine-Tuning, Reward-Based Tuning, Testing Time, Deployment, Tool Use)

Wenpeng Yin

Challenges & Issues in Instruction Following (Slides)

Inverse scaling law of LLMs in dealing with negation
LLMs struggle to grasp instructions like humans do
Adversarial instruction attacks

Hinrich Schütze

Conclusion & Future Directions (Slides)

Instruction-following generalist in the physical world
Autonomously derive instructions through learning and observation
Planning complex problems and directing sub-AIs with instructions

Overall slides Recording

Page updated

Google Sites

Report abuse