Search Results for author: Shaolun Zhang

Found 1 paper, 0 papers with code

Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics

no code implementations • 7 May 2024 • Hanlin Zhu, Baihe Huang, Shaolun Zhang, Michael Jordan, Jiantao Jiao, Yuandong Tian, Stuart Russell

Auto-regressive large language models (LLMs) show an impressive capacity to solve many complex reasoning tasks, yet struggle with some simple logical reasoning tasks such as inverse search: when trained on "A is B", an LLM fails to directly conclude "B is A" during inference, a failure known as the "reversal curse" (Berglund et al., 2023).

Logical Reasoning

