1 code implementation • 1 Feb 2024 • Zhixue Zhao, Boxuan Shan
Our method updates the token importance distribution in a recursive manner.
Decoder Text Generation