Xingtai Lv, Ning Ding, Kaiyan Zhang, Ermo Hua, Ganqu Cui, Bowen Zhou. Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention. EMNLP 2024
Release time:2025-04-09
Hits:
- Translation or Not:
- no
- Pre One:Xiangyu Hong, Che Jiang, Biqing Qi, Fandong Meng, Mo Yu, Bowen Zhou, Jie Zhou. On the token distance modeling ability of higher RoPE attention dimension. EMNLP 2024
- Next One:Yulin Chen, Ning Ding, Hai-Tao Zheng, Zhiyuan Liu, Maosong Sun, Bowen Zhou. Empowering private tutoring by chaining large language models. CIKM 2024