Toggle navigation
233博客
首页
登录
2025-12-09T03:32:17Z
windows
(bsz, seq_len, num_q_heads, head_dim) transpose(1, 2) 成了 [bsz, n_q_head, seq_len, head_dim]