FlashAttention-T: Towards Tensorized Attention

52 points | by matt_d 3 hours ago

8 comments