Hierarchical Autoregressive Modeling for Memory-Efficient Language Generation

46 points | by PaulHoule 3 days ago

3 comments