State-space models can learn in-context by gradient descent

5 points | by dsalaj 2 days ago

2 comments