Context Is Software, Weights Are Hardware

16 points | 3 comments | 3 days ago
maxaravind

Author here.

I spent last weekend thinking about continual learning. Many people believe we can solve long-term memory and learning in LLMs simply by extending the context length indefinitely. I explore a different perspective that challenges this assumption.

Let me know what you think.
