737615
Hylke

@hylkedonker #737615

Dutch machine learning enthousiast 🤖 with a love for programming.
10 Follower 63 Following
Looking at the nightly changelogs, release of mojo 24.6, which is supposed to ship with gpu support, is coming any day now.
State space models can be used as drop in replacements for attention, but with more favourable sequence length scaling. This video may well be the most lucid intro to state space models I've come across:
https://youtu.be/QJHA-PY8zDc?si=J5kGW87Yg0SAFdpR