attogram

"Attention Is All You Need" - I've always wondered if the authors of that paper used such a casual and catchy title because they knew it would be groundbreaking and massively cited in the future....

show comments
JSR_FDED

Any way to read this without making an account?

show comments
mrtesthah

Do we know if any of these techniques are actually used in the so-called "frontier" models?

show comments