Hi Xi,

Thanks for posting this! That's very nice to see. I'm currently traveling without my laptop (actually in Yunnan, China!), so I'll be able to take a look at this for real starting the 26th, as right now I'm just on my cellphone using lore+mutt.

One thing I wanted to ask, though, is - doesn't LoongArch have 32 8-byte registers? Shouldn't that be enough to implement ChaCha without spilling and without using LSX?

Jason