In your code, you used (dt+A).softmax. Why not use mamba's default (dt+A).softplus instead? <img width="828" height="704" alt="Image" src="https://github.com/user-attachments/assets/0b5f82ba-3adf-4e83-b8e4-13ed9af7dbdf" />
In your code, you used (dt+A).softmax. Why not use mamba's default (dt+A).softplus instead?