Hey!
First of all, thank you for this library!
I would like to take your actors and critics and implement an RNN-enhanced TD3 algorithm, as described here: https://arxiv.org/pdf/1710.06537.pdf.
I have looked through the source code, and it seems that your implementations of the RL algorithms do not support recurrent networks. Have you considered adding this? Is there anything you can recommend that might help me make a seamless transition to a memory-based approach?
Cheers!
Edit: Would the "history len" parameter in the code be what I am looking for?