About Reinforcement Learning

First of all, thanks for your open-source code of this wonderful work.
I also have some questions about your code of reinforcement learning. I found that in your version of reinforcement learning, you use the training dataset for policy gradient to fine-tuning parameters. 
But actually, in my opinion, a user simulator should be used as the environment for updating the parameters in RL setup. Can you tell me the reason?
Thank you very much !


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About Reinforcement Learning #21

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

About Reinforcement Learning #21

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions