Reinforcement Learning
Behavior Policy VS Target Policy
Naranjito
2024. 10. 6. 23:16
- Behavior Policy
Acting to get the following state.
That is, taking action and getting the next state.
Action for transition.
- Target Policy
PDF required to create a Temporary Difference.
- On Policy(Policy) VS Off Policy