Reinforcement Learning

Behavior Policy VS Target Policy

Naranjito 2024. 10. 6. 23:16
  • Behavior Policy

 

Acting to get the following state.

That is, taking action and getting the next state.

Action for transition.

 

  • Target Policy

 

PDF required to create a Temporary Difference.

 


  • On Policy(Policy) VS Off Policy