- Behavior Policy
The policy that selects the action actually taken in the environment.
That is, it takes an action and obtains the next state.
It supplies the action for each transition.
- Target Policy
The probability distribution (PDF) over actions required to form the Temporal Difference (TD) target.

- On-Policy vs. Off-Policy

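On-policy: the policy that acts to get the next state (the behavior policy) is also the policy used to form the Temporal Difference (the target policy).
Off-policy: the behavior policy that takes the action for the transition differs from the target policy used to form the Temporal Difference.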
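
The distinction between the two policies can be sketched in a few lines of Python. This is only an illustration under assumptions not stated in the notes: a tabular action-value row `q_next` for the next state, an epsilon-greedy behavior policy, and a greedy target policy for the off-policy case. All function names here are hypothetical. The on-policy target corresponds to a SARSA-style update and the off-policy target to a Q-learning-style update.

```python
import random


def greedy(q_row):
    """Greedy policy: index of the highest-valued action."""
    return max(range(len(q_row)), key=lambda a: q_row[a])


def epsilon_greedy(q_row, epsilon, rng):
    """Behavior policy (assumed): picks the action that is actually executed."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))  # explore: uniform random action
    return greedy(q_row)                  # exploit: best known action


def td_target_on_policy(reward, gamma, q_next_row, next_action):
    """SARSA-style target: next_action comes from the behavior policy itself,
    so behavior policy and target policy are the same."""
    return reward + gamma * q_next_row[next_action]


def td_target_off_policy(reward, gamma, q_next_row):
    """Q-learning-style target: evaluates the greedy target policy,
    regardless of which action the behavior policy will take next."""
    return reward + gamma * q_next_row[greedy(q_next_row)]


# Toy values for one transition (made up for illustration).
q_next = [0.0, 2.0, 1.0]   # action values in the next state
rng = random.Random(0)

a_next = epsilon_greedy(q_next, epsilon=0.5, rng=rng)        # behavior policy acts
on_target = td_target_on_policy(1.0, 0.9, q_next, a_next)    # target follows that action
off_target = td_target_off_policy(1.0, 0.9, q_next)          # target follows the greedy policy
```

When the behavior policy happens to explore, `on_target` and `off_target` differ. That gap is exactly what separates on-policy from off-policy learning.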