Behavior Policy VS Target Policy Behavior Policy Acting to get the following state.That is, taking action and getting the next state.Action for transition. Target Policy PDF required to create a Temporary Difference. On Policy(Policy) VS Off Policy Reinforcement Learning 2024.10.06
Monte-carlo VS Temporal Difference How can we get Q*?Monte-carlo https://youtu.be/bCifW0SENGs?si=-Brm4lwVPN4emAG2Temporal Difference1 Step Temporal DifferenceIncremental Monte Carlo UpdatesTime DifferenceTemporal Difference ErrorTemporal Difference Target https://youtu.be/vfLrBPYwuFA?si=zTnAFh5bjHerEKX-SARSA https://youtu.be/vfLrBPYwuFA?si=zTnAFh5bjHerEKX-Monte-carlo VS Temporal Difference https://youtu.be/STcbD5VhP3Y?si=1mt7.. Reinforcement Learning 2024.10.06
Quaternion Quaternion A set of numbers that use Complex Numbers. H or by H .If we express the three-dimensional vector in terms of the basis , we obtain .Quaternions : with the scalar value added to it.It can be expressed as .∴Quaternions can be said to be a vector based on ." data-ke-type="html">HTML 삽입미리보기할 수 없는 소스Quaternions multiplication rules .. Math 2024.09.29
Axis-Angle Rotation(Rodrigues Rotation) Axis-Angle Rotation(Rodrigues Rotation) It is used as a 3d rotation method that can improve the disadvantages of Euler Angle. 3D rotation is more flexible than Euler Angle. : the axis of rotation, orange line- : a point to rotate- 𝜃 : an angle to rotate- : the final point of rotation- : a point of the world space- : the center of a rotating plane" data-ke-type="html">HTML 삽입미리보기할 수 없는 소스 - .. Autonomous Vehicle/Theory 2024.09.26
Tensor product(Outer product) VS Dot Product Tensor product(Outer product) The outer product (denoted by ⊗) results in a matrix (or tensor) rather than a scalar. Let’s say you have two vectors: a → = a 1 a 2 .. Math 2024.09.26