¡Hola, Mundo!


action value function (1 post)

Markov Decision Process, State value function, Action value function, Optimal policy, Bellman equation

Markov Decision Process
- Decision: a sequence of actions.
- S1: it absorbs S0 and a0, and it determines a1.
- a1: it is given by S1. Once S1 is given, a1 is determined regardless of S0 and a0.
https://youtu.be/DbbcaspZATg?si=KgUq5CdJKzHj9QOJ
Policy: the probability of which action to take at time t, given the state at time t. That is, the distribution over actions in a particular state. The policy determines the action. ..

Reinforcement Learning 2024.09.22
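
The excerpt above describes a policy as a distribution over actions in a given state, with the action depending only on the current state and not on earlier states or actions. A minimal sketch of that idea follows. The state names, action names, probabilities, and the helper `sample_action` are all hypothetical and chosen only for illustration; they do not come from the original post.

```python
import random

# Hypothetical toy policy: for each state, a probability distribution
# over actions. Names and numbers are illustrative assumptions.
POLICY = {
    "s0": {"left": 0.2, "right": 0.8},
    "s1": {"left": 0.5, "right": 0.5},
}


def sample_action(policy, state, rng):
    """Sample an action from policy[state].

    Only the current state is used; no earlier state or action is
    passed in, which mirrors the Markov property in the excerpt.
    """
    actions = list(policy[state])
    weights = [policy[state][a] for a in actions]
    return rng.choices(actions, weights=weights, k=1)[0]


rng = random.Random(0)  # seeded so repeated runs give the same draw
action = sample_action(POLICY, "s1", rng)
print(action)
```

Because `sample_action` takes only the current state, the history (S0, a0) cannot influence the draw for a1, which is the point the excerpt makes about S1.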