본문으로 건너뛰기

Juhyeon's Blog

❯

Reinforcement Learning

❯

❯

Reward

2026년 4월 13일1분 분량

Summary

강화학습 맥락에서, 특정 state에서 특정 action에 따른 즉각적인 scalar feedback.
env에 depend

공유하기

그래프 뷰

Properties

No properties

백링크

The Student's Guide to Cognitive NeuroScience
Memory
Architecture
Benchmarks
LLMs
Fundamentals
self-consciousness
Theory of mind
Vision

Created with Quartz v4.5.2 © 2026

GitHub
Blog