본문으로 건너뛰기
Juhyeon's Blog
Search
검색
다크 모드
라이트 모드
탐색기
태그: Reasoning
7건의 항목
2026년 6월 04일
Dyna-Think - Synergizing Reasoning Acting and World Model Simulation in AI Agents
LLM-Agent
World-Model
Reasoning
ReAct
Dyna
Imitation-Learning
GUI-Agent
OSWorld
Application
2026년 6월 04일
MEM1 - Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents
Paper
Agent
LLM
RL
Memory
LongHorizon
MEM1
PPO
Reasoning
Application
2026년 6월 04일
Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models
paper
Reasoning
Faithfulness
CoT
LRM
CounterfactualIntervention
Causality
Qwen
DeepSeek
2026년 6월 04일
R-Zero - Self-Evolving Reasoning LLM from Zero Data
paper
Self-Evolving
Reasoning
Self-Play
RLVR
Curriculum
ICLR2026
ZPD
2026년 6월 04일
ReAct - Synergizing Reasoning and Acting in Language Models
paper
Reasoning
Acting
LLM_Agent
Prompting
CoT
Tool_Use
ICLR
2026년 6월 04일
Think Deep, Not Just Long - Measuring LLM Reasoning Effort via Deep-Thinking Tokens
paper
Reasoning
DeepThinking
DTR
InferenceScaling
CoT
Overthinking
LayerwisePrediction
2026년 6월 04일
Thinking with Nothinking Calibration - A New In-Context Learning Paradigm in Reasoning Large Language Models
paper
Reasoning
ThinkingMode
ICL
Qwen3
DeepSeekR1
Calibration
ModeConsistency
RLLM