Juhyeon's Blog

Tag: Reasoning

7 items

  • June 4, 2026

    Dyna-Think - Synergizing Reasoning, Acting, and World Model Simulation in AI Agents

    • LLM-Agent
    • World-Model
    • Reasoning
    • ReAct
    • Dyna
    • Imitation-Learning
    • GUI-Agent
    • OSWorld
    • Application
  • June 4, 2026

    MEM1 - Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents

    • Paper
    • Agent
    • LLM
    • RL
    • Memory
    • LongHorizon
    • MEM1
    • PPO
    • Reasoning
    • Application
  • June 4, 2026

    Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models

    • paper
    • Reasoning
    • Faithfulness
    • CoT
    • LRM
    • CounterfactualIntervention
    • Causality
    • Qwen
    • DeepSeek
  • June 4, 2026

    R-Zero - Self-Evolving Reasoning LLM from Zero Data

    • paper
    • Self-Evolving
    • Reasoning
    • Self-Play
    • RLVR
    • Curriculum
    • ICLR2026
    • ZPD
  • June 4, 2026

    ReAct - Synergizing Reasoning and Acting in Language Models

    • paper
    • Reasoning
    • Acting
    • LLM_Agent
    • Prompting
    • CoT
    • Tool_Use
    • ICLR
  • June 4, 2026

    Think Deep, Not Just Long - Measuring LLM Reasoning Effort via Deep-Thinking Tokens

    • paper
    • Reasoning
    • DeepThinking
    • DTR
    • InferenceScaling
    • CoT
    • Overthinking
    • LayerwisePrediction
  • June 4, 2026

    Thinking with Nothinking Calibration - A New In-Context Learning Paradigm in Reasoning Large Language Models

    • paper
    • Reasoning
    • ThinkingMode
    • ICL
    • Qwen3
    • DeepSeekR1
    • Calibration
    • ModeConsistency
    • RLLM

Created with Quartz v4.5.2 © 2026
