본문으로 건너뛰기

Juhyeon's Blog

❯

❯

❯

❯

Reflexion Language Agents with Verbal Reinforcement Learning

Reflexion - Language Agents with Verbal Reinforcement Learning

2026년 2월 11일2분 분량

by Moonlight

💡 Reflexion은 LLM 기반 에이전트가 가중치 업데이트 대신 언어적 피드백을 통해 학습하도록 돕는 새로운 프레임워크입니다.

📚 에이전트는 작업 피드백을 언어적으로 반영하고, 이 반성 텍스트를 에피소드 메모리 버퍼에 저장하여 후속 시도에서 더 나은 의사결정을 유도합니다.

🚀 이 접근 방식은 AlfWorld, HotPotQA, HumanEval 등 다양한 작업에서 기존 베이스라인 대비 상당한 성능 향상을 달성했으며, 특히 HumanEval 코딩 벤치마크에서 91%의 pass@1 정확도로 GPT-4의 80%를 뛰어넘는 SOTA를 기록했습니다.

Summary

Context 내부에 reflection 내용이나 eval 내용을 포함하여 여러 번 돌리는 pipeline

공유하기

그래프 뷰

Properties

Linked Bases: [[Memory.base]]
Reading Status: ☑️ Not Started

백링크

Architecture
Fundamentals
LLMs
Memory
Toward Efficient Agents - A Survey of Memory, Tool Learning, and Planning
self-consciousness
Vision

Created with Quartz v4.5.2 © 2026

GitHub
Blog