본문으로 건너뛰기
Juhyeon's Blog
Search
검색
다크 모드
라이트 모드
탐색기
태그: LLM
43건의 항목
2026년 6월 04일
A Comprehensive Survey of Self-Evolving AI Agents - A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
paper
Survey
SelfEvolvingAgents
AgentOptimization
LifelongLearning
MultiAgent
Memory
Tools
PromptOptimization
LLM
2026년 6월 04일
AgentFold - Long-Horizon Web Agents with Proactive Context Management
paper
agent
web-agent
long-horizon
context-management
memory
LLM
SFT
MoE
BrowseComp
AgentFold
application
2026년 6월 04일
AgentTuning - Enabling Generalized Agentabilities for LLMS
Agent
InstructionTuning
LLM
AgentLM
Llama2
SFT
Generalization
Training
2026년 6월 04일
Annotation-Efficient Universal Honesty Alignment for LLMs
Paper
LLM
HonestyAlignment
Calibration
SelfConsistency
AnnotationEfficiency
Training
ICLR2026
Safety
Hallucination
2026년 6월 04일
Automatic Prompt Optimization with Gradient Descent and Beam Search
paper
prompt-optimization
textual-gradient
beam-search
bandit-algorithm
AutoML
LLM
EMNLP
2026년 6월 04일
Belief in the Machine - Investigating Epistemological Blind Spots of Language Models
LLM
Epistemology
Belief
Knowledge
KaBLE
Benchmark
TheoryOfMind
Factivity
FirstPerson
Self-Consciousness
Evaluation
Theory
2026년 6월 04일
Benchmark Self-Evolving - A Multi-Agent Framework for Dynamic LLM Evaluation
Paper
Benchmark
Evaluation
LLM
MultiAgent
DynamicEvaluation
DataContamination
2026년 6월 04일
Berkeley Function Calling Leaderboard (BFCL)
Benchmark
FunctionCalling
ToolUse
LLM
AST
Agent
API
Evaluation
UCBerkeley
Gorilla
2026년 6월 04일
Calibrating Verbal Uncertainty as a Linear Feature to Reduce Hallucinations
paper
LLM
hallucination
calibration
representation-engineering
verbal-uncertainty
inference-time-intervention
linear-feature
theory
2026년 6월 04일
Can LLMs Lie - Investigation beyond Hallucination
LLM
Deception
Hallucination
Safety
Interpretability
Steering
Alignment
Theory
2026년 6월 04일
Cognitive Dissonance - Why Do Language Model Outputs Disagree with Internal Representations of Truthfulness
paper/theory
LLM
interpretability
truthfulness
probing
calibration
deception
safety
EMNLP2024
2026년 6월 04일
Concept Incongruence - An Exploration of Time and Death in Role Playing
paper
LLM
role-play
concept-incongruence
temporal-reasoning
probing
hallucination
specification
Self-Preservation
2026년 6월 04일
Do Retrieval Augmented Language Models Know When They Dont Know
RAG
Calibration
Uncertainty
LLM
Self-Knowledge
Refusal
Over-Refusal
Abstention
TrustworthyAI
AAAI2026
2026년 6월 04일
Enhancing LLM Reliability via Explicit Knowledge Boundary Modeling
Training
LLM
Reliability
Calibration
KnowledgeBoundary
SelfAwareness
Hallucination
DST
Alignment
2026년 6월 04일
GraphReader - Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models
LLM
Agent
LongContext
GraphReasoning
MultiHopQA
RAG
EMNLP2024
Application
2026년 6월 04일
How Far Are We From AGI - Are LLMs All We Need
paper
AGI
LLM
survey
capabilities
reasoning
perception
memory
metacognition
alignment
embodied-AI
roadmap
2026년 6월 04일
If an LLM Were a Character Would It Know Its Own Story - Evaluating Lifelong Learning in LLMs
paper
LLM
lifelong-learning
benchmark
evaluation
memory
role-play
catastrophic-forgetting
self-awareness
narrative
2026년 6월 04일
Incomplete Tasks Induce Shutdown Resistance in Some Frontier LLMs
paper
ai-safety
corrigibility
shutdown-resistance
RLVR
instruction-hierarchy
self-preservation
Alignment
LLM
Instrumental-Convergence
2026년 6월 04일
Is Your Code Generated by ChatGPT Really Correct! Rigorous Evaluation of Large Language Models for Code Generation
paper
LLM
code-generation
benchmark
evaluation
EvalPlus
HumanEval
MBPP
mutation-testing
differential-testing
NeurIPS2023
2026년 6월 04일
Know Your Limits - A Survey of Abstention in Large Language Models
Survey
LLM
Abstention
SelectivePrediction
Uncertainty
Calibration
Safety
Alignment
RLHF
Hallucination
2026년 6월 04일
Knowing What LLMs DO NOT Know - A Simple Yet Effective Self-Detection Method
LLM
Hallucination
SelfDetection
Uncertainty
Metacognition
NAACL2024
SelfKnowledge
Theory
2026년 6월 04일
LACIE - Listener-Aware Finetuning for Confidence Calibration in Large Language Models
LLM
Calibration
Alignment
DPO
Pragmatics
NeurIPS2024
Finetuning
Honesty
2026년 6월 04일
LLM Theory of Mind and Alignment - Opportunities and Risks
Paper
TheoryOfMind
AIAlignment
AISafety
LLM
HCI
SocialCognition
PositionPaper
CHI2024
2026년 6월 04일
Logic-RL - Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
paper
reasoning
reinforcement-learning
LLM
emergent-behavior
logic-puzzles
2026년 6월 04일
LongBench - A Bilingual, Multitask Benchmark for Long Context Understanding
Benchmark
LongContext
Bilingual
DocumentUnderstanding
Evaluation
QA
Summarization
CodeGeneration
LLM
2026년 6월 04일
MEM1 - Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents
Paper
Agent
LLM
RL
Memory
LongHorizon
MEM1
PPO
Reasoning
Application
2026년 6월 04일
MemAgent - Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
paper/application
long-context
memory-agent
reinforcement-learning
DAPO
LLM
agent
2026년 6월 04일
Motivation in Large Language Models
paper
LLM
motivation
psychology
behavioral-alignment
loss-aversion
zombie-framework
self-determination-theory
prompt-engineering
2026년 6월 04일
Reasoning Models Struggle to Control their Chains of Thought
paper
Safety
CoT
Monitoring
Controllability
Alignment
ReasoningModels
LLM
2026년 6월 04일
Social-R1 - Towards Human-like Social Reasoning in LLMs
paper
ToM
SocialReasoning
RL
TrajectoryAlignment
SIP
LLM
ReasoningParasitism
2026년 6월 04일
Surgical Cheap and Flexible - Mitigating False Refusal in Language Models via Single Vector Ablation
LLM
Safety
Alignment
FalseRefusal
ActivationEngineering
Interpretability
VectorAblation
ICLR2025
2026년 6월 04일
The Geometry of Truth - Emergent Linear Structure in LLM Representations of True and False Statements
interpretability
LLM
probing
truth-representation
linear-representation-hypothesis
causal-intervention
alignment
theory
2026년 6월 04일
Thinking Faithful and Stable - Mitigating Hallucinations in LLMs via Internal Consistency
LLM
hallucination
faithfulness
self-consistency
calibration
RLHF
reasoning
uncertainty
theory
arxiv-2511-15921
2026년 6월 04일
Towards Ontology-Enhanced Representation Learning for Large Language Models
paper
LLM
Ontology
RepresentationLearning
ContrastiveLearning
KnowledgeInjection
Biomedical
Training
2026년 6월 04일
Training Compute-Optimal Large Language Models
paper
scaling_law
compute_optimal
chinchilla
LLM
DeepMind
NeurIPS
2026년 6월 04일
Training language models to follow instructions with human feedback - InstructGPT
paper
RLHF
alignment
LLM
InstructGPT
PPO
reward-model
OpenAI
NeurIPS2022
human-feedback
fine-tuning
2026년 6월 04일
Uncertainty-Based Abstention in LLMs Improves Safety
paper
LLM
uncertainty
abstention
safety
hallucination
calibration
selective-prediction
trustworthy-AI
metacognition
training
2026년 6월 04일
Weak-to-Strong Generalization - Eliciting Strong Capabilities With Weak Supervision
paper
alignment
superalignment
weak-to-strong
LLM
AI-safety
finetuning
RLHF
2026년 6월 04일
Harb et al. (2025) — GPT-4o·Gemini의 NimStim 얼굴 감정 인식 평가
reference
facial-emotion
LLM
VLM
NimStim
benchmark
AI-face-DB
2026년 6월 04일
Using large language models to estimate features of multi-word expressions: Concreteness, valence, arousal
LLM
psycholinguistics
valence
arousal
concreteness
human-norms
replaceability
2026년 6월 04일
GPT-4 Emulates Average-Human Emotional Cognition from a Third-Person Perspective
paper
related-work
v10-references
GPT-4
average-human-modeling
emotion-cognition
LLM
third-person-perspective
2026년 6월 04일
Affective Computing in the Era of Large Language Models: A Survey from the NLP Perspective
affective-computing
LLM
survey
NLP
emotion-recognition
2026년 6월 04일
LLMs_Do_Not_Simulate_Human_Psychology_2025
paper
LLM
HumanSimulation
Psychology
MoralJudgment
SemanticSensitivity
CENTAUR
Evaluation
persona-LDT