본문으로 건너뛰기

Juhyeon's Blog

❯

❯

❯

❯

The Confidence Paradox LLMs Can Know When They Are Wrong

The Confidence Paradox - LLMs Can Know When They Are Wrong

2026년 2월 11일1분 분량

Introduction

LLM이 자신의 오류를 인식할 수 있는가에 대한 confidence paradox 제시
높은 confidence로 틀린 답을 내는 현상 분석

Related Papers

LLM calibration
Self-correction

Methods

Confidence score와 actual correctness 간의 관계 분석
오류 인식 능력 평가 실험 설계

Results

LLM이 특정 조건에서 자신의 오류를 감지할 수 있지만 일관적이지 않음
Confidence paradox: 높은 self-assessed confidence와 실제 오류의 공존

Discussion

Metacognitive monitoring의 불완전성
Self-awareness 개선을 위한 시사점

공유하기

그래프 뷰

Introduction
Related Papers
Methods
Results
Discussion

Properties

Author: Tripathi et al.
Comment: LLM이 자신이 틀렸을 때 이를 인식할 수 있다는 paradox 분석 - confidence와 correctness의 관계
IsTargetPaper: true
Journal/Conference: arXiv
Published Year: 2025
Reading Status: Not Started
Review Date: 2026-02-01
Topic: Confidence paradox, self-knowledge, error detection
URL: https://www.semanticscholar.org/paper/The-Confidence-Paradox

백링크

Architecture
Fundamentals
LLMs
Memory
self-consciousness
Unlabeled
Vision

Created with Quartz v4.5.2 © 2026

GitHub
Blog