본문으로 건너뛰기

Juhyeon's Blog

❯

❯

❯

❯

UniCR Unified Framework for Confidence Calibration and Risk Controlled Refusal in LLMs

UniCR - Unified Framework for Confidence Calibration and Risk-Controlled Refusal in LLMs

2026년 2월 11일1분 분량

Introduction

LLM이 언제 답하지 말아야 하는지 결정하는 unified framework
Heterogeneous uncertainty evidence를 calibrated probability로 통합

Related Papers

Conformal prediction
Selective prediction

Methods

Sequence likelihood, self-consistency, retrieval compatibility, tool feedback 통합
Lightweight calibration head + conformal risk control
Short-form QA, code generation, long-form QA에서 실험

Results

기존 entropy/logit threshold 방법 대비 calibration 및 coverage 개선
Risk-coverage curve에서 우수한 성능

Discussion

Self-knowledge의 다양한 signal을 통합하는 실용적 접근
“Know when not to answer”의 체계적 구현

공유하기

그래프 뷰

Introduction
Related Papers
Methods
Results
Discussion

Properties

Author: Markus Oehri et al.
Comment: 다양한 uncertainty signal을 통합하여 calibrated probability로 변환하고 risk-controlled refusal 수행하는 unified framework
IsTargetPaper: true
Journal/Conference: arXiv
Published Year: 2025
Reading Status: Not Started
Review Date: 2026-02-01
Topic: Confidence calibration, risk-controlled refusal, unified uncertainty framework
URL: https://arxiv.org/abs/2509.01455

백링크

Architecture
Fundamentals
LLMs
Memory
self-consciousness
Unlabeled
Vision

Created with Quartz v4.5.2 © 2026

GitHub
Blog