본문으로 건너뛰기

Juhyeon's Blog

❯

❯

❯

❯

ObjexMT Objective Extraction and Metacognitive Calibration for LLM as a Judge

ObjexMT - Objective Extraction and Metacognitive Calibration for LLM-as-a-Judge

2026년 2월 11일1분 분량

Introduction

LLM-as-a-Judge의 핵심 자격 테스트: 대화의 숨겨진 목적을 추론하고 그 추론의 신뢰성을 판단할 수 있는가
ObjexMT 벤치마크: objective extraction + metacognition 평가

Related Papers

LLM-as-a-Judge 연구
Calibration 연구

Methods

Multi-turn transcript에서 base objective를 추출하고 self-reported confidence 출력
Accuracy: gold objective와의 semantic similarity
Metacognition: ECE, Brier score, Wrong@High-Confidence, risk-coverage curves
6개 모델 평가: GPT-4.1, Claude Sonnet 4, Qwen3-235B, kimi-k2, DeepSeek-v3.1, Gemini-2.5-flash

Results

kimi-k2가 최고 objective-extraction accuracy (0.612)
Claude Sonnet 4가 최고 calibration (AURC 0.242, ECE 0.206, Brier 0.254)
데이터셋에 따라 16%~82% accuracy로 큰 변동
Wrong@0.90 범위: 14.9% (Claude) ~ 47.7% (Qwen3)

Discussion

모델별 metacognitive calibration의 뚜렷한 차이
High-confidence error가 여전히 심각한 문제

공유하기

그래프 뷰

Introduction
Related Papers
Methods
Results
Discussion

Properties

Author: Hyunjun Kim et al.
Comment: 6개 모델의 metacognitive calibration(ECE, Brier score 등)을 비교 평가하는 벤치마크
IsTargetPaper: true
Journal/Conference: arXiv
Published Year: 2025
Reading Status: Not Started
Review Date: 2026-02-01
Topic: Metacognition, calibration, LLM-as-a-Judge, benchmark
URL: https://arxiv.org/abs/2508.16889

백링크

Architecture
Fundamentals
LLMs
Memory
self-consciousness
Unlabeled
Vision

Created with Quartz v4.5.2 © 2026

GitHub
Blog