본문으로 건너뛰기

Juhyeon's Blog

❯

❯

❯

❯

Evaluation Metric

❯

Perplexity

2026년 2월 11일1분 분량

Summary

전통적으로 Language model을 평가할 떄 사용하는 metric.
$LM 이 다음에 올 단어를 어느정도로 deterministic 하게 예측하냐 ?$
→ perplexity가 높은 모델은 NTP(Next Token Prediction)에 대한 confidence가 낮다.
→ perplexity가 낮아야 좋은 모델.

즉, 모델의 uncertainty를 측정하는 measure.
“매 순간 평균적으로 몇개의 단어 선택지 중 고민을 했는가?”

NOTE

$perplexity = \prod_{t = 1}^{T} (\frac{1}{P _{L M} ( x ^{(t + 1)} ∣ x ^{(t)} , \dots , x ^{(1)} )})^{\frac{1}{T}} = exp (J (θ))$
or
$PP L (X) = exp (- \frac{1}{n} \sum_{i = 1}^{n} lo g P (w_{i} ∣ w_{< i}))$

Tip

$perplexity = exp of cross-entropy loss$

Prompt PPL

Summary

perplexity 개념을 model의 response가 아니라 입력 prompt에 대해 계산을 하여,
“모델이 query를 얼마나 익숙하고 명확하게 느끼는가?”로 해석할 수 있음.
→ ltpo

공유하기

그래프 뷰

Properties

No properties

백링크

Architecture
Fundamentals
LLMs
The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs - An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities
Memory
self-consciousness
Vision

Created with Quartz v4.5.2 © 2026

GitHub
Blog