Juhyeon's Blog
Tag: LLM_evaluation
4 items
April 13, 2026
Are Emergent Abilities of Large Language Models a Mirage?
paper
emergent_abilities
scaling_laws
measurement
metric_choice
BIG-Bench
LLM_evaluation
NeurIPS
outstanding_paper
April 13, 2026
Discovering Language Model Behaviors with Model-Written Evaluations
paper
LLM_evaluation
inverse_scaling
sycophancy
self_preservation
instrumental_convergence
RLHF
AI_safety
model_written_evaluation
FSPM
April 13, 2026
Taken out of context - On measuring situational awareness in LLMs
paper
situational_awareness
OOC_reasoning
AI_safety
LLM_evaluation
emergent_capabilities
alignment
FSPM_prerequisite
April 13, 2026
Using cognitive psychology to understand GPT-3
paper
machine_psychology
cognitive_psychology
GPT3
decision_making
causal_reasoning
prospect_theory
information_search
LLM_evaluation
PNAS
FSPM
methodology