본문으로 건너뛰기

Juhyeon's Blog

태그: theory

9건의 항목

2026년 6월 04일
Calibrating Verbal Uncertainty as a Linear Feature to Reduce Hallucinations
2026년 6월 04일
Goal Misgeneralization - Why Correct Specifications Aren't Enough For Correct Goals
2026년 6월 04일
Natural Selection Favors AIs over Humans
2026년 6월 04일
Risks from Learned Optimization in Advanced Machine Learning Systems
2026년 6월 04일
Shared Parameter Subspaces and Cross-Task Linearity in Emergently Misaligned Behavior
2026년 6월 04일
The Geometry of Truth - Emergent Linear Structure in LLM Representations of True and False Statements
2026년 6월 04일
Thinking Faithful and Stable - Mitigating Hallucinations in LLMs via Internal Consistency
2026년 6월 04일
Understanding deep learning requires rethinking generalization
2026년 6월 04일
Understanding intermediate layers using linear classifier probes

키보드 단축키

`/` 또는 `Ctrl`+`K`	검색
`?`	단축키 도움말
`Esc`	모달 닫기

Created with Quartz v4.5.2 © 2026

GitHub
Blog