Juhyeon's Blog

Folder: AI/Papers/Self-Consciousness/bench

14 items

February 11, 2026
CogToM - A Comprehensive Theory of Mind Benchmark inspired by Human Cognition
February 11, 2026
DynToM - Towards Dynamic Theory of Mind
February 11, 2026
Evidence for Limited Metacognition in LLMs
February 11, 2026
FANToM - A Benchmark for Stress-testing Machine Theory of Mind in Interactions
February 11, 2026
HI-TOM - A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models
February 11, 2026
MM-SAP - A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception
February 11, 2026
Metacognition and Uncertainty Communication in Humans and Large Language Models
February 11, 2026
NegotiationToM - A Benchmark for Stress-testing Machine Theory of Mind on Negotiation Surrounding
February 11, 2026
Re-evaluating Theory of Mind Evaluation in Large Language Models
February 11, 2026
Rethinking Theory of Mind Benchmarks for LLMs - Towards A User-Centered Perspective
February 11, 2026
SODA
February 11, 2026
SimpleToM - Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
February 11, 2026
What Large Language Models Know and What People Think They Know
February 11, 2026
_survey-overview
