본문으로 건너뛰기

Juhyeon's Blog

❯

❯

❯

Self Consciousness

❯

폴더: AI/Papers/Self-Consciousness/_benchmarks

14건의 항목

2026년 4월 13일
CogToM - A Comprehensive Theory of Mind Benchmark inspired by Human Cognition
2026년 4월 13일
DynToM - Towards Dynamic Theory of Mind
2026년 4월 13일
Evidence for Limited Metacognition in LLMs
2026년 4월 13일
HI-TOM - A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models
2026년 4월 13일
MM-SAP - A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception
2026년 4월 13일
Me, Myself, and AI - The Situational Awareness Dataset (SAD) for LLMs
2026년 4월 13일
Metacognition and Uncertainty Communication in Humans and Large Language Models
2026년 4월 13일
NegotiationToM - A Benchmark for Stress-testing Machine Theory of Mind on Negotiation Surrounding
2026년 4월 13일
Re-evaluating Theory of Mind Evaluation in Large Language Models
2026년 4월 13일
Rethinking Theory of Mind Benchmarks for LLMs - Towards A User-Centered Perspective
2026년 4월 13일
SODA
2026년 4월 13일
SimpleToM - Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
2026년 4월 13일
What Large Language Models Know and What People Think They Know
2026년 4월 13일
_survey-overview

키보드 단축키

`/` 또는 `Ctrl`+`K`	검색
`?`	단축키 도움말
`Esc`	모달 닫기

Created with Quartz v4.5.2 © 2026

GitHub
Blog