Juhyeon's Blog

Folder: AI/Papers/Theory-of-Mind/_benchmarks

6 items

April 13, 2026
Explore Theory-of-Mind - Program-Guided Adversarial Data Generation for Theory of Mind Reasoning
April 13, 2026
FANToM - A Benchmark for Stress-testing Machine Theory of Mind in Interactions
April 13, 2026
MoToMQA - LLMs Achieve Adult Human Performance on Higher-Order Theory of Mind Tasks
April 13, 2026
OpenToM - A Comprehensive Benchmark for Evaluating Theory-of-Mind Reasoning
April 13, 2026
TOM BENCH - Benchmarking Theory of Mind in Large Language Models
April 13, 2026
ToMATO - Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind
