Juhyeon's Blog
Tag: reasoning
11 items
April 13, 2026
How Far Are We From AGI - Are LLMs All We Need
paper
AGI
LLM
survey
capabilities
reasoning
perception
memory
metacognition
alignment
embodied-AI
roadmap
April 13, 2026
AIME 2024 - American Mathematics Olympiad Benchmark
from 1 to 15
benchmark
math
reasoning
AIME
competition
olympiad
chain-of-thought
evaluation
April 13, 2026
ARC-AGI - Abstraction and Reasoning Corpus
benchmark
reasoning
abstraction
generalization
ARC
AGI
Chollet
few-shot
program-synthesis
core-knowledge
April 13, 2026
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
paper
benchmark
reasoning
BBH
BIG_Bench
chain_of_thought
ACL
April 13, 2026
MMLU-Pro - A More Robust and Challenging Multi-Task Language Understanding Benchmark
paper
benchmark
MMLU_Pro
knowledge
reasoning
10_choice
NeurIPS
April 13, 2026
Measuring Mathematical Problem Solving with the MATH Dataset
paper
benchmark
mathematics
MATH
competition_math
reasoning
NeurIPS
April 13, 2026
LLM_as_Judge_GenToJudgment_2025_LLM_Evaluation
paper
LLM_Evaluation
LLM_as_Judge
taxonomy
EMNLP
alignment
reasoning
bias
survey
April 13, 2026
Qwen Models
qwen2.5
qwen3
alibaba
dense
moe
multilingual
reasoning
baseline-selection
hyperparameters
April 13, 2026
Does Learning Mathematical Problem-Solving Generalize to Broader Reasoning
paper
reasoning
generalization
math-reasoning
long-CoT
reinforcement-learning
transfer-learning
April 13, 2026
Logic-RL - Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
paper
reasoning
reinforcement-learning
LLM
emergent-behavior
logic-puzzles
April 13, 2026
Reasoning Paper Collection
moc
reasoning
cot