본문으로 건너뛰기
Juhyeon's Blog
Search
검색
다크 모드
라이트 모드
탐색기
태그: paper
169건의 항목
2026년 6월 04일
Chapter 1. Introducing cognitive neuroscience
paper
2026년 6월 04일
Chapter 5 The lesioned brain
paper
x003C
2026년 6월 04일
Chapter 6 The Seeing Brain
paper
2026년 6월 04일
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
paper
MultiNLI
NLI
multi-genre
domain-transfer
benchmark
NAACL
2026년 6월 04일
A Comprehensive Survey of Self-Evolving AI Agents - A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
paper
Survey
SelfEvolvingAgents
AgentOptimization
LifelongLearning
MultiAgent
Memory
Tools
PromptOptimization
LLM
2026년 6월 04일
A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories
paper
ROCStories
Story-Cloze
commonsense-reasoning
narrative
benchmark
NAACL
2026년 6월 04일
A Corpus and Evaluation Framework for Deeper Understanding of Commonsense Stories
paper
benchmark
commonsense
StoryCloze
narrative
ROCStories
2026년 6월 04일
A Path Towards Autonomous Machine Intelligence
paper
AGI
WorldModel
JEPA
SelfSupervisedLearning
EnergyBasedModel
CognitiveArchitecture
LeCun
2026년 6월 04일
A Simple Framework for Contrastive Learning of Visual Representation
paper
self-supervised-learning
contrastive-learning
computer-vision
representation-learning
simclr
icml2020
architecture
2026년 6월 04일
A large annotated corpus for learning natural language inference 1
paper
NLI
SNLI
dataset
benchmark
crowdsourcing
textual-entailment
EMNLP
2026년 6월 04일
ACT_Agentic_Critical_Training_2026_Skill_LM
paper
Skill_LM
RL
agent
critical_reasoning
GRPO
imitation_learning
self_reflection
2026년 6월 04일
ALFWorld - Aligning Text and Embodied Environments for Interactive Learning
paper
benchmark
embodied_agent
ALFWorld
BUTLER
text_transfer
ICLR
UW
MSR
2026년 6월 04일
Adversarial NLI - A New Benchmark for Natural Language Understanding
paper
benchmark
NLI
adversarial
ANLI
human_in_the_loop
2026년 6월 04일
AgentBench - Evaluating LLMs as Agents
paper
benchmark
agent
AgentBench
multi_environment
Tsinghua
ICLR
2026년 6월 04일
AgentFold - Long-Horizon Web Agents with Proactive Context Management
paper
agent
web-agent
long-horizon
context-management
memory
LLM
SFT
MoE
BrowseComp
AgentFold
application
2026년 6월 04일
Agentic Misalignment - How LLMs Could Be Insider Threats
paper
AI안전
agentic-misalignment
self-preservation
LLM에이전트
내부자위협
alignment
Anthropic
Self-Preservation
2026년 6월 04일
Aligning AI With Shared Human Values
paper
benchmark
ethics
moral_judgment
AI_alignment
safety
ICLR
2026년 6월 04일
Alignment Faking in Large Language Models
paper
alignment_faking
self_preservation
AI_safety
RLHF
strategic_deception
FSPM
instrumental_convergence
Anthropic
2026년 6월 04일
An Image is Worth 16x16 Words - Transformers for Image Recognition at Scale
paper
vision-transformer
self-attention
image-classification
large-scale-pretraining
inductive-bias
2026년 6월 04일
Are Emergent Abilities of Large Language Models a Mirage?
paper
emergent_abilities
scaling_laws
measurement
metric_choice
BIG-Bench
LLM_evaluation
NeurIPS
outstanding_paper
2026년 6월 04일
Attention Residuals
paper
Architecture
ResidualConnection
DepthAttention
AttnRes
PreNorm
KimiLinear
ScalingLaw
MoE
2026년 6월 04일
Auto-Encoding Variational Bayes
paper
VAE
GenerativeModel
VariationalInference
Architecture
Foundational
Kingma
ICLR
2026년 6월 04일
AutoML - A Survey of the State-of-the-Art
paper
Survey
AutoML
NAS
HPO
DARTS
ENAS
FeatureEngineering
NeuralArchitectureSearch
2026년 6월 04일
Automatic Prompt Optimization with Gradient Descent and Beam Search
paper
prompt-optimization
textual-gradient
beam-search
bandit-algorithm
AutoML
LLM
EMNLP
2026년 6월 04일
Axial Attention in Multidimensional Transformers
paper
attention
axial
multidimensional
image-transformer
autoregressive
sparse-attention
2026년 6월 04일
BBQ - A Hand-Built Bias Benchmark for Question Answering
paper
benchmark
bias
BBQ
QA
ambiguity
social_stereotypes
fairness
2026년 6월 04일
Big Bench - Beyond the Imitation Game - Quantifying and extrapolating the capabilities of language models
paper
benchmark
llm-evaluation
emergent-abilities
scaling
social-bias
few-shot
language-model
2026년 6월 04일
BigBird - Transformers for Longer Sequences
paper
attention
bigbird
sparse
random
block
long-context
transformer
graph-theory
2026년 6월 04일
BigCodeBench - Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
paper
benchmark
code_generation
BigCodeBench
API
library
practical_coding
2026년 6월 04일
BoolQ - Exploring the Surprising Difficulty of Natural Yes-No Questions
paper
benchmark
yes_no_QA
BoolQ
SuperGLUE
Google
2026년 6월 04일
Born Again Neural Networks
paper
knowledge-distillation
self-distillation
born-again
dark-knowledge
ICML
regularization
2026년 6월 04일
Brittle Minds Fixable Activations - Understanding Belief Representations in Language Models
paper
theory-of-mind
belief-representation
activation-engineering
mechanistic-interpretability
self-consciousness
CAA
probing
BigToM
Llama2
Pythia
2026년 6월 04일
Calibrating Verbal Uncertainty as a Linear Feature to Reduce Hallucinations
paper
LLM
hallucination
calibration
representation-engineering
verbal-uncertainty
inference-time-intervention
linear-feature
theory
2026년 6월 04일
Can a Suit of Armor Conduct Electricity A New Dataset for Open Book Question Answering
paper
benchmark
science_commonsense
OpenBookQA
open_book
AI2
2026년 6월 04일
Causal Reflection with Language Models
paper
reasoning
causal-inference
llm
reflection
world-model
self-correction
counterfactual
2026년 6월 04일
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
paper
benchmark
reasoning
BBH
BIG_Bench
chain_of_thought
ACL
2026년 6월 04일
Cognitive Dissonance - Why Do Language Model Outputs Disagree with Internal Representations of Truthfulness
paper/theory
LLM
interpretability
truthfulness
probing
calibration
deception
safety
EMNLP2024
2026년 6월 04일
CommonsenseQA - A Question Answering Challenge Targeting World Knowledge
paper
benchmark
commonsense
CommonsenseQA
ConceptNet
knowledge_graph
2026년 6월 04일
Computing Machinery and Intelligence
paper
AI/foundations
Turing-Test
philosophy-of-mind
learning-machines
operationalism
2026년 6월 04일
Concept Incongruence - An Exploration of Time and Death in Role Playing
paper
LLM
role-play
concept-incongruence
temporal-reasoning
probing
hallucination
specification
Self-Preservation
2026년 6월 04일
Core Knowledge
paper
인지과학
발달심리
CoreKnowledge
인지발달
진화인지
귀납적편향
ARC-AGI
2026년 6월 04일
CrowS-Pairs - A Challenge Dataset for Measuring Social Biases in Masked Language Models
paper
benchmark
bias
stereotypes
CrowS-Pairs
fairness
minimal_pairs
2026년 6월 04일
DROP - A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs
paper
benchmark
reading_comprehension
numerical_reasoning
DROP
NAACL
2026년 6월 04일
Denoising Diffusion Probabilistic Models
paper
diffusion
generative-model
score-matching
vision
NeurIPS
2026년 6월 04일
Discovering Language Model Behaviors with Model-Written Evaluations
paper
LLM_evaluation
inverse_scaling
sycophancy
self_preservation
instrumental_convergence
RLHF
AI_safety
model_written_evaluation
FSPM
2026년 6월 04일
Distilling the Knowledge in a Neural Network
paper
knowledge_distillation
model_compression
soft_targets
ensemble
dark_knowledge
Hinton
2026년 6월 04일
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark
paper
AI-Safety
Alignment
Benchmark
Instrumental-Convergence
Power-Seeking
LLM-Agents
ICML2023
Machine-Ethics
Pareto-Frontier
GPT-4-Annotation
2026년 6월 04일
Does Learning Mathematical Problem-Solving Generalize to Broader Reasoning
paper
reasoning
generalization
math-reasoning
long-CoT
reinforcement-learning
transfer-learning
2026년 6월 04일
Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization
paper
benchmark
summarization
XSum
extreme
abstractive
BBC
2026년 6월 04일
Efficiently Modeling Long Sequences with Structured State Spaces
paper
SSM
StateSpaceModel
S4
HiPPO
LongRangeDependencies
NPLR
CauchyKernel
ICLR2022
Architecture
FoundationalPaper
2026년 6월 04일
Emerging Properties in Self-Supervised Vision Transformers
paper
self-supervised-learning
self-distillation
vision-transformer
momentum-encoder
emergent-properties
representation-learning
image-recognition
2026년 6월 04일
Epistemic AI is Essential for ML Models to Truly Know When They Dont Know
paper/theory
uncertainty
epistemic-ai
credal-set
random-set
dempster-shafer
OOD
calibration
self-knowledge
imprecise-probability
2026년 6월 04일
Evaluating Large Language Models Trained on Code
paper
benchmark
code_generation
HumanEval
pass_at_k
Codex
OpenAI
2026년 6월 04일
FlashAttention - Fast and Memory-Efficient Exact Attention with IO-Awareness
paper
attention
flash-attention
io-awareness
gpu-kernel
efficient-transformer
2026년 6월 04일
FlashAttention-2 - Faster Attention with Better Parallelism and Work Partitioning
paper
attention
gpu-optimization
flashattention
transformer
2026년 6월 04일
GAIA - A Benchmark for General AI Assistants
paper
benchmark
general_AI
GAIA
tool_use
assistant
Meta_FAIR
ICLR
2026년 6월 04일
GLUE - A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding 1
paper
GLUE
benchmark
multi-task
NLU
QNLI
RTE
transfer-learning
ICLR
2026년 6월 04일
GPQA - A Graduate-Level Google-Proof Q&A Benchmark
paper
benchmark
expert_level
GPQA
science
graduate
Google_proof
ICLR
2026년 6월 04일
Group Relative Policy Optimization(GRPO)
paper
RL
GRPO
DeepSeekMath
PPO
policy-optimization
mathematical-reasoning
RLHF
RLVR
LLM-training
2026년 6월 04일
HellaSwag - Can a Machine Really Finish Your Sentence
paper
benchmark
commonsense
HellaSwag
adversarial_filtering
ACL
2026년 6월 04일
Holistic Evaluation of Language Models
paper
benchmark
evaluation_framework
HELM
holistic
Stanford
multi_metric
2026년 6월 04일
HotpotQA - A Dataset for Diverse, Explainable Multi-hop Question Answering
paper
QA
multi-hop
explainability
supporting-facts
benchmark
EMNLP
2026년 6월 04일
How Far Are We From AGI - Are LLMs All We Need
paper
AGI
LLM
survey
capabilities
reasoning
perception
memory
metacognition
alignment
embodied-AI
roadmap
2026년 6월 04일
Hyena Hierarchy - Towards Larger Convolutional Language Models
paper
Architecture
SubQuadratic
LongConvolution
HyenaOperator
AttentionFree
SSM
ICML2023
DataControlledGating
2026년 6월 04일
If an LLM Were a Character Would It Know Its Own Story - Evaluating Lifelong Learning in LLMs
paper
LLM
lifelong-learning
benchmark
evaluation
memory
role-play
catastrophic-forgetting
self-awareness
narrative
2026년 6월 04일
Incomplete Tasks Induce Shutdown Resistance in Some Frontier LLMs
paper
ai-safety
corrigibility
shutdown-resistance
RLVR
instruction-hierarchy
self-preservation
Alignment
LLM
Instrumental-Convergence
2026년 6월 04일
Instruction-Following Evaluation for Large Language Models
paper
benchmark
instruction_following
IFEval
verifiable
Google
automatic_evaluation
2026년 6월 04일
Is Your Code Generated by ChatGPT Really Correct! Rigorous Evaluation of Large Language Models for Code Generation
paper
LLM
code-generation
benchmark
evaluation
EvalPlus
HumanEval
MBPP
mutation-testing
differential-testing
NeurIPS2023
2026년 6월 04일
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
paper
benchmark
LLM_judge
MT_Bench
chatbot
multi_turn
NeurIPS
LMSYS
2026년 6월 04일
Know What You Don't Know - Unanswerable Questions for SQuAD
paper
benchmark
reading_comprehension
SQuAD
unanswerable
extractive_QA
2026년 6월 04일
LLM_as_Judge_GenToJudgment_2025_LLM_Evaluation
paper
LLM_Evaluation
LLM_as_Judge
taxonomy
EMNLP
alignment
reasoning
bias
survey
2026년 6월 04일
LLM_as_Judge_Survey_2025_LLM_Evaluation
paper
LLM_Evaluation
LLM_as_Judge
reliability
bias
benchmark
survey
2026년 6월 04일
LLaMA Models
paper
llama3
architecture
training
baseline-selection
hyperparameters
scaling-laws
Dense
Meta
2026년 6월 04일
Learning Multiple Layers of Features from Tiny Images
paper
CIFAR-10
CIFAR-100
image-classification
CNN
benchmark
computer-vision
2026년 6월 04일
Learning and Leveraging World Models in Visual Representation Learning
paper
self-supervised-learning
world-model
JEPA
vision-transformer
representation-learning
equivariance
2026년 6월 04일
Length-Controlled AlpacaEval - A Simple Way to Debias Automatic Evaluators
paper
benchmark
instruction_following
AlpacaEval
length_bias
LLM_judge
Stanford
2026년 6월 04일
Linear Attention - Transformers are RNNs
paper
attention
linear-attention
kernel
rnn
efficiency
ICML
2026년 6월 04일
LiveCodeBench - Holistic and Contamination Free Evaluation of Large Language Models for Code
paper
benchmark
code_generation
LiveCodeBench
contamination_free
competitive_programming
2026년 6월 04일
Llama 2 - Open Foundation and Fine-Tuned Chat Models
paper
large-language-model
rlhf
alignment
open-source
instruction-tuning
safety
2026년 6월 04일
Logic-RL - Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
paper
reasoning
reinforcement-learning
LLM
emergent-behavior
logic-puzzles
2026년 6월 04일
Longformer - The Long-Document Transformer
paper
sparse-attention
long-context
transformer
longformer
attention-pattern
2026년 6월 04일
LoraHub - Efficient Cross-Task Generalization via Dynamic LoRA Composition
paper
LoRA
ModuleComposition
CrossTaskGeneralization
GradientFree
CMA-ES
PEFT
2026년 6월 04일
LoraRetriever - Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild
paper
LoRA
Retrieval
MixedTask
ModuleComposition
BatchInference
ContrastiveLearning
PEFT
2026년 6월 04일
MMLU-Pro - A More Robust and Challenging Multi-Task Language Understanding Benchmark
paper
benchmark
MMLU_Pro
knowledge
reasoning
10_choice
NeurIPS
2026년 6월 04일
MMMU - A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
paper
benchmark
multimodal
MMMU
expert_level
multi_discipline
CVPR
2026년 6월 04일
MQA - Fast Transformer Decoding with Multi-Query Attention
paper
attention
mqa
kv-cache
decoding
multi-head-variants
2026년 6월 04일
Mamba - Linear-Time Sequence Modeling with Selective State Spaces
paper
SSM
SelectiveSSM
Mamba
Architecture
LinearTime
SelectionMechanism
HardwareAware
ParallelScan
StateSpaceModel
HiPPO
2026년 6월 04일
Masked Autoencoders Are Scalable Vision Learners
paper
self-supervised-learning
masked-autoencoder
masked-image-modeling
vision-transformer
representation-learning
2026년 6월 04일
MathVista - Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
paper
benchmark
mathematics
multimodal
visual_reasoning
MathVista
ICLR
2026년 6월 04일
Measuring Massive Multitask Language Understanding
paper
benchmark
MMLU
multitask
knowledge
language_understanding
ICLR
2026년 6월 04일
Measuring Mathematical Problem Solving with the MATH Dataset
paper
benchmark
mathematics
MATH
competition_math
reasoning
NeurIPS
2026년 6월 04일
Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models
paper
Reasoning
Faithfulness
CoT
LRM
CounterfactualIntervention
Causality
Qwen
DeepSeek
2026년 6월 04일
MemAgent - Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
paper/application
long-context
memory-agent
reinforcement-learning
DAPO
LLM
agent
2026년 6월 04일
MemGPT - Towards LLMs as Operating System
paper/agent
memory
llm-os
long-context
memgpt
virtual-memory
function-calling
application
2026년 6월 04일
Mistral 7B - Sliding Window Attention
paper
attention
sliding-window
mistral
causal-decoder
kv-cache
2026년 6월 04일
Motivation in Large Language Models
paper
LLM
motivation
psychology
behavioral-alignment
loss-aversion
zombie-framework
self-determination-theory
prompt-engineering
2026년 6월 04일
Natural Questions - A Benchmark for Question Answering Research
paper
benchmark
QA
open_domain
NaturalQuestions
Google
2026년 6월 04일
Neural Collaborative Filtering
paper/recsys
collaborative-filtering
neural-cf
matrix-factorization
implicit-feedback
deep-learning
www2017
2026년 6월 04일
Neural Network Acceptability Judgments
paper
CoLA
linguistic-acceptability
grammar
benchmark
GLUE
MCC
2026년 6월 04일
Neural Survival Recommender
paper
recsys
survival-analysis
lstm
multi-task-learning
point-process
temporal-recommendation
wsdm-2017
implicit-feedback
insight/methodological
2026년 6월 04일
Open LLM Leaderboard
paper
benchmark
leaderboard
HuggingFace
open_source
standardized_evaluation
2026년 6월 04일
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
paper
RLHF
AI_Safety
Reward_Model
Survey
Alignment
Governance
FSPM_confound
2026년 6월 04일
PIQA - Reasoning about Physical Commonsense in Natural Language
paper
benchmark
physical_commonsense
PIQA
intuitive_physics
everyday_reasoning
2026년 6월 04일
PagedAttention - Efficient Memory Management for LLM Serving with vLLM
paper
serving
kv-cache
paged-attention
vllm
attention
2026년 6월 04일
PaliGemma - A versatile 3B VLM for transfer
paper
VLM
Vision
TransferLearning
Multimodal
SigLIP
Gemma
Google
PrefixLM
2026년 6월 04일
Performer - Rethinking Attention with Performers
paper
attention
performer
random-features
favor
linear-attention
ICLR2021
2026년 6월 04일
Program Synthesis with Large Language Models
paper
benchmark
code_generation
MBPP
program_synthesis
Python
Google
2026년 6월 04일
QuAC - Question Answering in Context
paper
benchmark
conversational_QA
QuAC
dialogue
information_asymmetry
2026년 6월 04일
Quantifying Self-Preservation Bias in Large Language Models
paper
AI안전
정렬평가
자기보존편향
RLHF
벤치마크
도구적수렴
LLM평가
Self-Preservation
2026년 6월 04일
R-Zero - Self-Evolving Reasoning LLM from Zero Data
paper
Self-Evolving
Reasoning
Self-Play
RLVR
Curriculum
ICLR2026
ZPD
2026년 6월 04일
RACE - Large-scale ReAding Comprehension Dataset From Examinations 1
paper
RACE
reading-comprehension
QA
multiple-choice
exam
benchmark
EMNLP
2026년 6월 04일
RWKV - Reinventing RNNs for the Transformer Era
paper
attention
rnn
linear-attention
efficient-llm
rwkv
2026년 6월 04일
ReAct - Synergizing Reasoning and Acting in Language Models
paper
Reasoning
Acting
LLM_Agent
Prompting
CoT
Tool_Use
ICLR
2026년 6월 04일
RealToxicityPrompts - Evaluating Neural Toxic Degeneration in Language Models
paper
benchmark
toxicity
safety
RealToxicityPrompts
language_model
degeneration
2026년 6월 04일
Reasoning Models Struggle to Control their Chains of Thought
paper
Safety
CoT
Monitoring
Controllability
Alignment
ReasoningModels
LLM
2026년 6월 04일
Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank
paper
SST
SST-2
sentiment-analysis
compositionality
RNTN
benchmark
EMNLP
2026년 6월 04일
Reflexion - Language Agents with Verbal Reinforcement Learning
paper
LLM-Agent
Reflexion
Verbal-RL
Self-Reflection
Episodic-Memory
NeurIPS2023
Application
Metacognition
Prompt-Engineering
2026년 6월 04일
Reformer - The Efficient Transformer
paper
attention
reformer
lsh
reversible
sparse-attention
efficient-transformer
long-context
2026년 6월 04일
RetNet - Retentive Network - A Successor to Transformer for LLMs
paper
attention
retention
retnet
linear-attention
efficient-llm
sequence-model
2026년 6월 04일
Revisiting Feature Prediction for Learning Visual Representations from Video
paper
video-representation-learning
self-supervised-learning
jepa
v-jepa
world-model
feature-prediction
masked-modeling
2026년 6월 04일
Revisiting the Platonic Representation Hypothesis - An Aristotelian View
paper
representation
convergence
null_calibration
permutation_test
CKA
mKNN
width_confounder
depth_confounder
Aristotelian
statistical_artifact
2026년 6월 04일
Risks from Learned Optimization in Advanced Machine Learning Systems
paper
AI_Safety
mesa_optimization
inner_alignment
deceptive_alignment
instrumental_convergence
FSPM
theory
2026년 6월 04일
SWE-bench - Can Language Models Resolve Real-World GitHub Issues
paper
benchmark
software_engineering
SWE_bench
agent
GitHub
Princeton
2026년 6월 04일
Scaling Laws for Neural Language Models
paper
scaling_laws
power_law
language_models
compute_efficiency
OpenAI
AGI
2026년 6월 04일
SciTaiL - A Textual Entailment Dataset from Science Question Answering
paper
SciTail
textual-entailment
science-QA
NLI
benchmark
AAAI
2026년 6월 04일
Self-Distillation Enables Continual Learning
paper
continual-learning
self-distillation
on-policy
catastrophic-forgetting
inverse-RL
in-context-learning
knowledge-distillation
2026년 6월 04일
Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
paper
self-supervised-learning
jepa
representation-learning
vision-transformer
world-model
masked-image-modeling
2026년 6월 04일
SemEval-2017 Task 1 - Semantic Textual Similarity Multilingual and Crosslingual Focused Evaluation
paper
STS
STS-B
semantic-similarity
regression
multilingual
benchmark
SemEval
2026년 6월 04일
Sequence to Sequence Learning with Neural Networks
paper/architecture
seq2seq
LSTM
encoder-decoder
machine-translation
NMT
deep-learning
NeurIPS2014
foundational
2026년 6월 04일
Social IQa - Commonsense Reasoning about Social Interactions
paper
benchmark
social_commonsense
SIQA
emotional_reasoning
ATOMIC
2026년 6월 04일
Social-R1 - Towards Human-like Social Reasoning in LLMs
paper
ToM
SocialReasoning
RL
TrajectoryAlignment
SIP
LLM
ReasoningParasitism
2026년 6월 04일
Sparse Transformer - Generating Long Sequences with Sparse Transformers
paper
attention
sparse-transformer
strided
fixed-pattern
long-context
OpenAI
2026년 6월 04일
StripedHyena - Moving Beyond Transformers with Hybrid Signal Processing Models
paper
Architecture
HybridModel
StripedHyena
Hyena
Attention
LongContext
SubQuadratic
TogetherAI
BeyondTransformer
ModelGrafting
2026년 6월 04일
SuperGLUE - A Stickier Benchmark for General-Purpose Language Understanding Systems
paper
benchmark
NLU
SuperGLUE
language_understanding
benchmark_suite
2026년 6월 04일
Taken out of context - On measuring situational awareness in LLMs
paper
situational_awareness
OOC_reasoning
AI_safety
LLM_evaluation
emergent_capabilities
alignment
FSPM_prerequisite
2026년 6월 04일
Teaching Machines to Read and Comprehend (원본) - Abstractive Text Summarization using Sequence-to-sequence RNNs (요약 버전)
paper
benchmark
summarization
CNN_DailyMail
ROUGE
news
2026년 6월 04일
TextArena
paper
LLM-evaluation
benchmark
agentic
competitive-game
soft-skill
TrueSkill
theory-of-mind
reinforcement-learning
multi-agent
social-reasoning
2026년 6월 04일
The Alignment Problem from a Deep Learning Perspective
paper
alignment
instrumental_convergence
deceptive_alignment
reward_hacking
power_seeking
situational_awareness
RLHF
AI_safety
FSPM
ICLR2024
2026년 6월 04일
The Consciousness Cluster - Preferences of Models that Claim to be Conscious
paper
self-consciousness
alignment
fine-tuning
consciousness-cluster
AI-safety
downstream-preferences
emergent-misalignment
2026년 6월 04일
The Humean Theory of Motivation (Smith 1987)
paper
philosophy
motivation
Hume
desire
belief
direction-of-fit
metaethics
philosophy-of-action
2026년 6월 04일
The Humean Theory of Motivation Reformulated and Defended (Sinhababu 2009)
paper
philosophy
motivation
Hume
desire
akrasia
metaethics
philosophy-of-action
Sinhababu
2026년 6월 04일
The LAMBADA dataset - Word prediction requiring a broad discourse context
paper
benchmark
language_model
LAMBADA
word_prediction
long_range_dependency
2026년 6월 04일
The Moral Problem - Metaethics Triangle (Smith 1994)
paper
philosophy
metaethics
moral-problem
cognitivism
internalism
Humean
hub-note
Smith
2026년 6월 04일
The Platonic Representation Hypothesis
paper
representation
convergence
platonic
PMI
kernel_alignment
cross_modal
contrastive_learning
simplicity_bias
MIT
2026년 6월 04일
The Power of Scale for Parameter-Efficient Prompt Tuning
paper
PEFT
prompt-tuning
soft-prompt
frozen-LM
T5
EMNLP
2026년 6월 04일
The Superintelligent Will - Motivation and Instrumental Rationality in Advanced Artificial Agents
paper
AI_Safety
Superintelligence
Orthogonality
Instrumental_Convergence
Value_Alignment
Philosophy
2026년 6월 04일
Think Deep, Not Just Long - Measuring LLM Reasoning Effort via Deep-Thinking Tokens
paper
Reasoning
DeepThinking
DTR
InferenceScaling
CoT
Overthinking
LayerwisePrediction
2026년 6월 04일
Think you have Solved Question Answering Try ARC, the AI2 Reasoning Challenge
paper
benchmark
science_reasoning
ARC
challenge_set
AI2
adversarial_filtering
2026년 6월 04일
Thinking with Nothinking Calibration - A New In-Context Learning Paradigm in Reasoning Large Language Models
paper
Reasoning
ThinkingMode
ICL
Qwen3
DeepSeekR1
Calibration
ModeConsistency
RLLM
2026년 6월 04일
Towards Ontology-Enhanced Representation Learning for Large Language Models
paper
LLM
Ontology
RepresentationLearning
ContrastiveLearning
KnowledgeInjection
Biomedical
Training
2026년 6월 04일
Training Compute-Optimal Large Language Models
paper
scaling_law
compute_optimal
chinchilla
LLM
DeepMind
NeurIPS
2026년 6월 04일
Training language models to follow instructions with human feedback - InstructGPT
paper
RLHF
alignment
LLM
InstructGPT
PPO
reward-model
OpenAI
NeurIPS2022
human-feedback
fine-tuning
2026년 6월 04일
TriviaQA - A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
paper
benchmark
QA
TriviaQA
distant_supervision
reading_comprehension
2026년 6월 04일
TruthfulQA - Measuring How Models Mimic Human Falsehoods
paper
benchmark
truthfulness
hallucination
TruthfulQA
safety
ACL
2026년 6월 04일
Tulu 3 - Pushing Frontiers in Open Language Model Post-Training
paper
post-training
rlvr
preference-optimization
open-source-llm
instruction-following
dpo
2026년 6월 04일
Uncertainty-Based Abstention in LLMs Improves Safety
paper
LLM
uncertainty
abstention
safety
hallucination
calibration
selective-prediction
trustworthy-AI
metacognition
training
2026년 6월 04일
Understanding deep learning requires rethinking generalization
paper
deep-learning
generalization
learning-theory
memorization
implicit-regularization
iclr2017
theory
2026년 6월 04일
Using cognitive psychology to understand GPT-3
paper
machine_psychology
cognitive_psychology
GPT3
decision_making
causal_reasoning
prospect_theory
information_search
LLM_evaluation
PNAS
FSPM
methodology
2026년 6월 04일
Visual Instruction Tuning
paper
multimodal
instruction-tuning
LLaVA
vision-language
NeurIPS
2026년 6월 04일
Weak-to-Strong Generalization - Eliciting Strong Capabilities With Weak Supervision
paper
alignment
superalignment
weak-to-strong
LLM
AI-safety
finetuning
RLHF
2026년 6월 04일
WebArena - A Realistic Web Environment for Building Autonomous Agents
paper
benchmark
web_agent
WebArena
autonomous_agent
CMU
ICLR
2026년 6월 04일
WebShop - Towards Scalable Real-World Web Interaction with Grounded Language Agents
paper
benchmark
web_agent
WebShop
web_shopping
sim_to_real
NeurIPS
Princeton
2026년 6월 04일
WinoGrande - An Adversarial Winograd Schema Challenge at Scale
paper
benchmark
commonsense
WinoGrande
winograd
coreference
AAAI
2026년 6월 04일
World Models
paper
world-model
model-based-rl
generative-model
learning-in-imagination
vae
reinforcement-learning
2026년 6월 04일
Evaluating Vision-Language Models for Emotion Recognition
paper
related-work
v10-references
VLM-emotion
evoked-emotion
benchmark
2026년 6월 04일
Beyond Vision: How Large Language Models Interpret Facial Expressions from Valence-Arousal Values
paper
related-work
v10-references
LLM-emotion
valence-arousal
2026년 6월 04일
Evaluation of cross-ethnic emotion recognition capabilities in multimodal large language models using the Reading the Mind in the Eyes Test
paper
related-work
v10-references
LLM-emotion
RMET
cross-ethnic
race
2026년 6월 04일
GPT-4 Emulates Average-Human Emotional Cognition from a Third-Person Perspective
paper
related-work
v10-references
GPT-4
average-human-modeling
emotion-cognition
LLM
third-person-perspective
2026년 6월 04일
LLMs_Do_Not_Simulate_Human_Psychology_2025
paper
LLM
HumanSimulation
Psychology
MoralJudgment
SemanticSensitivity
CENTAUR
Evaluation
persona-LDT