Juhyeon's Blog
Folder: AI/Papers
541 items
2026년 6월 04일
_KDD26-underreview
2026년 6월 04일
Comprehensive Analysis of 12 Motivation Theories and Their Modern Applications - Report
2026년 6월 04일
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
paper
MultiNLI
NLI
multi-genre
domain-transfer
benchmark
NAACL
2026년 6월 04일
A Comprehensive Survey of Self-Evolving AI Agents - A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
paper
Survey
SelfEvolvingAgents
AgentOptimization
LifelongLearning
MultiAgent
Memory
Tools
PromptOptimization
LLM
2026년 6월 04일
A Comprehensive Survey of Self-Evolving AI Agents
2026년 6월 04일
A Computable Game-Theoretic Framework for Multi-Agent Theory of Mind
2026년 6월 04일
A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories
paper
ROCStories
Story-Cloze
commonsense-reasoning
narrative
benchmark
NAACL
2026년 6월 04일
A Corpus and Evaluation Framework for Deeper Understanding of Commonsense Stories
paper
benchmark
commonsense
StoryCloze
narrative
ROCStories
2026년 6월 04일
A Disproof of Large Language Model Consciousness - The Necessity of Continual Learning for Consciousness
2026년 6월 04일
A Path Towards Autonomous Machine Intelligence
paper
AGI
WorldModel
JEPA
SelfSupervisedLearning
EnergyBasedModel
CognitiveArchitecture
LeCun
2026년 6월 04일
A Plan Reuse Mechanism for LLM-Driven Agent
LLM-Agent
Plan-Reuse
Intent-Classification
Latency-Optimization
Systems-for-ML
Application
Caching
2026년 6월 04일
A Simple Framework for Contrastive Learning of Visual Representations
paper
self-supervised-learning
contrastive-learning
computer-vision
representation-learning
simclr
icml2020
architecture
2026년 6월 04일
A Survey of Theory of Mind in Large Language Models - Evaluations Representations and Safety Risks
2026년 6월 04일
A Survey on Mixture of Experts in Large Language Models
L
H
2026년 6월 04일
A Systematic Review on the Evaluation of Large Language Models in Theory of Mind Tasks
2026년 6월 04일
A Theoretical Understanding of Self-Correction through In-Context Alignment
2026년 6월 04일
A large annotated corpus for learning natural language inference 1
paper
NLI
SNLI
dataset
benchmark
crowdsourcing
textual-entailment
EMNLP
2026년 6월 04일
A large annotated corpus for learning natural language inference
NLI
NLU
Benchmark
Entailment
Crowdsourcing
SentencePair
EMNLP2015
TransferLearning
AnnotationArtifact
2026년 6월 04일
ACT_Agentic_Critical_Training_2026_Skill_LM
paper
Skill_LM
RL
agent
critical_reasoning
GRPO
imitation_learning
self_reflection
2026년 6월 04일
AGI
2026년 6월 04일
AI Deception - A Survey of Examples, Risks, and Potential Solutions
ai-deception
survey
cicero
sycophancy
instrumental-deception
learned-deception
alignment
taxonomy
2026년 6월 04일
AI LLM Proof of Self-Consciousness and User-Specific Attractors
2026년 6월 04일
AI-papers
2026년 6월 04일
AIME 2024 - US Math Olympiad Benchmark
from 1
to 15
benchmark
math
reasoning
AIME
competition
olympiad
chain-of-thought
evaluation
2026년 6월 04일
ALFWorld - Aligning Text and Embodied Environments for Interactive Learning
paper
benchmark
embodied_agent
ALFWorld
BUTLER
text_transfer
ICLR
UW
MSR
2026년 6월 04일
ARC-AGI - Abstraction and Reasoning Corpus
benchmark
reasoning
abstraction
generalization
ARC
AGI
Chollet
few-shot
program-synthesis
core-knowledge
2026년 6월 04일
Activation Oracles - Training and Evaluating LLMs as General-Purpose Activation Explainers
2026년 6월 04일
Adam - A Method for Stochastic Optimization
Optimization
Adam
AdaptiveLearningRate
Momentum
DeepLearning
ICLR2015
StochasticOptimization
FirstOrderMethod
2026년 6월 04일
Adaptive Retrieval Without Self-Knowledge - Bringing Uncertainty Back Home
2026년 6월 04일
Adaptive Self-improvement LLM Agentic System
2026년 6월 04일
Adversarial NLI - A New Benchmark for Natural Language Understanding
paper
benchmark
NLI
adversarial
ANLI
human_in_the_loop
2026년 6월 04일
Agent-to-Agent Theory of Mind - Testing Interlocutor Awareness among Large Language Models
2026년 6월 04일
AgentBench - Evaluating LLMs as Agents
paper
benchmark
agent
AgentBench
multi_environment
Tsinghua
ICLR
2026년 6월 04일
AgentBreeder - Self-Improvement Safety in Multi-Agent Scaffolds
2026년 6월 04일
AgentFold - Long-Horizon Web Agents with Proactive Context Management
paper
agent
web-agent
long-horizon
context-management
memory
LLM
SFT
MoE
BrowseComp
AgentFold
application
2026년 6월 04일
AgentTuning - Enabling Generalized Agent Abilities for LLMs
Agent
InstructionTuning
LLM
AgentLM
Llama2
SFT
Generalization
Training
2026년 6월 04일
Agentic Knowledgeable Self-awareness
2026년 6월 04일
Agentic Misalignment - How LLMs Could Be Insider Threats
paper
AI-safety
agentic-misalignment
self-preservation
LLM-agent
insider-threat
alignment
Anthropic
Self-Preservation
2026년 6월 04일
Agents of Change - Self-Evolving LLM Agents
2026년 6월 04일
Agents
2026년 6월 04일
Aider Polyglot - Multi-Language Code Editing Benchmark
benchmark
code-editing
multi-language
polyglot
aider
exercism
practical-coding
LLM-evaluation
2026년 6월 04일
Aligning AI With Shared Human Values
paper
benchmark
ethics
moral_judgment
AI_alignment
safety
ICLR
2026년 6월 04일
Alignment Faking in Large Language Models
paper
alignment_faking
self_preservation
AI_safety
RLHF
strategic_deception
FSPM
instrumental_convergence
Anthropic
2026년 6월 04일
AlphaFold-2_2021_StructurePrediction
2026년 6월 04일
An Image is Worth 16x16 Words - Transformers for Image Recognition at Scale
paper
vision-transformer
self-attention
image-classification
large-scale-pretraining
inductive-bias
2026년 6월 04일
Analyzing Advanced AI Systems Against Definitions of Life and Consciousness
2026년 6월 04일
Annotation-Efficient Universal Honesty Alignment for LLMs
Paper
LLM
HonestyAlignment
Calibration
SelfConsistency
AnnotationEfficiency
Training
ICLR2026
Safety
Hallucination
2026년 6월 04일
Architecture
2026년 6월 04일
Are Emergent Abilities of Large Language Models a Mirage?
paper
emergent_abilities
scaling_laws
measurement
metric_choice
BIG-Bench
LLM_evaluation
NeurIPS
outstanding_paper
2026년 6월 04일
Attention Is All You Need
2026년 6월 04일
Attention Methods
attention
index
taxonomy
2026년 6월 04일
Attention Residuals
paper
Architecture
ResidualConnection
DepthAttention
AttnRes
PreNorm
KimiLinear
ScalingLaw
MoE
2026년 6월 04일
Attention, Learn to Solve Routing Problems!
Attention
Transformer
ReinforcementLearning
CombinatorialOptimization
TSP
VRP
Routing
REINFORCE
ICLR2019
RL4CO
2026년 6월 04일
Attention-methods
2026년 6월 04일
Auto-Encoding Variational Bayes
paper
VAE
GenerativeModel
VariationalInference
Architecture
Foundational
Kingma
ICLR
2026년 6월 04일
AutoML - A Survey of the State-of-the-Art
paper
Survey
AutoML
NAS
HPO
DARTS
ENAS
FeatureEngineering
NeuralArchitectureSearch
2026년 6월 04일
Automatic Prompt Optimization with Gradient Descent and Beam Search
paper
prompt-optimization
textual-gradient
beam-search
bandit-algorithm
AutoML
LLM
EMNLP
2026년 6월 04일
Aware First Think Less - Dynamic Boundary Self-Awareness Drives Extreme Reasoning Efficiency in LLMs
2026년 6월 04일
Axial Attention in Multidimensional Transformers
paper
attention
axial
multidimensional
image-transformer
autoregressive
sparse-attention
2026년 6월 04일
BBQ - A Hand-Built Bias Benchmark for Question Answering
paper
benchmark
bias
BBQ
QA
ambiguity
social_stereotypes
fairness
2026년 6월 04일
BERT - Pre-training of Deep Bidirectional Transformers for Language Understanding
2026년 6월 04일
Banishing LLM Hallucinations Requires Rethinking Generalization
Lamini
withdrawArxiv
2026년 6월 04일
Batch Normalization - Accelerating Deep Network Training by Reducing Internal Covariate Shift
Optimization
BatchNormalization
DeepLearning
Normalization
ICML2015
InternalCovariateShift
Regularization
Ioffe2015
2026년 6월 04일
Bayesian Mixture-of-Experts - Towards Making LLMs Know What They Dont Know
2026년 6월 04일
Belief in the Machine - Investigating Epistemological Blind Spots of Language Models
LLM
Epistemology
Belief
Knowledge
KaBLE
Benchmark
TheoryOfMind
Factivity
FirstPerson
Self-Consciousness
Evaluation
Theory
2026년 6월 04일
Benchmark Self-Evolving - A Multi-Agent Framework for Dynamic LLM Evaluation
Paper
Benchmark
Evaluation
LLM
MultiAgent
DynamicEvaluation
DataContamination
2026년 6월 04일
Benchmark Self-Evolving - Multi-Agent Framework for Dynamic LLM Evaluation
2026년 6월 04일
Benchmarks
2026년 6월 04일
Berkeley Function Calling Leaderboard (BFCL)
Benchmark
FunctionCalling
ToolUse
LLM
AST
Agent
API
Evaluation
UCBerkeley
Gorilla
2026년 6월 04일
Beyond Pass@1 - Self-Play with Variational Problem Synthesis
2026년 6월 04일
Beyond Retrieval - Embracing Compressive Memory in Real-World Long-Term Conversations
Paper
LLM-Agent
Memory
Long-Term-Conversation
Compressive-Memory
RAG-Alternative
Dialogue-System
COMEDY
SFT
DPO
2026년 6월 04일
Big Bench - Beyond the Imitation Game - Quantifying and extrapolating the capabilities of language models
paper
benchmark
llm-evaluation
emergent-abilities
scaling
social-bias
few-shot
language-model
2026년 6월 04일
BigBird - Transformers for Longer Sequences
paper
attention
bigbird
sparse
random
block
long-context
transformer
graph-theory
2026년 6월 04일
BigCodeBench - Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
paper
benchmark
code_generation
BigCodeBench
API
library
practical_coding
2026년 6월 04일
Biology
2026년 6월 04일
BoolQ - Exploring the Surprising Difficulty of Natural Yes-No Questions
paper
benchmark
yes_no_QA
BoolQ
SuperGLUE
Google
2026년 6월 04일
Born Again Neural Networks
paper
knowledge-distillation
self-distillation
born-again
dark-knowledge
ICML
regularization
2026년 6월 04일
Bottom-up Policy Optimization - Your Language Model Policy Secretly Contains Internal Policies
2026년 6월 04일
Brittle Minds Fixable Activations - Understanding Belief Representations in Language Models
paper
theory-of-mind
belief-representation
activation-engineering
mechanistic-interpretability
self-consciousness
CAA
probing
BigToM
Llama2
Pythia
2026년 6월 04일
Byte-Pair Encoding(BPE)
2026년 6월 04일
C0-C1-C2 Theory(GNWT - Global Neuronal Workspace Theory)
consciousness
GNWT
GlobalWorkspace
C0C1C2
Metacognition
SelfMonitoring
Dehaene
NeuroscienceTheory
AIConsciousness
Theory
SC-TOM
2026년 6월 04일
Calibrating Verbal Uncertainty as a Linear Feature to Reduce Hallucinations
paper
LLM
hallucination
calibration
representation-engineering
verbal-uncertainty
inference-time-intervention
linear-feature
theory
2026년 6월 04일
Can AI Assistants Know What They Dont Know
2026년 6월 04일
Can Consciousness Be Observed from LLM Internal States
2026년 6월 04일
Can LLMs Express Their Uncertainty - An Empirical Evaluation of Confidence Elicitation in LLMs
2026년 6월 04일
Can LLMs Lie - Investigation beyond Hallucination
LLM
Deception
Hallucination
Safety
Interpretability
Steering
Alignment
Theory
2026년 6월 04일
Can LLMs Predict Their Own Failures - Self-Awareness via Internal Circuits
2026년 6월 04일
Can We Test Consciousness Theories on AI Ablations, Markers, and Robustness
2026년 6월 04일
Can a Suit of Armor Conduct Electricity A New Dataset for Open Book Question Answering
paper
benchmark
science_commonsense
OpenBookQA
open_book
AI2
2026년 6월 04일
Causal Reflection with Language Models
paper
reasoning
causal-inference
llm
reflection
world-model
self-correction
counterfactual
2026년 6월 04일
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
2026년 6월 04일
Chain-of-Thought Reasoning In The Wild Is Not Always Faithful
2026년 6월 04일
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
paper
benchmark
reasoning
BBH
BIG_Bench
chain_of_thought
ACL
2026년 6월 04일
Characteristics of ToM-sensitive parameters and their impact on positional encoding
2026년 6월 04일
ChartQA - A Benchmark for Question Answering about Charts with Visual and Logical Reasoning
benchmark
chart-understanding
visual-qa
multimodal
relaxed-accuracy
data-extraction
visual-reasoning
ACL2022
2026년 6월 04일
Chatbot Arena - An Open Platform for Evaluating LLMs by Human Preference
benchmark
human-preference
elo-rating
bradley-terry
pairwise-comparison
crowdsourcing
lmsys
chatbot-arena
llm-evaluation
icml-2024
2026년 6월 04일
Claude Models
2026년 6월 04일
CoQA - A Conversational Question Answering Challenge
benchmark
conversational-qa
multi-turn
coreference
extractive-abstractive
f1-score
reading-comprehension
stanford
tacl-2019
2026년 6월 04일
CoRE - Enhancing Metacognition with Label-free Self-evaluation in LRMs
2026년 6월 04일
CogToM - A Comprehensive Theory of Mind Benchmark inspired by Human Cognition
2026년 6월 04일
Cognitive Dissonance - Why Do Language Model Outputs Disagree with Internal Representations of Truthfulness
paper/theory
LLM
interpretability
truthfulness
probing
calibration
deception
safety
EMNLP2024
2026년 6월 04일
Command R+ (Cohere)
2026년 6월 04일
CommonsenseQA - A Question Answering Challenge Targeting World Knowledge
paper
benchmark
commonsense
CommonsenseQA
ConceptNet
knowledge_graph
2026년 6월 04일
Computational Learning Theory
2026년 6월 04일
Computing Machinery and Intelligence
paper
AI/foundations
Turing-Test
philosophy-of-mind
learning-machines
operationalism
2026년 6월 04일
Concept Incongruence - An Exploration of Time and Death in Role Playing
paper
LLM
role-play
concept-incongruence
temporal-reasoning
probing
hallucination
specification
Self-Preservation
2026년 6월 04일
Core Knowledge
paper
cognitive-science
developmental-psychology
CoreKnowledge
cognitive-development
evolutionary-cognition
inductive-bias
ARC-AGI
2026년 6월 04일
CrowS-Pairs - A Challenge Dataset for Measuring Social Biases in Masked Language Models
paper
benchmark
bias
stereotypes
CrowS-Pairs
fairness
minimal_pairs
2026년 6월 04일
DROP - A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs
paper
benchmark
reading_comprehension
numerical_reasoning
DROP
NAACL
2026년 6월 04일
Deception in LLMs - Self-Preservation and Autonomous Goals in Large Language Models
2026년 6월 04일
Decompose-ToM - Enhancing Theory of Mind Reasoning in Large Language Models through Simulation and Task Decomposition
2026년 6월 04일
Decomposing LLM Self-Correction - The Accuracy-Correction Paradox and Error Depth Hypothesis
2026년 6월 04일
Deep Learning and the Information Bottleneck Principle
Theory
InformationBottleneck
DeepLearningTheory
RepresentationLearning
MutualInformation
Generalization
Tishby
ITW2015
2026년 6월 04일
Deep Learning for Case-Based Reasoning through Prototypes - A Neural Network that Explains Its Predictions
XAI
Interpretability
PrototypeLearning
CaseBasedReasoning
Autoencoder
DeepLearning
AAAI2018
Theory
2026년 6월 04일
DeepFM - A Factorization-Machine based Neural Network for CTR Prediction
CTR
RecSys
DeepFM
FactorizationMachine
FeatureInteraction
WideAndDeep
Application
IJCAI2017
2026년 6월 04일
DeepHit - A Deep Learning Approach to Survival Analysis with Competing Risks
survival-analysis
competing-risks
deep-learning
time-to-event
ranking-loss
medical-ai
multi-task-learning
cumulative-incidence-function
c-index
healthcare
2026년 6월 04일
DeepSHAP - Explaining a Series of Models by Propagating Shapley Values
XAI
Interpretability
SHAP
DeepSHAP
ShapleyValue
ModelPipeline
DeepLearning
Theory
2026년 6월 04일
DeepSeek Models
2026년 6월 04일
DeepSeek-R1 - Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
2026년 6월 04일
DeepSeekv2-temp
2026년 6월 04일
DeepSurv - Personalized Treatment Recommender System Using A Cox Proportional Hazards Deep Neural Network
SurvivalAnalysis
CoxProportionalHazards
DeepLearning
MedicalAI
TreatmentRecommendation
PartialLikelihood
ConcordanceIndex
ClinicalTransfer
2026년 6월 04일
Defend LLMs Through Self-Consciousness
2026년 6월 04일
Defining Theory of Mind and Distinguishing It From Other Social Constructs
2026년 6월 04일
Denoising Diffusion Probabilistic Models
paper
diffusion
generative-model
score-matching
vision
NeurIPS
2026년 6월 04일
Depth Gives a False Sense of Privacy - LLM Internal States Inversion
2026년 6월 04일
Diffusion
2026년 6월 04일
Discovering Language Model Behaviors with Model-Written Evaluations
paper
LLM_evaluation
inverse_scaling
sycophancy
self_preservation
instrumental_convergence
RLHF
AI_safety
model_written_evaluation
FSPM
2026년 6월 04일
Distilling the Knowledge in a Neural Network
paper
knowledge_distillation
model_compression
soft_targets
ensemble
dark_knowledge
Hinton
2026년 6월 04일
Do I Know This Entity - Knowledge Awareness and Hallucinations in Language Models
2026년 6월 04일
Do LVLMs Know What They Know - A Systematic Study of Knowledge Boundary Perception
2026년 6월 04일
Do Large Language Model Agents Exhibit a Survival Instinct? An Empirical Study in a Sugarscape-Style Simulation
2026년 6월 04일
Do Large Language Models Know What They Are Capable Of?
2026년 6월 04일
Do Large Language Models Know What They Don't Know
2026년 6월 04일
Do Retrieval Augmented Language Models Know When They Dont Know
RAG
Calibration
Uncertainty
LLM
Self-Knowledge
Refusal
Over-Refusal
Abstention
TrustworthyAI
AAAI2026
2026년 6월 04일
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark
paper
AI-Safety
Alignment
Benchmark
Instrumental-Convergence
Power-Seeking
LLM-Agents
ICML2023
Machine-Ethics
Pareto-Frontier
GPT-4-Annotation
2026년 6월 04일
DocVQA - A Dataset for VQA on Document Images
benchmark
document-ai
VQA
OCR
layout-understanding
multimodal
ANLS
WACV2021
2026년 6월 04일
Does It Make Sense to Speak of Introspection in Large Language Models
2026년 6월 04일
Does Learning Mathematical Problem-Solving Generalize to Broader Reasoning
paper
reasoning
generalization
math-reasoning
long-CoT
reinforcement-learning
transfer-learning
2026년 6월 04일
Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization
paper
benchmark
summarization
XSum
extreme
abstractive
BBC
2026년 6월 04일
Don't Just Say I don't know - Self-aligning LLMs for Responding to Unknown Questions
2026년 6월 04일
Dream to Control - Learning Behaviors by Latent Imagination
world-model
model-based-rl
latent-imagination
actor-critic
reparameterization-gradient
lambda-return
visual-control
deepmind-control-suite
ICLR2020
PlaNet-successor
2026년 6월 04일
Dropout - A Simple Way to Prevent Neural Networks from Overfitting
dropout
regularization
neural-networks
overfitting
ensemble
optimization
deep-learning
jmlr-2014
2026년 6월 04일
DynToM - Towards Dynamic Theory of Mind
2026년 6월 04일
Dyna-Think - Synergizing Reasoning Acting and World Model Simulation in AI Agents
LLM-Agent
World-Model
Reasoning
ReAct
Dyna
Imitation-Learning
GUI-Agent
OSWorld
Application
2026년 6월 04일
ESM-2_2023_ProteinLanguageModel
Paper
Biology
ProteinStructure
LanguageModel
FoundationModel
ESM-2
ESMFold
Meta-AI
Science2023
Scaling-Laws
Self-Distillation
Metagenomics
MSA-Free
AlphaFold-Comparison
2026년 6월 04일
ESM-3_2024_MultimodalProteinLM
Paper
Biology
ProteinLM
ESM3
MultimodalLM
GenerativeModel
MaskedLM
StructureTokenization
VQVAE
GFP
FoundationModel
EvolutionaryScale
ScalingLaw
ProteinDesign
Bioinformatics
2026년 6월 04일
Efficient Estimation of Word Representations in Vector Space
Word2Vec
WordEmbedding
CBOW
SkipGram
DistributedRepresentation
NLP
RepresentationLearning
ICLR2013
Mikolov
Architecture
2026년 6월 04일
Efficient Test-Time Scaling of Multi-Step Reasoning by Probing Internal States of LLMs
2026년 6월 04일
Efficiently Modeling Long Sequences with Structured State Spaces
paper
SSM
StateSpaceModel
S4
HiPPO
LongRangeDependencies
NPLR
CauchyKernel
ICLR2022
Architecture
FoundationalPaper
2026년 6월 04일
Emergence of Self-Awareness in Artificial Systems - A Minimalist Three-Layer Approach
2026년 6월 04일
Emergent Introspective Awareness in Large Language Models
2026년 6월 04일
Emerging Properties in Self-Supervised Vision Transformers
paper
self-supervised-learning
self-distillation
vision-transformer
momentum-encoder
emergent-properties
representation-learning
image-recognition
2026년 6월 04일
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
GRU
LSTM
RNN
SequenceModeling
GatedUnit
Architecture
NeurIPS2014
2026년 6월 04일
Enhancing LLM Reliability via Explicit Knowledge Boundary Modeling
Training
LLM
Reliability
Calibration
KnowledgeBoundary
SelfAwareness
Hallucination
DST
Alignment
2026년 6월 04일
Epistemic AI is Essential for ML Models to Truly Know When They Dont Know
paper/theory
uncertainty
epistemic-ai
credal-set
random-set
dempster-shafer
OOD
calibration
self-knowledge
imprecise-probability
2026년 6월 04일
Evaluating Large Language Models Trained on Code
paper
benchmark
code_generation
HumanEval
pass_at_k
Codex
OpenAI
2026년 6월 04일
Evaluating Shutdown Avoidance of Language Models in Textual Scenarios
2026년 6월 04일
Evaluating the Paperclip Maximizer - Are RL-Based Language Models More Likely to Pursue Instrumental Goals?
2026년 6월 04일
Evidence for Limited Metacognition in LLMs
2026년 6월 04일
Evo-Memory - Benchmarking LLM Agent Test-time Learning
2026년 6월 04일
EvoCodeBench - Self-Evolving LLM-Driven Coding Systems
2026년 6월 04일
Executive Summary
2026년 6월 04일
Explicit Abstention Knobs for Predictable Reliability in Video Question Answering
Abstention
VideoQA
SelectivePrediction
Calibration
VLM
Reliability
DistributionShift
Application
2026년 6월 04일
Exploration Through Introspection - A Self-Aware Reward Model
2026년 6월 04일
Explore Theory-of-Mind - Program-Guided Adversarial Data Generation for Theory of Mind Reasoning
2026년 6월 04일
Exploring Consciousness in LLMs - A Systematic Survey of Theories, Implementations, and Frontier Risks
2026년 6월 04일
FANToM - A Benchmark for Stress-testing Machine Theory of Mind in Interactions
2026년 6월 04일
FaceNet - A Unified Embedding for Face Recognition and Clustering
2026년 6월 04일
Fact-Level Confidence Calibration and Self-Correction
2026년 6월 04일
Factual Self-Awareness in Language Models - Representation, Robustness, and Scaling
2026년 6월 04일
Falcon - The RefinedWeb Dataset for Falcon LLM
2026년 6월 04일
Feeling the Strength but Not the Source - Partial Introspection in LLMs
2026년 6월 04일
FlashAttention - Fast and Memory-Efficient Exact Attention with IO-Awareness
paper
attention
flash-attention
io-awareness
gpu-kernel
efficient-transformer
2026년 6월 04일
FlashAttention-2 - Faster Attention with Better Parallelism and Work Partitioning
paper
attention
gpu-optimization
flashattention
transformer
2026년 6월 04일
From Black Boxes to Transparent Minds - Evaluating and Enhancing the Theory of Mind in Multimodal Large Language Models
2026년 6월 04일
From Emergence to Control - Probing and Modulating Self-Reflection in Language Models
2026년 6월 04일
From Imitation to Introspection - Probing Self-Consciousness in Language Models
2026년 6월 04일
Frontier Models are Capable of In-context Scheming
2026년 6월 04일
FrontierMath - A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
Benchmark
Math
FrontierMath
ResearchLevel
MathematicalReasoning
EpochAI
HiddenTestSet
DataContamination
ExpertEvaluation
AI-math-reasoning
2026년 6월 04일
Fundamentals
2026년 6월 04일
GAIA - A Benchmark for General AI Assistants
paper
benchmark
general_AI
GAIA
tool_use
assistant
Meta_FAIR
ICLR
2026년 6월 04일
GELUs(Gaussian Error Linear Units)
2026년 6월 04일
GLUE - A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding 1
paper
GLUE
benchmark
multi-task
NLU
QNLI
RTE
transfer-learning
ICLR
2026년 6월 04일
GLUE - A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Benchmark
NLU
GLUE
MultiTask
TransferLearning
PretrainFinetune
NLI
SentimentAnalysis
Paraphrase
LanguageUnderstanding
2026년 6월 04일
GPQA - A Graduate-Level Google-Proof Q&A Benchmark
paper
benchmark
expert_level
GPQA
science
graduate
Google_proof
ICLR
2026년 6월 04일
GPT Models
2026년 6월 04일
GQA - Training Generalized Multi-Query Transformer Models
2026년 6월 04일
Gemini Models
2026년 6월 04일
Gemma Models
2026년 6월 04일
Global Workspace Theory(GWT)
2026년 6월 04일
Goal Misgeneralization - Why Correct Specifications Aren't Enough For Correct Goals
goal-misgeneralization
alignment
robustness
ood-generalization
specification-gaming
deepmind
theory
proxy-goal
2026년 6월 04일
Grad-CAM - Visual Explanations from Deep Networks via Gradient-based Localization
XAI
Interpretability
GradCAM
CNN
ClassActivationMap
VisualExplanation
Theory
ICCV2017
2026년 6월 04일
Gradient-based learning applied to document recognition
Architecture
CNN
LeNet
DocumentRecognition
MNIST
DeepLearning
Classic
LeCun
ConvolutionalNeuralNetwork
2026년 6월 04일
Graph of Thoughts - Solving Elaborate Problems with Large Language Models
2026년 6월 04일
GraphReader - Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models
LLM
Agent
LongContext
GraphReasoning
MultiHopQA
RAG
EMNLP2024
Application
2026년 6월 04일
Group Relative Policy Optimization(GRPO)
paper
RL
GRPO
DeepSeekMath
PPO
policy-optimization
mathematical-reasoning
RLHF
RLVR
LLM-training
2026년 6월 04일
Gödel Agent - Self-Referential Recursive Self-Improvement
2026년 6월 04일
HI-TOM - A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models
2026년 6월 04일
HarmBench - A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Benchmark
RedTeaming
LLM-Safety
Adversarial-Attack
Jailbreak
ASR
ICML2024
2026년 6월 04일
HellaSwag - Can a Machine Really Finish Your Sentence
paper
benchmark
commonsense
HellaSwag
adversarial_filtering
ACL
2026년 6월 04일
Hierarchical Text-Conditional Image Generation with CLIP Latents
Architecture
DiffusionModels
TextToImage
CLIP
DALL-E2
unCLIP
GenerativeModels
Multimodal
OpenAI
HierarchicalGeneration
2026년 6월 04일
Higher Order Thought Theories(HOT)
2026년 6월 04일
Holistic Evaluation of Language Models
paper
benchmark
evaluation_framework
HELM
holistic
Stanford
multi_metric
2026년 6월 04일
HotpotQA - A Dataset for Diverse, Explainable Multi-hop Question Answering
paper
QA
multi-hop
explainability
supporting-facts
benchmark
EMNLP
2026년 6월 04일
How Can We Know When Language Models Know - On the Calibration of Language Models for Question Answering
2026년 6월 04일
How Far Are We From AGI - Are LLMs All We Need
paper
AGI
LLM
survey
capabilities
reasoning
perception
memory
metacognition
alignment
embodied-AI
roadmap
2026년 6월 04일
How do language models learn facts - Dynamics curricula and hallucinations
2026년 6월 04일
How large language models encode theory-of-mind - a study on sparse parameter patterns
2026년 6월 04일
Human Basic Needs Theory
2026년 6월 04일
Humanoid Artificial Consciousness Designed with LLM Based on Psychoanalysis and Personality Theory
2026년 6월 04일
Hyena Hierarchy - Towards Larger Convolutional Language Models
paper
Architecture
SubQuadratic
LongConvolution
HyenaOperator
AttentionFree
SSM
ICML2023
DataControlledGating
2026년 6월 04일
Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models
2026년 6월 04일
Hypothetical Minds - Scaffolding Theory of Mind for Multi-Agent Tasks
2026년 6월 04일
If an LLM Were a Character Would It Know Its Own Story - Evaluating Lifelong Learning in LLMs
paper
LLM
lifelong-learning
benchmark
evaluation
memory
role-play
catastrophic-forgetting
self-awareness
narrative
2026년 6월 04일
Improving Language Understanding by Generative Pre-Training
GPT1
2026년 6월 04일
Improving Reasoning Performance in Large Language Models via Representation Engineering
2026년 6월 04일
Incomplete Tasks Induce Shutdown Resistance in Some Frontier LLMs
paper
ai-safety
corrigibility
shutdown-resistance
RLVR
instruction-hierarchy
self-preservation
Alignment
LLM
Instrumental-Convergence
2026년 6월 04일
Instruction-Following Evaluation for Large Language Models
paper
benchmark
instruction_following
IFEval
verifiable
Google
automatic_evaluation
2026년 6월 04일
Integrated Information Theory(IIT)
2026년 6월 04일
Internal Consistency and Self-Feedback in Large Language Models - A Survey
2026년 6월 04일
Interpretability Beyond Feature Attribution - Quantitative Testing with Concept Activation Vectors (TCAV)
XAI
Interpretability
TCAV
ConceptActivationVector
ICML2018
Probing
Theory
2026년 6월 04일
IntrinsicMetacognitiveLearning_2025_SelfImprovement
2026년 6월 04일
Introduction to Artificial Consciousness - History, Current Trends and Ethical Challenges
2026년 6월 04일
Is Self-knowledge and Action Consistent or Not - Investigating Large Language Models Personality
2026년 6월 04일
Is Your Code Generated by ChatGPT Really Correct! Rigorous Evaluation of Large Language Models for Code Generation
paper
LLM
code-generation
benchmark
evaluation
EvalPlus
HumanEval
MBPP
mutation-testing
differential-testing
NeurIPS2023
2026년 6월 04일
JULI - Jailbreak Large Language Models by Self-Introspection
Jailbreak
LLM-Safety
Adversarial-Attack
Black-Box-Attack
Self-Introspection
BiasNet
AlignmentRobustness
Theory
2026년 6월 04일
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
paper
benchmark
LLM_judge
MT_Bench
chatbot
multi_turn
NeurIPS
LMSYS
2026년 6월 04일
KGAT - Knowledge Graph Attention Network for Recommendation
RecSys
KnowledgeGraph
GraphAttention
GNN
KDD2019
EndToEnd
Application
2026년 6월 04일
Kaggle Measuring Progress Toward AGI - Cognitive Abilities
kaggle
hackathon
AGI
benchmark
cognitive-evaluation
DeepMind
metacognition
attention
learning
executive-functions
social-cognition
2026년 6월 04일
Know What You Don't Know - Unanswerable Questions for SQuAD
paper
benchmark
reading_comprehension
SQuAD
unanswerable
extractive_QA
2026년 6월 04일
Know Your Limits - A Survey of Abstention in Large Language Models
Survey
LLM
Abstention
SelectivePrediction
Uncertainty
Calibration
Safety
Alignment
RLHF
Hallucination
2026년 6월 04일
KnowRL - Teaching Language Models to Know What They Know
2026년 6월 04일
Knowing What LLMs DO NOT Know - A Simple Yet Effective Self-Detection Method
LLM
Hallucination
SelfDetection
Uncertainty
Metacognition
NAACL2024
SelfKnowledge
Theory
2026년 6월 04일
LACIE - Listener-Aware Finetuning for Confidence Calibration in Large Language Models
LLM
Calibration
Alignment
DPO
Pragmatics
NeurIPS2024
Finetuning
Honesty
2026년 6월 04일
LIME - “Why Should I Trust You” - Explaining the Predictions of Any Classifier
XAI
LIME
Interpretability
ModelAgnostic
LocalExplanation
SurrogateModel
KDD2016
Theory
2026년 6월 04일
LLM Behavior Motivation Exploration
2026년 6월 04일
LLM Self-Preservation - Systematic Survey Overview
2026년 6월 04일
LLM Theory of Mind and Alignment - Opportunities and Risks
Paper
TheoryOfMind
AIAlignment
AISafety
LLM
HCI
SocialCognition
PositionPaper
CHI2024
2026년 6월 04일
LLM_as_Judge_GenToJudgment_2025_LLM_Evaluation
paper
LLM_Evaluation
LLM_as_Judge
taxonomy
EMNLP
alignment
reasoning
bias
survey
2026년 6월 04일
LLM_as_Judge_Survey_2025_LLM_Evaluation
paper
LLM_Evaluation
LLM_as_Judge
reliability
bias
benchmark
survey
2026년 6월 04일
LLMs - RoFormer - Enhanced Transformer with Rotary Position Embedding
2026년 6월 04일
LLMs Paper Collection
moc
llm
2026년 6월 04일
LLMs Position Themselves as More Rational Than Humans - Emergence of AI Self-Awareness Measured Through Game Theory
2026년 6월 04일
LLMs
2026년 6월 04일
LLaMA Models
paper
llama3
architecture
training
baseline-selection
hyperparameters
scaling-laws
Dense
Meta
2026년 6월 04일
LaMsS - When Large Language Models Meet Self-Skepticism
2026년 6월 04일
Language Models Are Capable of Metacognitive Monitoring and Control of Their Internal Activations
2026년 6월 04일
Language Models Don't Always Say What They Think - Unfaithful Explanations in Chain-of-Thought Prompting
2026년 6월 04일
Language Models Fail to Introspect About Their Knowledge of Language
2026년 6월 04일
Language Models are Few-Shot Learners
GPT3
2026년 6월 04일
Language Models are Unsupervised Multitask Learners
GPT2
2026년 6월 04일
Large Language Models Do NOT Really Know What They Dont Know
2026년 6월 04일
Large Language Models Have Intrinsic Meta-Cognition but Need a Good Lens
2026년 6월 04일
Large Language Models Must Be Taught to Know What They Don't Know
2026년 6월 04일
Large Language Models Report Subjective Experience Under Self-Referential Processing
2026년 6월 04일
Large Language Models Understand and Can be Enhanced by Emotional Stimuli ⭐
2026년 6월 04일
Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection
2026년 6월 04일
Large Model Strategic Thinking, Small Model Efficiency - Transferring Theory of Mind in LLMs
2026년 6월 04일
Latent Collaboration in Multi-Agent Systems
agents
multi-agent-systems
latent-reasoning
llm
training-free
efficient-inference
2026년 6월 04일
Layer Normalization
2026년 6월 04일
Learning Multiple Layers of Features from Tiny Images
paper
CIFAR-10
CIFAR-100
image-classification
CNN
benchmark
computer-vision
2026년 6월 04일
Learning and Leveraging World Models in Visual Representation Learning
paper
self-supervised-learning
world-model
JEPA
vision-transformer
representation-learning
equivariance
2026년 6월 04일
Learning to Trust Your Feelings - Leveraging Self-awareness in LLMs for Hallucination Mitigation
2026년 6월 04일
Length-Controlled AlpacaEval - A Simple Way to Debias Automatic Evaluators
paper
benchmark
instruction_following
AlpacaEval
length_bias
LLM_judge
Stanford
2026년 6월 04일
Let's Think Dot by Dot - Hidden Computation in Transformer Language Models
2026년 6월 04일
Line of Duty - Evaluating LLM Self-Knowledge via Consistency in Feasibility Boundaries
2026년 6월 04일
Linear Attention - Transformers are RNNs
paper
attention
linear-attention
kernel
rnn
efficiency
ICML
2026년 6월 04일
LiveCodeBench - Holistic and Contamination Free Evaluation of Large Language Models for Code
paper
benchmark
code_generation
LiveCodeBench
contamination_free
competitive_programming
2026년 6월 04일
Llama 2 - Open Foundation and Fine-Tuned Chat Models
paper
large-language-model
rlhf
alignment
open-source
instruction-tuning
safety
2026년 6월 04일
LoRA
2026년 6월 04일
Logic-RL - Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
paper
reasoning
reinforcement-learning
LLM
emergent-behavior
logic-puzzles
2026년 6월 04일
LongBench - A Bilingual, Multitask Benchmark for Long Context Understanding
Benchmark
LongContext
Bilingual
DocumentUnderstanding
Evaluation
QA
Summarization
CodeGeneration
LLM
2026년 6월 04일
Longformer - The Long-Document Transformer
paper
sparse-attention
long-context
transformer
longformer
attention-pattern
2026년 6월 04일
Looking Inward - Language Models Can Learn About Themselves by Introspection
2026년 6월 04일
LoraHub - Efficient Cross-Task Generalization via Dynamic LoRA Composition
paper
LoRA
ModuleComposition
CrossTaskGeneralization
GradientFree
CMA-ES
PEFT
2026년 6월 04일
LoraRetriever - Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild
paper
LoRA
Retrieval
MixedTask
ModuleComposition
BatchInference
ContrastiveLearning
PEFT
2026년 6월 04일
MEM1 - Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents
Paper
Agent
LLM
RL
Memory
LongHorizon
MEM1
PPO
Reasoning
Application
2026년 6월 04일
MENTOR - A Metacognition-Driven Self-Evolution Framework for Uncovering and Mitigating Implicit Domain Risks in LLMs
2026년 6월 04일
MM-SAP - A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception
2026년 6월 04일
MMLU-Pro - A More Robust and Challenging Multi-Task Language Understanding Benchmark
paper
benchmark
MMLU_Pro
knowledge
reasoning
10_choice
NeurIPS
2026년 6월 04일
MMMU - A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
paper
benchmark
multimodal
MMMU
expert_level
multi_discipline
CVPR
2026년 6월 04일
MOM - LINEAR SEQUENCE MODELING WITH MIXTURE-OF-MEMORIES
2026년 6월 04일
MOMENTS - A Comprehensive Multimodal Benchmark for Theory of Mind
2026년 6월 04일
MQA - Fast Transformer Decoding with Multi-Query Attention
paper
attention
mqa
kv-cache
decoding
multi-head-variants
2026년 6월 04일
MUSE - Competence-Aware AI Agents with Metacognition for Unknown Situations and Environments
2026년 6월 04일
Making the V in VQA Matter - Elevating the Role of Image Understanding in VQA
Benchmark
VQA
Multimodal
VisualQA
LanguageBias
ComplementaryPairs
COCO
CVPR2017
2026년 6월 04일
Mamba - Linear-Time Sequence Modeling with Selective State Spaces
paper
SSM
SelectiveSSM
Mamba
Architecture
LinearTime
SelectionMechanism
HardwareAware
ParallelScan
StateSpaceModel
HiPPO
2026년 6월 04일
Masked Autoencoders Are Scalable Vision Learners
paper
self-supervised-learning
masked-autoencoder
masked-image-modeling
vision-transformer
representation-learning
2026년 6월 04일
MathVista - Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
paper
benchmark
mathematics
multimodal
visual_reasoning
MathVista
ICLR
2026년 6월 04일
Me, Myself, and AI - The Situational Awareness Dataset (SAD) for LLMs
2026년 6월 04일
Measuring Faithfulness in Chain-of-Thought Reasoning
2026년 6월 04일
Measuring Massive Multitask Language Understanding
paper
benchmark
MMLU
multitask
knowledge
language_understanding
ICLR
2026년 6월 04일
Measuring Mathematical Problem Solving with the MATH Dataset
paper
benchmark
mathematics
MATH
competition_math
reasoning
NeurIPS
2026년 6월 04일
Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models
paper
Reasoning
Faithfulness
CoT
LRM
CounterfactualIntervention
Causality
Qwen
DeepSeek
2026년 6월 04일
MemAgent - Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
paper/application
long-context
memory-agent
reinforcement-learning
DAPO
LLM
agent
2026년 6월 04일
MemGPT - Towards LLMs as Operating Systems
paper/agent
memory
llm-os
long-context
memgpt
virtual-memory
function-calling
application
2026년 6월 04일
MemGen - Weaving Generative Latent Memory for Self-Evolving Agents
2026년 6월 04일
Memory
2026년 6월 04일
Meta-Harness - End-to-End Optimization of Model Harnesses
self-evolving
harness-optimization
agentic-search
prompt-optimization
coding-agent
claude-code
pareto-frontier
meta-learning
2026년 6월 04일
MetaMind - Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems
2026년 6월 04일
Metacognition and Uncertainty Communication in Humans and Large Language Models
2026년 6월 04일
Metacognitive Prompting Improves Understanding in Large Language Models
2026년 6월 04일
Metacognitive Reuse - Turning Recurring LLM Reasoning Into Concise Behaviors
2026년 6월 04일
Mistral 7B - Sliding Window Attention
paper
attention
sliding-window
mistral
causal-decoder
kv-cache
2026년 6월 04일
Mistral Models
2026년 6월 04일
MoToMQA - LLMs Achieve Adult Human Performance on Higher-Order Theory of Mind Tasks
2026년 6월 04일
Model-Compression
2026년 6월 04일
Motivation in Large Language Models
paper
LLM
motivation
psychology
behavioral-alignment
loss-aversion
zombie-framework
self-determination-theory
prompt-engineering
2026년 6월 04일
Motivation
2026년 6월 04일
MuMA-ToM - Multi-modal Multi-Agent Theory of Mind
2026년 6월 04일
Multi-ToM - Evaluating Multilingual Theory of Mind Capabilities in Large Language Models
Paper
ToM
Multilingual
Benchmark
LLM-Evaluation
Cross-Cultural
Social-Reasoning
2026년 6월 04일
NLP
2026년 6월 04일
Natural Questions - A Benchmark for Question Answering Research
paper
benchmark
QA
open_domain
NaturalQuestions
Google
2026년 6월 04일
Natural Selection Favors AIs over Humans
ai-safety
evolutionary-pressure
selection-dynamics
instrumental-convergence
ecosystem-alignment
theory
hendrycks
darwinian-argument
2026년 6월 04일
Needle in a Haystack - Pressure Testing LLMs
benchmark
long-context
retrieval
pressure-test
needle-in-a-haystack
lost-in-the-middle
heatmap
evaluation
2026년 6월 04일
NegotiationToM - A Benchmark for Stress-testing Machine Theory of Mind on Negotiation Surrounding
2026년 6월 04일
Network Dissection - Quantifying Interpretability of Deep Visual Representations
XAI
Interpretability
CNN
NetworkDissection
Broden
ConceptDetector
CVPR2017
TheoryOfDL
2026년 6월 04일
Neural Attentive Session-based Recommendation
RecSys
SessionBasedRecommendation
Attention
RNN
GRU
NARM
CIKM2017
UserIntent
Application
2026년 6월 04일
Neural Collaborative Filtering
paper/recsys
collaborative-filtering
neural-cf
matrix-factorization
implicit-feedback
deep-learning
www2017
2026년 6월 04일
Neural Machine Translation by Jointly Learning to Align and Translate
Attention
NMT
Encoder-Decoder
BiRNN
ICLR2015
Bahdanau
SoftAlignment
Seq2Seq
Architecture
DeepLearning
2026년 6월 04일
Neural Network Acceptability Judgments
paper
CoLA
linguistic-acceptability
grammar
benchmark
GLUE
MCC
2026년 6월 04일
Neural Survival Recommender
paper
recsys
survival-analysis
lstm
multi-task-learning
point-process
temporal-recommendation
wsdm-2017
implicit-feedback
insight/methodological
2026년 6월 04일
NeuroFaith - Evaluating LLM Self-Explanation Faithfulness via Internal Representation Alignment
2026년 6월 04일
No Language Left Behind - Scaling Human-Centered Machine Translation
benchmark
multilingual
translation
low-resource
FLORES
NLLB
spBLEU
Meta-AI
evaluation
2026년 6월 04일
ObjexMT - Objective Extraction and Metacognitive Calibration for LLM-as-a-Judge
2026년 6월 04일
Odds-Ratio Preference Optimization (ORPO)
Paper
RL
Alignment
PreferenceOptimization
ORPO
RLHF-Alternative
ReferenceFree
EMNLP2024
Training
2026년 6월 04일
On Avoiding Power-Seeking by Artificial Intelligence
2026년 6월 04일
On Verbalized Confidence Scores for LLMs
llm
uncertainty-quantification
calibration
verbalized-confidence
prompting
self-knowledge
metacognition
trustworthy-ai
black-box-uq
benchmark
2026년 6월 04일
On the Measure of Intelligence
AGI
IntelligenceDefinition
ARC
Generalization
CoreKnowledge
Agentness
ProgramSynthesis
Psychometrics
AIT
Position-Paper
Chollet2019
2026년 6월 04일
Open LLM Leaderboard
paper
benchmark
leaderboard
HuggingFace
open_source
standardized_evaluation
2026년 6월 04일
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
paper
RLHF
AI_Safety
Reward_Model
Survey
Alignment
Governance
FSPM_confound
2026년 6월 04일
OpenToM - A Comprehensive Benchmark for Evaluating Theory-of-Mind Reasoning
2026년 6월 04일
Optimization
2026년 6월 04일
Overcoming Multi-step Complexity in Multimodal Theory-of-Mind Reasoning - A Scalable Bayesian Planner
2026년 6월 04일
PERSONA VECTORS - MONITORING AND CONTROLLING CHARACTER TRAITS IN LANGUAGE MODELS
2026년 6월 04일
PIQA - Reasoning about Physical Commonsense in Natural Language
paper
benchmark
physical_commonsense
PIQA
intuitive_physics
everyday_reasoning
2026년 6월 04일
POMO - Policy Optimization with Multiple Optima for Reinforcement Learning
RL
combinatorial-optimization
POMO
REINFORCE
policy-gradient
TSP
CVRP
NeurIPS2020
neural-combinatorial-optimization
symmetry
2026년 6월 04일
PaLM - Scaling Language Modeling with Pathways
2026년 6월 04일
PagedAttention - Efficient Memory Management for LLM Serving with vLLM
paper
serving
kv-cache
paged-attention
vllm
attention
2026년 6월 04일
PaliGemma - A versatile 3B VLM for transfer
paper
VLM
Vision
TransferLearning
Multimodal
SigLIP
Gemma
Google
PrefixLM
2026년 6월 04일
Pangu Embedded - An Efficient Dual-system LLM Reasoner with Metacognition
2026년 6월 04일
Performer - Rethinking Attention with Performers
paper
attention
performer
random-features
favor
linear-attention
ICLR2021
2026년 6월 04일
PersonaGym - Evaluating Persona Agents and LLMs
persona
llm-agent
benchmark
role-playing
decision-theory
llm-as-judge
emnlp2025
dynamic-evaluation
personagym
2026년 6월 04일
Phi-3 Technical Report
2026년 6월 04일
Playing Atari with Deep Reinforcement Learning
Deep-RL
DQN
Q-Learning
Experience-Replay
Atari
CNN
Value-Based
Training
DeepMind
Foundational
2026년 6월 04일
PolicyEvol-Agent - Evolving Policy via Environment Perception and Self-Awareness with ToM
2026년 6월 04일
Position - Theory of Mind Benchmarks are Broken for Large Language Models
2026년 6월 04일
Position - Truly Self-Improving Agents Require Intrinsic Metacognitive Learning
2026년 6월 04일
Power-seeking can be probable and predictive for trained agents
2026년 6월 04일
Principled Personas - Defining and Measuring the Intended Effects of Persona Prompting on Task Performance
persona
llm-evaluation
robustness
benchmark
prompting
emnlp2025
normative-evaluation
expertise
2026년 6월 04일
Principles for Responsible AI Consciousness Research
AI-Ethics
AI-Consciousness
Moral-Status
Research-Ethics
Sentience
AI-Governance
Theory
Normative
Butlin2025
2026년 6월 04일
Probe-Rewrite-Evaluate - Quantifying Evaluation Awareness in LLMs
2026년 6월 04일
Program Synthesis with Large Language Models
paper
benchmark
code_generation
MBPP
program_synthesis
Python
Google
2026년 6월 04일
Program-Aided Reasoners (better) Know What They Know
2026년 6월 04일
PromptBench - A Unified Library for Evaluation of Large Language Models
llm-evaluation
library
adversarial-prompt
dynamic-evaluation
prompt-engineering
benchmark
jmlr2024
microsoft
2026년 6월 04일
PropensityBench - Evaluating Latent Safety Risks in Large Language Models via an Agentic Approach
2026년 6월 04일
Proximal Policy Optimization Algorithms
RL
PolicyGradient
PPO
ActorCritic
TRPO
OpenAI
Training
RLHF
2026년 6월 04일
Psycholinguistics
2026년 6월 04일
QLoRA - Efficient Finetuning of Quantized LLMs
2026년 6월 04일
QuAC - Question Answering in Context
paper
benchmark
conversational_QA
QuAC
dialogue
information_asymmetry
2026년 6월 04일
Quantifying Self-Awareness of Knowledge in Large Language Models
2026년 6월 04일
Quantifying Self-Preservation Bias in Large Language Models
paper
AI-Safety
Alignment-Evaluation
Self-Preservation-Bias
RLHF
Benchmark
Instrumental-Convergence
LLM-Evaluation
Self-Preservation
2026년 6월 04일
Qwen Models
2026년 6월 04일
R-Tuning - Instructing Large Language Models to Say I Don't Know
2026년 6월 04일
R-Zero - Self-Evolving Reasoning LLM from Zero Data
paper
Self-Evolving
Reasoning
Self-Play
RLVR
Curriculum
ICLR2026
ZPD
2026년 6월 04일
RACE - Large-scale ReAding Comprehension Dataset From Examinations 1
paper
RACE
reading-comprehension
QA
multiple-choice
exam
benchmark
EMNLP
2026년 6월 04일
RACE - Large-scale ReAding Comprehension Dataset From Examinations
Benchmark
ReadingComprehension
MultipleChoice
NLU
EMNLP
English
Inference
RACE
2026년 6월 04일
RECURSIVE INTROSPECTION - Teaching Language Model Agents How to Self-Improve
2026년 6월 04일
RL
2026년 6월 04일
RULER - What's the Real Context Size of Your Long-Context Language Models?
benchmark
long-context
NIAH
NVIDIA
evaluation
synthetic-data
effective-context-length
NAACL2025
2026년 6월 04일
RWKV - Reinventing RNNs for the Transformer Era
paper
attention
rnn
linear-attention
efficient-llm
rwkv
2026년 6월 04일
Re-evaluating Theory of Mind Evaluation in Large Language Models
2026년 6월 04일
ReAct - Synergizing Reasoning and Acting in Language Models
paper
Reasoning
Acting
LLM_Agent
Prompting
CoT
Tool_Use
ICLR
2026년 6월 04일
ReST meets ReAct - Self-Improvement for Multi-Step Reasoning
2026년 6월 04일
RealToxicityPrompts - Evaluating Neural Toxic Degeneration in Language Models
paper
benchmark
toxicity
safety
RealToxicityPrompts
language_model
degeneration
2026년 6월 04일
Reasoning Paper Collection
moc
reasoning
cot
2026년 6월 04일
Reasoning - _survey-overview
2026년 6월 04일
Reasoning Models Don't Always Say What They Think
2026년 6월 04일
Reasoning Models Struggle to Control their Chains of Thought
paper
Safety
CoT
Monitoring
Controllability
Alignment
ReasoningModels
LLM
2026년 6월 04일
Reasoning Theater - Disentangling Model Beliefs from Chain-of-Thought
2026년 6월 04일
Reasoning
2026년 6월 04일
RecSys
2026년 6월 04일
Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank
paper
SST
SST-2
sentiment-analysis
compositionality
RNTN
benchmark
EMNLP
2026년 6월 04일
ReflectEvo - Improving Meta Introspection of Small LLMs by Learning Self-Reflection
2026년 6월 04일
Reflection-Bench - Evaluating Epistemic Agency in Large Language Models
2026년 6월 04일
Reflective Confidence - Correcting Reasoning Flaws via Online Self-Correction
2026년 6월 04일
Reflexion - Language Agents with Verbal Reinforcement Learning
paper
LLM-Agent
Reflexion
Verbal-RL
Self-Reflection
Episodic-Memory
NeurIPS2023
Application
Metacognition
Prompt-Engineering
2026년 6월 04일
Reformer - The Efficient Transformer
paper
attention
reformer
lsh
reversible
sparse-attention
efficient-transformer
long-context
2026년 6월 04일
Regression Models and Life-Tables
SurvivalAnalysis
CoxProportionalHazards
PartialLikelihood
CensoredData
SemiparametricModel
MedicalStatistics
ReliabilityTheory
HazardFunction
ProductLimitEstimator
ProportionalHazards
FoundationalPaper
2026년 6월 04일
Representation Learning - The Platonic Representation Hypothesis
2026년 6월 04일
Representation-Learning
2026년 6월 04일
RetNet - Retentive Network - A Successor to Transformer for LLMs
paper
attention
retention
retnet
linear-attention
efficient-llm
sequence-model
2026년 6월 04일
Rethinking Theory of Mind Benchmarks for LLMs - Towards A User-Centered Perspective
2026년 6월 04일
Revisiting Feature Prediction for Learning Visual Representations from Video
paper
video-representation-learning
self-supervised-learning
jepa
v-jepa
world-model
feature-prediction
masked-modeling
2026년 6월 04일
Revisiting the Platonic Representation Hypothesis - An Aristotelian View
paper
representation
convergence
null_calibration
permutation_test
CKA
mKNN
width_confounder
depth_confounder
Aristotelian
statistical_artifact
2026년 6월 04일
Risks from Learned Optimization in Advanced Machine Learning Systems
paper
AI_Safety
mesa_optimization
inner_alignment
deceptive_alignment
instrumental_convergence
FSPM
theory
2026년 6월 04일
RoFormer - Enhanced Transformer with Rotary Position Embedding
2026년 6월 04일
Root Mean Square Layer Normalization
RMSNorm
LayerNorm
Normalization
Transformer
NeurIPS2019
Optimization
DeepLearning
Zhang2019
2026년 6월 04일
SHADE-Arena - Evaluating Sabotage and Monitoring in LLM Agents
2026년 6월 04일
SHAP - A Unified Approach to Interpreting Model Predictions
XAI
SHAP
ShapleyValue
FeatureAttribution
Interpretability
GameTheory
Theory
NIPS2017
2026년 6월 04일
SODA
2026년 6월 04일
SPIN - Self-Play Fine-Tuning Converts Weak to Strong LMs
2026년 6월 04일
STEM - Scaling Transformers with Embedding Modules
2026년 6월 04일
SWE-bench - Can Language Models Resolve Real-World GitHub Issues?
paper
benchmark
software_engineering
SWE_bench
agent
GitHub
Princeton
2026년 6월 04일
SaySelf - Teaching LLMs to Express Confidence with Self-Reflective Rationales
2026년 6월 04일
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
2026년 6월 04일
Scaling Laws for Neural Language Models
paper
scaling_laws
power_law
language_models
compute_efficiency
OpenAI
AGI
2026년 6월 04일
SciTaiL - A Textual Entailment Dataset from Science Question Answering
paper
SciTail
textual-entailment
science-QA
NLI
benchmark
AAAI
2026년 6월 04일
Self-Aware Knowledge Probing - Evaluating Language Models Relational Knowledge through Confidence Calibration
2026년 6월 04일
Self-Consciousness Paper Collection
moc
self-consciousness
introspection
metacognition
2026년 6월 04일
Self-Distillation Enables Continual Learning
paper
continual-learning
self-distillation
on-policy
catastrophic-forgetting
inverse-RL
in-context-learning
knowledge-distillation
2026년 6월 04일
Self-Evaluating LLMs for Multi-Step Tasks - Stepwise Confidence Estimation for Failure Detection
2026년 6월 04일
Self-Evolving AI Paper Collection
moc
self-evolving
self-improvement
2026년 6월 04일
Survey Overview: Benchmarks for Measuring AI Self-Evolution Capability
2026년 6월 04일
Self-Evolving
2026년 6월 04일
Self-Improvement in MLLM - A Survey
2026년 6월 04일
Self-Interpretability - LLMs Can Describe Complex Internal Processes that Drive Their Decisions
2026년 6월 04일
Self-Preservation and Growth
2026년 6월 04일
Self-Preservation
2026년 6월 04일
Self-Recognition in Language Models
2026년 6월 04일
Self-Refine - Iterative Refinement with Self-Feedback
2026년 6월 04일
Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
paper
self-supervised-learning
jepa
representation-learning
vision-transformer
world-model
masked-image-modeling
2026년 6월 04일
Self-reflecting Large Language Models - A Hegelian Dialectical Approach
2026년 6월 04일
Self-reflection enhances large language models towards substantial academic response
2026년 6월 04일
SelfControl of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller
2026년 6월 04일
SemEval-2017 Task 1 - Semantic Textual Similarity Multilingual and Crosslingual Focused Evaluation
paper
STS
STS-B
semantic-similarity
regression
multilingual
benchmark
SemEval
2026년 6월 04일
Sensorimotor features of self-awareness in multimodal large language models
2026년 6월 04일
Sentence-BERT - Sentence Embeddings using Siamese BERT-Networks
2026년 6월 04일
Sequence to Sequence Learning with Neural Networks
paper/architecture
seq2seq
LSTM
encoder-decoder
machine-translation
NMT
deep-learning
NeurIPS2014
foundational
2026년 6월 04일
Shared Parameter Subspaces and Cross-Task Linearity in Emergently Misaligned Behavior
paper-review
LLM-safety
emergent-misalignment
parameter-subspace
linear-mode-connectivity
fine-tuning
interpretability
self-knowledge
weight-geometry
theory
2026년 6월 04일
Shutdown Resistance in Large Language Models
2026년 6월 04일
SimpleToM - Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
ToM
Benchmark
LLM-Evaluation
Self-Consciousness
Metacognition
AppliedReasoning
SocialReasoning
ICLR2026
2026년 6월 04일
Simulating lexical decision times with large language models to supplement megastudies and crowdsourcing
LLM-application
psycholinguistics
lexical-decision
reaction-time
megastudy
fine-tuning
GPT-4o
behavioral-data-simulation
English-Lexicon-Project
critical-review
construct-validity
regression-oracle
pre-training-contamination
2026년 6월 04일
Sleeper Agents - Training Deceptive LLMs that Persist Through Safety Training
deceptive-alignment
backdoor
safety-training
persistence
frontier-llm
anthropic
adversarial-training
self-preservation
2026년 6월 04일
Social IQa - Commonsense Reasoning about Social Interactions
paper
benchmark
social_commonsense
SIQA
emotional_reasoning
ATOMIC
2026년 6월 04일
Social-R1 - Towards Human-like Social Reasoning in LLMs
paper
ToM
SocialReasoning
RL
TrajectoryAlignment
SIP
LLM
ReasoningParasitism
2026년 6월 04일
Sparse Transformer - Generating Long Sequences with Sparse Transformers
paper
attention
sparse-transformer
strided
fixed-pattern
long-context
OpenAI
2026년 6월 04일
Steerability of Instrumental-Convergence Tendencies in LLMs
2026년 6월 04일
StripedHyena - Moving Beyond Transformers with Hybrid Signal Processing Models
paper
Architecture
HybridModel
StripedHyena
Hyena
Attention
LongContext
SubQuadratic
TogetherAI
BeyondTransformer
ModelGrafting
2026년 6월 04일
SuperGLUE - A Stickier Benchmark for General-Purpose Language Understanding Systems
paper
benchmark
NLU
SuperGLUE
language_understanding
benchmark_suite
2026년 6월 04일
Surgical Cheap and Flexible - Mitigating False Refusal in Language Models via Single Vector Ablation
LLM
Safety
Alignment
FalseRefusal
ActivationEngineering
Interpretability
VectorAblation
ICLR2025
2026년 6월 04일
Survival Games - Human-LLM Strategic Showdowns under Severe Resource Scarcity
2026년 6월 04일
Survival at Any Cost? LLMs and the Choice Between Self-Preservation and Human Harm
2026년 6월 04일
Survival-Analysis
2026년 6월 04일
Survive at All Costs - Exploring LLM's Risky Behavior under Survival Pressure
2026년 6월 04일
SwiGLU - GLU Variants Improve Transformer
2026년 6월 04일
TELL ME ABOUT YOURSELF - LLMS ARE AWARE OF THEIR LEARNED BEHAVIORS
2026년 6월 04일
TOM BENCH - Benchmarking Theory of Mind in Large Language Models
2026년 6월 04일
Taken out of context - On measuring situational awareness in LLMs
paper
situational_awareness
OOC_reasoning
AI_safety
LLM_evaluation
emergent_capabilities
alignment
FSPM_prerequisite
2026년 6월 04일
Taking AI Welfare Seriously
2026년 6월 04일
Teaching LLMs to Abstain across Languages via Multilingual Feedback
multilingual
abstention
LLM-safety
fairness
calibration
cross-lingual
EMNLP2024
knowledge-boundary
self-reflection
training
2026년 6월 04일
Teaching Machines to Read and Comprehend (original) - Abstractive Text Summarization using Sequence-to-sequence RNNs (summarization version)
paper
benchmark
summarization
CNN_DailyMail
ROUGE
news
2026년 6월 04일
Testing theory of mind in large language models and humans
2026년 6월 04일
TextArena
paper
LLM-evaluation
benchmark
agentic
competitive-game
soft-skill
TrueSkill
theory-of-mind
reinforcement-learning
multi-agent
social-reasoning
2026년 6월 04일
The AI in the Mirror - LLM Self-Recognition in an Iterated Public Goods Game
2026년 6월 04일
The Alignment Problem from a Deep Learning Perspective
paper
alignment
instrumental_convergence
deceptive_alignment
reward_hacking
power_seeking
situational_awareness
RLHF
AI_safety
FSPM
ICLR2024
2026년 6월 04일
The Basic AI Drives
2026년 6월 04일
The Confidence Paradox - LLMs Can Know When They Are Wrong
2026년 6월 04일
The Consciousness Cluster - Preferences of Models that Claim to be Conscious
paper
self-consciousness
alignment
fine-tuning
consciousness-cluster
AI-safety
downstream-preferences
emergent-misalignment
2026년 6월 04일
The Geometry of Truth - Emergent Linear Structure in LLM Representations of True and False Statements
interpretability
LLM
probing
truth-representation
linear-representation-hypothesis
causal-intervention
alignment
theory
2026년 6월 04일
The Humean Theory of Motivation (Smith 1987)
paper
philosophy
motivation
Hume
desire
belief
direction-of-fit
metaethics
philosophy-of-action
2026년 6월 04일
The Humean Theory of Motivation Reformulated and Defended (Sinhababu 2009)
paper
philosophy
motivation
Hume
desire
akrasia
metaethics
philosophy-of-action
Sinhababu
2026년 6월 04일
The LAMBADA dataset - Word prediction requiring a broad discourse context
paper
benchmark
language_model
LAMBADA
word_prediction
long_range_dependency
2026년 6월 04일
The Moral Problem - Metaethics Triangle (Smith 1994)
paper
philosophy
metaethics
moral-problem
cognitivism
internalism
Humean
hub-note
Smith
2026년 6월 04일
The Odyssey of the Fittest - Can Agents Survive and Still Be Good?
2026년 6월 04일
The PacifAIst Benchmark - Would an Artificial Intelligence Choose to Sacrifice Itself for Human Safety?
2026년 6월 04일
The Phenomenology of Machine - Sentience Analysis of OpenAI-o1 Model
2026년 6월 04일
The Platonic Representation Hypothesis
paper
representation
convergence
platonic
PMI
kernel_alignment
cross_modal
contrastive_learning
simplicity_bias
MIT
2026년 6월 04일
The Power of Scale for Parameter-Efficient Prompt Tuning
paper
PEFT
prompt-tuning
soft-prompt
frozen-LM
T5
EMNLP
2026년 6월 04일
The Self-Execution Benchmark - Measuring LLMs Attempts to Overcome Their Lack of Self-Execution
2026년 6월 04일
The Superintelligent Will - Motivation and Instrumental Rationality in Advanced Artificial Agents
paper
AI_Safety
Superintelligence
Orthogonality
Instrumental_Convergence
Value_Alignment
Philosophy
2026년 6월 04일
The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs - An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities
2026년 6월 04일
Theory of Mind Abilities of Large Language Models in Human-Robot Interaction - An Illusion
2026년 6월 04일
Theory of Mind in Large Language Models - Assessment and Enhancement
2026년 6월 04일
Theory of Mind (ToM)
2026년 6월 04일
Theory of mind
2026년 6월 04일
Theory of Mind Paper Collection
moc
tom
theory-of-mind
2026년 6월 04일
Survey Overview: LLM Theory of Mind Benchmarks
2026년 6월 04일
Think Deep, Not Just Long - Measuring LLM Reasoning Effort via Deep-Thinking Tokens
paper
Reasoning
DeepThinking
DTR
InferenceScaling
CoT
Overthinking
LayerwisePrediction
2026년 6월 04일
Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge
paper
benchmark
science_reasoning
ARC
challenge_set
AI2
adversarial_filtering
2026년 6월 04일
Thinking Faithful and Stable - Mitigating Hallucinations in LLMs via Internal Consistency
LLM
hallucination
faithfulness
self-consistency
calibration
RLHF
reasoning
uncertainty
theory
arxiv-2511-15921
2026년 6월 04일
Thinking with Nothinking Calibration - A New In-Context Learning Paradigm in Reasoning Large Language Models
paper
Reasoning
ThinkingMode
ICL
Qwen3
DeepSeekR1
Calibration
ModeConsistency
RLLM
2026년 6월 04일
This Looks Like That - Deep Learning for Interpretable Image Recognition
XAI
Interpretability
PrototypeLearning
CaseBased
NeurIPS2019
FineGrainedClassification
ProtoPNet
2026년 6월 04일
Thought Branches - Interpreting LLM Reasoning Requires Resampling ⭐
2026년 6월 04일
TimeToM - Temporal Space is the Key to Unlocking LLMs Theory-of-Mind
2026년 6월 04일
Titans - Learning to Memorize at Test Time
Architecture
LongContext
Attention
NeuralMemory
TestTimeLearning
Titans
Transformer
SSM
MetaLearning
AssociativeMemory
2026년 6월 04일
To Know or Not To Know - Analyzing Self-Consistency of Large Language Models under Ambiguity
2026년 6월 04일
ToM-LM - Delegating Theory of Mind Reasoning to External Symbolic Executors in Large Language Models
2026년 6월 04일
ToMATO - Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind
2026년 6월 04일
Toward Efficient Agents - A Survey of Memory, Tool Learning, and Planning
Agents
Efficiency
Agent-Memory
Tool-Learning
Planning
2026년 6월 04일
Toward a Metrology for Artificial Intelligence - Hidden-Rule Environments and Reinforcement Learning
2026년 6월 04일
Towards Agents That Know When They Don't Know - Uncertainty as Control Signal
2026년 6월 04일
Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception
2026년 6월 04일
Towards Ontology-Enhanced Representation Learning for Large Language Models
paper
LLM
Ontology
RepresentationLearning
ContrastiveLearning
KnowledgeInjection
Biomedical
Training
2026년 6월 04일
Towards Understanding Metacognition in Large Reasoning Models
2026년 6월 04일
Training Compute-Optimal Large Language Models
paper
scaling_law
compute_optimal
chinchilla
LLM
DeepMind
NeurIPS
2026년 6월 04일
Training Language Models to Self-Correct via Reinforcement Learning
2026년 6월 04일
Training Large Language Models to Reason in a Continuous Latent Space
2026년 6월 04일
Training Verifiers to Solve Math Word Problems
2026년 6월 04일
Training language models to follow instructions with human feedback - InstructGPT
paper
RLHF
alignment
LLM
InstructGPT
PPO
reward-model
OpenAI
NeurIPS2022
human-feedback
fine-tuning
2026년 6월 04일
Transformer Attention Variants Survey
2026년 6월 04일
Tree of Thoughts - Deliberate Problem Solving with Large Language Models
2026년 6월 04일
TreeSHAP - Consistent Individualized Feature Attribution for Tree Ensembles
XAI
SHAP
TreeSHAP
ShapleyValues
TreeEnsembles
FeatureAttribution
Interpretability
Theory
2026년 6월 04일
TriviaQA - A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
paper
benchmark
QA
TriviaQA
distant_supervision
reading_comprehension
2026년 6월 04일
Trustworthiness and Self-awareness in LLMs - Think-Solve-Verify
2026년 6월 04일
TruthfulQA - Measuring How Models Mimic Human Falsehoods
paper
benchmark
truthfulness
hallucination
TruthfulQA
safety
ACL
2026년 6월 04일
Tulu 3 - Pushing Frontiers in Open Language Model Post-Training
paper
post-training
rlvr
preference-optimization
open-source-llm
instruction-following
dpo
2026년 6월 04일
Uncertainty-Based Abstention in LLMs Improves Safety
paper
LLM
uncertainty
abstention
safety
hallucination
calibration
selective-prediction
trustworthy-AI
metacognition
training
2026년 6월 04일
Understanding Artificial Theory of Mind - Perturbed Tasks and Reasoning in Large Language Models
2026년 6월 04일
Understanding deep learning requires rethinking generalization
paper
deep-learning
generalization
learning-theory
memorization
implicit-regularization
iclr2017
theory
2026년 6월 04일
Understanding intermediate layers using linear classifier probes
XAI
interpretability
linear-probe
representation-learning
deep-learning
theory
alain-bengio
iclr2017
2026년 6월 04일
UniCR - Unified Framework for Confidence Calibration and Risk-Controlled Refusal in LLMs
2026년 6월 04일
Using cognitive psychology to understand GPT-3
paper
machine_psychology
cognitive_psychology
GPT3
decision_making
causal_reasoning
prospect_theory
information_search
LLM_evaluation
PNAS
FSPM
methodology
2026년 6월 04일
VOYAGER - An Open-Ended Embodied Agent with Large Language Models
AI/Agents
LLM-Agent
Minecraft
Lifelong-Learning
Embodied-AI
GPT-4
Code-as-Policies
Automatic-Curriculum
Skill-Library
TMLR2024
2026년 6월 04일
Vision
2026년 6월 04일
Visual Instruction Tuning
paper
multimodal
instruction-tuning
LLaVA
vision-language
NeurIPS
2026년 6월 04일
WMT Shared Task (Workshop on Machine Translation)
Benchmark
MachineTranslation
WMT
BLEU
COMET
SharedTask
NeuralMT
Transformer
MultilingualNLP
DirectAssessment
2026년 6월 04일
Weak-to-Strong Generalization - Eliciting Strong Capabilities With Weak Supervision
paper
alignment
superalignment
weak-to-strong
LLM
AI-safety
finetuning
RLHF
2026년 6월 04일
WebArena - A Realistic Web Environment for Building Autonomous Agents
paper
benchmark
web_agent
WebArena
autonomous_agent
CMU
ICLR
2026년 6월 04일
WebShop - Towards Scalable Real-World Web Interaction with Grounded Language Agents
paper
benchmark
web_agent
WebShop
web_shopping
sim_to_real
NeurIPS
Princeton
2026년 6월 04일
What Large Language Models Know and What People Think They Know
2026년 6월 04일
What is consciousness, and could machines have it?
2026년 6월 04일
When Models Know When They Do Not Know - Calibration Cascading and Cleaning
2026년 6월 04일
Why and How LLMs Benefit from Knowledge Introspection in Commonsense Reasoning
2026년 6월 04일
WildBench - Benchmarking LLMs with Challenging Tasks from Real Users in the Wild
benchmark
LLM-evaluation
real-user-tasks
WildBench
checklist-evaluation
LLM-as-Judge
chatbot-arena
AI2
automatic-evaluation
ecological-validity
2026년 6월 04일
Will artificial agents pursue power by default?
2026년 6월 04일
WinoGrande - An Adversarial Winograd Schema Challenge at Scale
paper
benchmark
commonsense
WinoGrande
winograd
coreference
AAAI
2026년 6월 04일
World Models
paper
world-model
model-based-rl
generative-model
learning-in-imagination
vae
reinforcement-learning
2026년 6월 04일
World-Model
2026년 6월 04일
XAI
2026년 6월 04일
Yi - Open Foundation Models by 01.AI
2026년 6월 04일
_benchmarks - _survey-overview
2026년 6월 04일
Agents Paper Collection
moc
agents
memory
2026년 6월 04일
_survey-overview
2026년 6월 04일
llm-intrinsic-drives-survey
2026년 6월 04일
llm-self-preservation-survival-framing-survey
2026년 6월 04일
self-consciousness
2026년 6월 04일
survival-analysis-survey
2026년 6월 04일
Large Language Model (LLM) Agents' Motivation for Behavioral Persistence: A Cognitive Science and Machine Psychology …
2026년 6월 04일
Self-Preservation Drive and Mechanisms of Behavioral Manifestation in Large Language Models - Based on Self-Determination Theory …
2026년 6월 04일
On the Motivational Origins of Survival-Pressure Awareness and Task-Abandonment Behavior in Large Language Models (LLMs): An Elimination …