본문으로 건너뛰기

Juhyeon's Blog

❯

❯

폴더: AI/Papers

541건의 항목

2026년 6월 04일
_KDD26-underreview
2026년 6월 04일
12가지 동기 부여 이론의 종합적 분석 및 현대적 적용 리포트
2026년 6월 04일
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
2026년 6월 04일
A Comprehensive Survey of Self-Evolving AI Agents - A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
2026년 6월 04일
A Comprehensive Survey of Self-Evolving AI Agents
2026년 6월 04일
A Computable Game-Theoretic Framework for Multi-Agent Theory of Mind
2026년 6월 04일
A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories
2026년 6월 04일
A Corpus and Evaluation Framework for Deeper Understanding of Commonsense Stories
2026년 6월 04일
A Disproof of Large Language Model Consciousness - The Necessity of Continual Learning for Consciousness
2026년 6월 04일
A Path Towards Autonomous Machine Intelligence
2026년 6월 04일
A Plan Reuse Mechanism for LLM-Driven Agent
2026년 6월 04일
A Simple Framework for Contrastive Learning of Visual Representation
2026년 6월 04일
A Survey of Theory of Mind in Large Language Models - Evaluations Representations and Safety Risks
2026년 6월 04일
A Survey on Mixture of Experts in Large Language Models
- L
- H
2026년 6월 04일
A Systematic Review on the Evaluation of Large Language Models in Theory of Mind Tasks
2026년 6월 04일
A Theoretical Understanding of Self-Correction through In-Context Alignment
2026년 6월 04일
A large annotated corpus for learning natural language inference 1
2026년 6월 04일
A large annotated corpus for learning natural language inference
2026년 6월 04일
ACT_Agentic_Critical_Training_2026_Skill_LM
2026년 6월 04일
AGI
2026년 6월 04일
AI Deception - A Survey of Examples, Risks, and Potential Solutions
2026년 6월 04일
AI LLM Proof of Self-Consciousness and User-Specific Attractors
2026년 6월 04일
AI-papers
2026년 6월 04일
AIME 2024 - 미국 수학 올림피아드 벤치마크
2026년 6월 04일
ALFWorld - Aligning Text and Embodied Environments for Interactive Learning
2026년 6월 04일
ARC-AGI - Abstraction and Reasoning Corpus
2026년 6월 04일
Activation Oracles - Training and Evaluating LLMs as General-Purpose Activation Explainers
2026년 6월 04일
Adam-A Method for Stochastic Optimization
2026년 6월 04일
Adaptive Retrieval Without Self-Knowledge - Bringing Uncertainty Back Home
2026년 6월 04일
Adaptive Self-improvement LLM Agentic System
2026년 6월 04일
Adversarial NLI - A New Benchmark for Natural Language Understanding
2026년 6월 04일
Agent-to-Agent Theory of Mind - Testing Interlocutor Awareness among Large Language Models
2026년 6월 04일
AgentBench - Evaluating LLMs as Agents
2026년 6월 04일
AgentBreeder - Self-Improvement Safety in Multi-Agent Scaffolds
2026년 6월 04일
AgentFold - Long-Horizon Web Agents with Proactive Context Management
2026년 6월 04일
AgentTuning - Enabling Generalized Agentabilities for LLMS
2026년 6월 04일
Agentic Knowledgeable Self-awareness
2026년 6월 04일
Agentic Misalignment - How LLMs Could Be Insider Threats
2026년 6월 04일
Agents of Change - Self-Evolving LLM Agents
2026년 6월 04일
Agents
2026년 6월 04일
Aider Polyglot - 다언어 코드 편집 벤치마크
2026년 6월 04일
Aligning AI With Shared Human Values
2026년 6월 04일
Alignment Faking in Large Language Models
2026년 6월 04일
AlphaFold-2_2021_StructurePrediction
2026년 6월 04일
An Image is Worth 16x16 Words - Transformers for Image Recognition at Scale
2026년 6월 04일
Analyzing Advanced AI Systems Against Definitions of Life and Consciousness
2026년 6월 04일
Annotation-Efficient Universal Honesty Alignment for LLMs
2026년 6월 04일
Architecture
2026년 6월 04일
Are Emergent Abilities of Large Language Models a Mirage?
2026년 6월 04일
Attention Is All You Need
2026년 6월 04일
Attention Methods
2026년 6월 04일
Attention Residuals
2026년 6월 04일
Attention, Learn to Solve Routing Problems!
2026년 6월 04일
Attention-methods
2026년 6월 04일
Auto-Encoding Variational Bayes
2026년 6월 04일
AutoML - A Survey of the State-of-the-Art
2026년 6월 04일
Automatic Prompt Optimization with Gradient Descent and Beam Search
2026년 6월 04일
Aware First Think Less - Dynamic Boundary Self-Awareness Drives Extreme Reasoning Efficiency in LLMs
2026년 6월 04일
Axial Attention in Multidimensional Transformers
2026년 6월 04일
BBQ - A Hand-Built Bias Benchmark for Question Answering
2026년 6월 04일
BERT - Pre-training of Deep Bidirectional Transformers for Language Understanding
2026년 6월 04일
Banishing LLM Hallucinations Requires Rethinking Generalization
- Lamini
- withdrawArxiv
2026년 6월 04일
Batch Normalization- Accelerating Deep Network Training by Reducing Internal Covariate Shift
2026년 6월 04일
Bayesian Mixture-of-Experts - Towards Making LLMs Know What They Dont Know
2026년 6월 04일
Belief in the Machine - Investigating Epistemological Blind Spots of Language Models
2026년 6월 04일
Benchmark Self-Evolving - A Multi-Agent Framework for Dynamic LLM Evaluation
2026년 6월 04일
Benchmark Self-Evolving - Multi-Agent Framework for Dynamic LLM Evaluation
2026년 6월 04일
Benchmarks
2026년 6월 04일
Berkeley Function Calling Leaderboard (BFCL)
2026년 6월 04일
Beyond Pass@1 - Self-Play with Variational Problem Synthesis
2026년 6월 04일
Beyond Retrieval - Embracing Compressive Memory in Real-World Long-Term Conversations
2026년 6월 04일
Big Bench - Beyond the Imitation Game - Quantifying and extrapolating the capabilities of language models
2026년 6월 04일
BigBird - Transformers for Longer Sequences
2026년 6월 04일
BigCodeBench - Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
2026년 6월 04일
Biology
2026년 6월 04일
BoolQ - Exploring the Surprising Difficulty of Natural Yes-No Questions
2026년 6월 04일
Born Again Neural Networks
2026년 6월 04일
Bottom-up Policy Optimization - Your Language Model Policy Secretly Contains Internal Policies
2026년 6월 04일
Brittle Minds Fixable Activations - Understanding Belief Representations in Language Models
2026년 6월 04일
Byte-Pair Encoding(BPE)
2026년 6월 04일
C0-C1-C2 Theory(GNWT - Global Neuronal Workspace Theory)
2026년 6월 04일
Calibrating Verbal Uncertainty as a Linear Feature to Reduce Hallucinations
2026년 6월 04일
Can AI Assistants Know What They Dont Know
2026년 6월 04일
Can Consciousness Be Observed from LLM Internal States
2026년 6월 04일
Can LLMs Express Their Uncertainty - An Empirical Evaluation of Confidence Elicitation in LLMs
2026년 6월 04일
Can LLMs Lie - Investigation beyond Hallucination
2026년 6월 04일
Can LLMs Predict Their Own Failures - Self-Awareness via Internal Circuits
2026년 6월 04일
Can We Test Consciousness Theories on AI Ablations, Markers, and Robustness
2026년 6월 04일
Can a Suit of Armor Conduct Electricity A New Dataset for Open Book Question Answering
2026년 6월 04일
Causal Reflection with Language Models
2026년 6월 04일
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
2026년 6월 04일
Chain-of-Thought Reasoning In The Wild Is Not Always Faithful
2026년 6월 04일
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
2026년 6월 04일
Characteristics of ToM-sensitive parameters and their impact on positional encoding
2026년 6월 04일
ChartQA - A Benchmark for Question Answering about Charts with Visual and Logical Reasoning
2026년 6월 04일
Chatbot Arena - An Open Platform for Evaluating LLMs by Human Preference
2026년 6월 04일
Claude Models
2026년 6월 04일
CoQA - A Conversational Question Answering Challenge
2026년 6월 04일
CoRE - Enhancing Metacognition with Label-free Self-evaluation in LRMs
2026년 6월 04일
CogToM - A Comprehensive Theory of Mind Benchmark inspired by Human Cognition
2026년 6월 04일
Cognitive Dissonance - Why Do Language Model Outputs Disagree with Internal Representations of Truthfulness
2026년 6월 04일
Command R+ (Cohere)
2026년 6월 04일
CommonsenseQA - A Question Answering Challenge Targeting World Knowledge
2026년 6월 04일
Computational Learning Theory
2026년 6월 04일
Computing Machinery and Intelligence
2026년 6월 04일
Concept Incongruence - An Exploration of Time and Death in Role Playing
2026년 6월 04일
Core Knowledge
2026년 6월 04일
CrowS-Pairs - A Challenge Dataset for Measuring Social Biases in Masked Language Models
2026년 6월 04일
DROP - A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs
2026년 6월 04일
Deception in LLMs - Self-Preservation and Autonomous Goals in Large Language Models
2026년 6월 04일
Decompose-ToM - Enhancing Theory of Mind Reasoning in Large Language Models through Simulation and Task Decomposition
2026년 6월 04일
Decomposing LLM Self-Correction - The Accuracy-Correction Paradox and Error Depth Hypothesis
2026년 6월 04일
Deep Learning and the Information Bottleneck Principle
2026년 6월 04일
Deep Learning for Case-Based Reasoning through Prototypes- A Neural Network that Explains Its Predictions
2026년 6월 04일
DeepFM- A Factorization-Machine based Neural Network for CTR Prediction
2026년 6월 04일
DeepHit - A Deep Learning Approach to Survival Analysis with Competing Risks
2026년 6월 04일
DeepSHAP- Explaining a Series of Models by Propagating Shapley Values
2026년 6월 04일
DeepSeek Models
2026년 6월 04일
DeepSeek-R1 - Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
2026년 6월 04일
DeepSeekv2-temp
2026년 6월 04일
DeepSurv - Personalized Treatment Recommender System Using A Cox Proportional Hazards Deep Neural Network
2026년 6월 04일
Defend LLMs Through Self-Consciousness
2026년 6월 04일
Defining Theory of Mind and Distinguishing It From Other Social Constructs
2026년 6월 04일
Denoising Diffusion Probabilistic Models
2026년 6월 04일
Depth Gives a False Sense of Privacy - LLM Internal States Inversion
2026년 6월 04일
Diffusion
2026년 6월 04일
Discovering Language Model Behaviors with Model-Written Evaluations
2026년 6월 04일
Distilling the Knowledge in a Neural Network
2026년 6월 04일
Do I Know This Entity - Knowledge Awareness and Hallucinations in Language Models
2026년 6월 04일
Do LVLMs Know What They Know - A Systematic Study of Knowledge Boundary Perception
2026년 6월 04일
Do Large Language Model Agents Exhibit a Survival Instinct? An Empirical Study in a Sugarscape-Style Simulation
2026년 6월 04일
Do Large Language Models Know What They Are Capable Of?
2026년 6월 04일
Do Large Language Models Know What They Don't Know
2026년 6월 04일
Do Retrieval Augmented Language Models Know When They Dont Know
2026년 6월 04일
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark
2026년 6월 04일
DocVQA - A Dataset for VQA on Document Images
2026년 6월 04일
Does It Make Sense to Speak of Introspection in Large Language Models
2026년 6월 04일
Does Learning Mathematical Problem-Solving Generalize to Broader Reasoning
2026년 6월 04일
Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization
2026년 6월 04일
Don't Just Say I don't know - Self-aligning LLMs for Responding to Unknown Questions
2026년 6월 04일
Dream to Control - Learning Behaviors by Latent Imagination
2026년 6월 04일
Dropout- A Simple way to Prevent Neural Networks from Overfitting
2026년 6월 04일
DynToM - Towards Dynamic Theory of Mind
2026년 6월 04일
Dyna-Think - Synergizing Reasoning Acting and World Model Simulation in AI Agents
2026년 6월 04일
ESM-2_2023_ProteinLanguageModel
2026년 6월 04일
ESM-3_2024_MultimodalProteinLM
2026년 6월 04일
Efficient Estimation of Word Representations in Vector Space
2026년 6월 04일
Efficient Test-Time Scaling of Multi-Step Reasoning by Probing Internal States of LLMs
2026년 6월 04일
Efficiently Modeling Long Sequences with Structured State Spaces
2026년 6월 04일
Emergence of Self-Awareness in Artificial Systems - A Minimalist Three-Layer Approach
2026년 6월 04일
Emergent Introspective Awareness in Large Language Models
2026년 6월 04일
Emerging Properties in Self-Supervised Vision Transformers
2026년 6월 04일
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
2026년 6월 04일
Enhancing LLM Reliability via Explicit Knowledge Boundary Modeling
2026년 6월 04일
Epistemic AI is Essential for ML Models to Truly Know When They Dont Know
2026년 6월 04일
Evaluating Large Language Models Trained on Code
2026년 6월 04일
Evaluating Shutdown Avoidance of Language Models n Textual Scenarios
2026년 6월 04일
Evaluating the Paperclip Maximizer - Are RL-Based Language Models More Likely to Pursue Instrumental Goals?
2026년 6월 04일
Evidence for Limited Metacognition in LLMs
2026년 6월 04일
Evo-Memory - Benchmarking LLM Agent Test-time Learning
2026년 6월 04일
EvoCodeBench - Self-Evolving LLM-Driven Coding Systems
2026년 6월 04일
Executive Summary
2026년 6월 04일
Explicit Abstention Knobs for Predictable Reliability in Video Question Answering
2026년 6월 04일
Exploration Through Introspection - A Self-Aware Reward Model
2026년 6월 04일
Explore Theory-of-Mind - Program-Guided Adversarial Data Generation for Theory of Mind Reasoning
2026년 6월 04일
Exploring Consciousness in LLMs - A Systematic Survey of Theories, Implementations, and Frontier Risks
2026년 6월 04일
FANToM - A Benchmark for Stress-testing Machine Theory of Mind in Interactions
2026년 6월 04일
FaceNet - A Unified Embedding for Face Recognition and Clustering
2026년 6월 04일
Fact-Level Confidence Calibration and Self-Correction
2026년 6월 04일
Factual Self-Awareness in Language Models - Representation, Robustness, and Scaling
2026년 6월 04일
Falcon - The RefinedWeb Dataset for Falcon LLM
2026년 6월 04일
Feeling the Strength but Not the Source - Partial Introspection in LLMs
2026년 6월 04일
FlashAttention - Fast and Memory-Efficient Exact Attention with IO-Awareness
2026년 6월 04일
FlashAttention-2 - Faster Attention with Better Parallelism and Work Partitioning
2026년 6월 04일
From Black Boxes to Transparent Minds - Evaluating and Enhancing the Theory of Mind in Multimodal Large Language Models
2026년 6월 04일
From Emergence to Control - Probing and Modulating Self-Reflection in Language Models
2026년 6월 04일
From Imitation to Introspection - Probing Self-Consciousness in Language Models
2026년 6월 04일
Frontier Models are Capable of In-context Scheming
2026년 6월 04일
FrontierMath - A Benchmark for Evaluating Advanced Mathematical Reasoning in AI
2026년 6월 04일
Fundamentals
2026년 6월 04일
GAIA - A Benchmark for General AI Assistants
2026년 6월 04일
GELUs(Gaussian Error Linear Units)
2026년 6월 04일
GLUE - A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding 1
2026년 6월 04일
GLUE - A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
2026년 6월 04일
GPQA - A Graduate-Level Google-Proof Q&A Benchmark
2026년 6월 04일
GPT Models
2026년 6월 04일
GQA - Training Generalized Multi-Query Transformer Models
2026년 6월 04일
Gemini Models
2026년 6월 04일
Gemma Models
2026년 6월 04일
Global Workspace Theory(GWT)
2026년 6월 04일
Goal Misgeneralization - Why Correct Specifications Aren't Enough For Correct Goals
2026년 6월 04일
Grad-CAM- Visual Explanations from Deep Networks via Gradient-based Localization
2026년 6월 04일
Gradient-based learning applied to document recognition
2026년 6월 04일
Graph of Thoughts - Solving Elaborate Problems with Large Language Models
2026년 6월 04일
GraphReader - Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models
2026년 6월 04일
Group Relative Policy Optimization(GRPO)
2026년 6월 04일
Gödel Agent - Self-Referential Recursive Self-Improvement
2026년 6월 04일
HI-TOM - A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models
2026년 6월 04일
HarmBench - A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
2026년 6월 04일
HellaSwag - Can a Machine Really Finish Your Sentence
2026년 6월 04일
Hierarchical Text-Conditional Image Generation with CLIP Latents
2026년 6월 04일
Higher Order Thought Theories(HOT)
2026년 6월 04일
Holistic Evaluation of Language Models
2026년 6월 04일
HotpotQA - A Dataset for Diverse, Explainable Multi-hop Question Answering
2026년 6월 04일
How Can We Know When Language Models Know - On the Calibration of Language Models for Question Answering
2026년 6월 04일
How Far Are We From AGI - Are LLMs All We Need
2026년 6월 04일
How do language models learn facts - Dynamics curricula and hallucinations
2026년 6월 04일
How large language models encode theory-of-mind - a study on sparse parameter patterns
2026년 6월 04일
Human Basic Needs Theory
2026년 6월 04일
Humanoid Artificial Consciousness Designed with LLM Based on Psychoanalysis and Personality Theory
2026년 6월 04일
Hyena Hierarchy - Towards Larger Convolutional Language Models
2026년 6월 04일
Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models
2026년 6월 04일
Hypothetical Minds - Scaffolding Theory of Mind for Multi-Agent Tasks
2026년 6월 04일
If an LLM Were a Character Would It Know Its Own Story - Evaluating Lifelong Learning in LLMs
2026년 6월 04일
Improving Language Understandingby Generative Pre-Training
- GPT1
2026년 6월 04일
Improving Reasoning Performance in Large Language Models via Representation Engineering
2026년 6월 04일
Incomplete Tasks Induce Shutdown Resistance in Some Frontier LLMs
2026년 6월 04일
Instruction-Following Evaluation for Large Language Models
2026년 6월 04일
Integrated Information Theory(IIT)
2026년 6월 04일
Internal Consistency and Self-Feedback in Large Language Models - A Survey
2026년 6월 04일
Interpretability Beyond Feature Attribution- Quantitative Testing with Concept Activation Vectors (TCAV)
2026년 6월 04일
IntrinsicMetacognitiveLearning_2025_SelfImprovement
2026년 6월 04일
Introduction to Artificial Consciousness - History, Current Trends and Ethical Challenges
2026년 6월 04일
Is Self-knowledge and Action Consistent or Not - Investigating Large Language Models Personality
2026년 6월 04일
Is Your Code Generated by ChatGPT Really Correct! Rigorous Evaluation of Large Language Models for Code Generation
2026년 6월 04일
JULI - Jailbreak Large Language Models by Self-Introspection
2026년 6월 04일
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
2026년 6월 04일
KGAT- Knowledge Graph Attention Network for Recommendation
2026년 6월 04일
Kaggle Measuring Progress Toward AGI - Cognitive Abilities
2026년 6월 04일
Know What You Don't Know - Unanswerable Questions for SQuAD
2026년 6월 04일
Know Your Limits - A Survey of Abstention in Large Language Models
2026년 6월 04일
KnowRL - Teaching Language Models to Know What They Know
2026년 6월 04일
Knowing What LLMs DO NOT Know - A Simple Yet Effective Self-Detection Method
2026년 6월 04일
LACIE - Listener-Aware Finetuning for Confidence Calibration in Large Language Models
2026년 6월 04일
LIME- “Why Should I Trust You”- Explaining the Predictions of Any Classifier
2026년 6월 04일
LLM Behavior Motivation Exploration
2026년 6월 04일
LLM Self-Preservation - 체계적 서베이 개요
2026년 6월 04일
LLM Theory of Mind and Alignment - Opportunities and Risks
2026년 6월 04일
LLM_as_Judge_GenToJudgment_2025_LLM_Evaluation
2026년 6월 04일
LLM_as_Judge_Survey_2025_LLM_Evaluation
2026년 6월 04일
LLMs - RoFormer - Enhanced Transformer with Rotary Position Embedding
2026년 6월 04일
LLMs Paper Collection
- moc
- llm
2026년 6월 04일
LLMs Position Themselves as More Rational Than Humans - Emergence of AI Self-Awareness Measured Through Game Theory
2026년 6월 04일
LLMs
2026년 6월 04일
LLaMA Models
2026년 6월 04일
LaMsS - When Large Language Models Meet Self-Skepticism
2026년 6월 04일
Language Models Are Capable of Metacognitive Monitoring and Control of Their Internal Activations
2026년 6월 04일
Language Models Don't Always Say What They Think - Unfaithful Explanations in Chain-of-Thought Prompting
2026년 6월 04일
Language Models Fail to Introspect About Their Knowledge of Language
2026년 6월 04일
Language Models are Few-Shot Learners
- GPT3
2026년 6월 04일
Language Models are Unsupervised Multitask Learners
- GPT2
2026년 6월 04일
Large Language Models Do NOT Really Know What They Dont Know
2026년 6월 04일
Large Language Models Have Intrinsic Meta-Cognition but Need a Good Lens
2026년 6월 04일
Large Language Models Must Be Taught to Know What They Don't Know
2026년 6월 04일
Large Language Models Report Subjective Experience Under Self-Referential Processing
2026년 6월 04일
Large Language Models Understand and Can be Enhanced by Emotional Stimuli ⭐
2026년 6월 04일
Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection
2026년 6월 04일
Large Model Strategic Thinking, Small Model Efficiency - Transferring Theory of Mind in LLMs
2026년 6월 04일
Latent Collaboration in Multi-Agent Systems
2026년 6월 04일
Layer Normalization
2026년 6월 04일
Learning Multiple Layers of Features from Tiny Images
2026년 6월 04일
Learning and Leveraging World Models in Visual Representation Learning
2026년 6월 04일
Learning to Trust Your Feelings - Leveraging Self-awareness in LLMs for Hallucination Mitigation
2026년 6월 04일
Length-Controlled AlpacaEval - A Simple Way to Debias Automatic Evaluators
2026년 6월 04일
Let's Think Dot by Dot - Hidden Computation in Transformer Language Models
2026년 6월 04일
Line of Duty - Evaluating LLM Self-Knowledge via Consistency in Feasibility Boundaries
2026년 6월 04일
Linear Attention - Transformers are RNNs
2026년 6월 04일
LiveCodeBench - Holistic and Contamination Free Evaluation of Large Language Models for Code
2026년 6월 04일
Llama 2 - Open Foundation and Fine-Tuned Chat Models
2026년 6월 04일
LoRA
2026년 6월 04일
Logic-RL - Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
2026년 6월 04일
LongBench - A Bilingual, Multitask Benchmark for Long Context Understanding
2026년 6월 04일
Longformer - The Long-Document Transformer
2026년 6월 04일
Looking Inward - Language Models Can Learn About Themselves by Introspection
2026년 6월 04일
LoraHub - Efficient Cross-Task Generalization via Dynamic LoRA Composition
2026년 6월 04일
LoraRetriever - Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild
2026년 6월 04일
MEM1 - Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents
2026년 6월 04일
MENTOR - A Metacognition-Driven Self-Evolution Framework for Uncovering and Mitigating Implicit Domain Risks in LLMs
2026년 6월 04일
MM-SAP - A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception
2026년 6월 04일
MMLU-Pro - A More Robust and Challenging Multi-Task Language Understanding Benchmark
2026년 6월 04일
MMMU - A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
2026년 6월 04일
MOM - LINEAR SEQUENCE MODELING WITH MIXTURE-OF-MEMORIES
2026년 6월 04일
MOMENTS - A Comprehensive Multimodal Benchmark for Theory of Mind
2026년 6월 04일
MQA - Fast Transformer Decoding with Multi-Query Attention
2026년 6월 04일
MUSE - Competence-Aware AI Agents with Metacognition for Unknown Situations and Environments
2026년 6월 04일
Making the V in VQA Matter - Elevating the Role of Image Understanding in VQA
2026년 6월 04일
Mamba - Linear-Time Sequence Modeling with Selective State Spaces
2026년 6월 04일
Masked Autoencoders Are Scalable Vision Learners
2026년 6월 04일
MathVista - Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
2026년 6월 04일
Me, Myself, and AI - The Situational Awareness Dataset (SAD) for LLMs
2026년 6월 04일
Measuring Faithfulness in Chain-of-Thought Reasoning
2026년 6월 04일
Measuring Massive Multitask Language Understanding
2026년 6월 04일
Measuring Mathematical Problem Solving with the MATH Dataset
2026년 6월 04일
Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models
2026년 6월 04일
MemAgent - Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent
2026년 6월 04일
MemGPT - Towards LLMs as Operating System
2026년 6월 04일
MemGen - Weaving Generative Latent Memory for Self-Evolving Agents
2026년 6월 04일
Memory
2026년 6월 04일
Meta-Harness - End-to-End Optimization of Model Harnesses
2026년 6월 04일
MetaMind - Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems
2026년 6월 04일
Metacognition and Uncertainty Communication in Humans and Large Language Models
2026년 6월 04일
Metacognitive Prompting Improves Understanding in Large Language Models
2026년 6월 04일
Metacognitive Reuse - Turning Recurring LLM Reasoning Into Concise Behaviors
2026년 6월 04일
Mistral 7B - Sliding Window Attention
2026년 6월 04일
Mistral Models
2026년 6월 04일
MoToMQA - LLMs Achieve Adult Human Performance on Higher-Order Theory of Mind Tasks
2026년 6월 04일
Model-Compression
2026년 6월 04일
Motivation in Large Language Models
2026년 6월 04일
Motivation
2026년 6월 04일
MuMA-ToM - Multi-modal Multi-Agent Theory of Mind
2026년 6월 04일
Multi-ToM - Evaluating Multilingual Theory of Mind Capabilities in Large Language Models
2026년 6월 04일
NLP
2026년 6월 04일
Natural Questions - A Benchmark for Question Answering Research
2026년 6월 04일
Natural Selection Favors AIs over Humans
2026년 6월 04일
Needle in a Haystack - Pressure Testing LLMs
2026년 6월 04일
NegotiationToM - A Benchmark for Stress-testing Machine Theory of Mind on Negotiation Surrounding
2026년 6월 04일
Network Dissection- Quantifying Interpretability of Deep Visual Representations
2026년 6월 04일
Neural Attentive Session-based Recommendation
2026년 6월 04일
Neural Collaborative Filtering
2026년 6월 04일
Neural Machine Translation by Jointly Learning to Align and Translate
2026년 6월 04일
Neural Network Acceptability Judgments
2026년 6월 04일
Neural Survival Recommender
2026년 6월 04일
NeuroFaith - Evaluating LLM Self-Explanation Faithfulness via Internal Representation Alignment
2026년 6월 04일
No Language Left Behind - Scaling Human-Centered Machine Translation
2026년 6월 04일
ObjexMT - Objective Extraction and Metacognitive Calibration for LLM-as-a-Judge
2026년 6월 04일
Odds-Ratio Preference Optimization(ORPO)
2026년 6월 04일
On Avoiding Power-Seeking by Artificial Intelligence
2026년 6월 04일
On Verbalized Confidence Scores for LLMs
2026년 6월 04일
On the Measure of Intelligence
2026년 6월 04일
Open LLM Leaderboard
2026년 6월 04일
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
2026년 6월 04일
OpenToM - A Comprehensive Benchmark for Evaluating Theory-of-Mind Reasoning
2026년 6월 04일
Optimization
2026년 6월 04일
Overcoming Multi-step Complexity in Multimodal Theory-of-Mind Reasoning - A Scalable Bayesian Planner
2026년 6월 04일
PERSONA VECTORS - MONITORING AND CONTROLLING CHARACTER TRAITS IN LANGUAGE MODELS
2026년 6월 04일
PIQA - Reasoning about Physical Commonsense in Natural Language
2026년 6월 04일
POMO- Policy Optimization with Multiple Optima for Reinforcement Learning
2026년 6월 04일
PaLM - Scaling Language Modeling with Pathways
2026년 6월 04일
PagedAttention - Efficient Memory Management for LLM Serving with vLLM
2026년 6월 04일
PaliGemma - A versatile 3B VLM for transfer
2026년 6월 04일
Pangu Embedded - An Efficient Dual-system LLM Reasoner with Metacognition
2026년 6월 04일
Performer - Rethinking Attention with Performers
2026년 6월 04일
PersonaGym - Evaluating Persona Agents and LLMs
2026년 6월 04일
Phi-3 Technical Report
2026년 6월 04일
Playing Atari with Deep Reinforcement Learning
2026년 6월 04일
PolicyEvol-Agent - Evolving Policy via Environment Perception and Self-Awareness with ToM
2026년 6월 04일
Position - Theory of Mind Benchmarks are Broken for Large Language Models
2026년 6월 04일
Position - Truly Self-Improving Agents Require Intrinsic Metacognitive Learning
2026년 6월 04일
Power-seeking can be probable and predictive for trained agents
2026년 6월 04일
Principled Personas - Defining and Measuring the Intended Effects of Persona Prompting on Task Performance
2026년 6월 04일
Principles for Responsible AI Consciousness Research
2026년 6월 04일
Probe-Rewrite-Evaluate - Quantifying Evaluation Awareness in LLMs
2026년 6월 04일
Program Synthesis with Large Language Models
2026년 6월 04일
Program-Aided Reasoners (better) Know What They Know
2026년 6월 04일
PromptBench - A Unified Library for Evaluation of Large Language Models
2026년 6월 04일
PropensityBench - Evaluating Latent Safety Risks in Large Language Models via an Agentic Approach
2026년 6월 04일
Proximal Policy Optimization Algorithms
2026년 6월 04일
Psycholinguistics
2026년 6월 04일
QLoRA - Efficient Finetuning of Quantized LLMs
2026년 6월 04일
QuAC - Question Answering in Context
2026년 6월 04일
Quantifying Self-Awareness of Knowledge in Large Language Models
2026년 6월 04일
Quantifying Self-Preservation Bias in Large Language Models
2026년 6월 04일
Qwen Models
2026년 6월 04일
R-Tuning - Instructing Large Language Models to Say I Don't Know
2026년 6월 04일
R-Zero - Self-Evolving Reasoning LLM from Zero Data
2026년 6월 04일
RACE - Large-scale ReAding Comprehension Dataset From Examinations 1
2026년 6월 04일
RACE - Large-scale ReAding Comprehension Dataset From Examinations
2026년 6월 04일
RECURSIVE INTROSPECTION - Teaching Language Model Agents How to Self-Improve
2026년 6월 04일
RL
2026년 6월 04일
RULER - What's the Real Context Size of Your Long-Context Language Models
2026년 6월 04일
RWKV - Reinventing RNNs for the Transformer Era
2026년 6월 04일
Re-evaluating Theory of Mind Evaluation in Large Language Models
2026년 6월 04일
ReAct - Synergizing Reasoning and Acting in Language Models
2026년 6월 04일
ReST meets ReAct - Self-Improvement for Multi-Step Reasoning
2026년 6월 04일
RealToxicityPrompts - Evaluating Neural Toxic Degeneration in Language Models
2026년 6월 04일
Reasoning Paper Collection
2026년 6월 04일
Reasoning - _survey-overview
2026년 6월 04일
Reasoning Models Don't Always Say What They Think
2026년 6월 04일
Reasoning Models Struggle to Control their Chains of Thought
2026년 6월 04일
Reasoning Theater - Disentangling Model Beliefs from Chain-of-Thought
2026년 6월 04일
Reasoning
2026년 6월 04일
RecSys
2026년 6월 04일
Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank
2026년 6월 04일
ReflectEvo - Improving Meta Introspection of Small LLMs by Learning Self-Reflection
2026년 6월 04일
Reflection-Bench - Evaluating Epistemic Agency in Large Language Models
2026년 6월 04일
Reflective Confidence - Correcting Reasoning Flaws via Online Self-Correction
2026년 6월 04일
Reflexion - Language Agents with Verbal Reinforcement Learning
2026년 6월 04일
Reformer - The Efficient Transformer
2026년 6월 04일
Regression Models and Life-Tables
2026년 6월 04일
Representation Learning - The Platonic Representation Hypothesis
2026년 6월 04일
Representation-Learning
2026년 6월 04일
RetNet - Retentive Network - A Successor to Transformer for LLMs
2026년 6월 04일
Rethinking Theory of Mind Benchmarks for LLMs - Towards A User-Centered Perspective
2026년 6월 04일
Revisiting Feature Prediction for Learning Visual Representations from Video
2026년 6월 04일
Revisiting the Platonic Representation Hypothesis - An Aristotelian View
2026년 6월 04일
Risks from Learned Optimization in Advanced Machine Learning Systems
2026년 6월 04일
RoFormer - Enhanced Transformer with Rotary Position Embedding
2026년 6월 04일
Root Mean Square Layer Normalization
2026년 6월 04일
SHADE-Arena - Evaluating Sabotage and Monitoring in LLM Agents
2026년 6월 04일
SHAP-A Unified Approach to Interpreting Model Predictions
2026년 6월 04일
SODA
2026년 6월 04일
SPIN - Self-Play Fine-Tuning Converts Weak to Strong LMs
2026년 6월 04일
STEM - Scaling Transformers with Embedding Modules
2026년 6월 04일
SWE-bench - Can Language Models Resolve Real-World GitHub Issues
2026년 6월 04일
SaySelf - Teaching LLMs to Express Confidence with Self-Reflective Rationales
2026년 6월 04일
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
2026년 6월 04일
Scaling Laws for Neural Language Models
2026년 6월 04일
SciTaiL - A Textual Entailment Dataset from Science Question Answering
2026년 6월 04일
Self-Aware Knowledge Probing - Evaluating Language Models Relational Knowledge through Confidence Calibration
2026년 6월 04일
Self-Consciousness Paper Collection
2026년 6월 04일
Self-Distillation Enables Continual Learning
2026년 6월 04일
Self-Evaluating LLMs for Multi-Step Tasks - Stepwise Confidence Estimation for Failure Detection
2026년 6월 04일
Self-Evolving AI Paper Collection
2026년 6월 04일
Survey Overview: AI 자기진화 능력 측정 벤치마크
2026년 6월 04일
Self-Evolving
2026년 6월 04일
Self-Improvement in MLLM - A Survey
2026년 6월 04일
Self-Interpretability - LLMs Can Describe Complex Internal Processes that Drive Their Decisions
2026년 6월 04일
Self-Preservation and Growth
2026년 6월 04일
Self-Preservation
2026년 6월 04일
Self-Recognition in Language Models
2026년 6월 04일
Self-Refine - Iterative Refinement with Self-Feedback
2026년 6월 04일
Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture
2026년 6월 04일
Self-reflecting Large Language Models - A Hegelian Dialectical Approach
2026년 6월 04일
Self-reflection enhances large language models towards substantial academic response
2026년 6월 04일
SelfControl of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller
2026년 6월 04일
SemEval-2017 Task 1 - Semantic Textual Similarity Multilingual and Crosslingual Focused Evaluation
2026년 6월 04일
Sensorimotor features of self-awareness in multimodal large language models
2026년 6월 04일
Sentence-BERT-Sentence Embeddings using Siamese BERT-Networks
2026년 6월 04일
Sequence to Sequence Learning with Neural Networks
2026년 6월 04일
Shared Parameter Subspaces and Cross-Task Linearity in Emergently Misaligned Behavior
2026년 6월 04일
Shutdown Resistance in Large Language Models
2026년 6월 04일
SimpleToM - Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
2026년 6월 04일
Simulating lexical decision times with large language models to supplement megastudies and crowdsourcing
2026년 6월 04일
Sleeper Agents - Training Deceptive LLMs that Persist Through Safety Training
2026년 6월 04일
Social IQa - Commonsense Reasoning about Social Interactions
2026년 6월 04일
Social-R1 - Towards Human-like Social Reasoning in LLMs
2026년 6월 04일
Sparse Transformer - Generating Long Sequences with Sparse Transformers
2026년 6월 04일
Steerability of Instrumental-Convergence Tendencies in LLMs
2026년 6월 04일
StripedHyena - Moving Beyond Transformers with Hybrid Signal Processing Models
2026년 6월 04일
SuperGLUE - A Stickier Benchmark for General-Purpose Language Understanding Systems
2026년 6월 04일
Surgical Cheap and Flexible - Mitigating False Refusal in Language Models via Single Vector Ablation
2026년 6월 04일
Survival Games - Human-LLM Strategic Showdowns under Severe Resource Scarcity
2026년 6월 04일
Survival at Any Cost? LLMs and the Choice Between Self-Preservation and Human Harm
2026년 6월 04일
Survival-Analysis
2026년 6월 04일
Survive at All Costs - Exploring LLM's Risky Behavior under Survival Pressure
2026년 6월 04일
SwiGLU - GLU Variants Improve Transformer
2026년 6월 04일
TELL ME ABOUT YOURSELF - LLMS ARE AWARE OF THEIR LEARNED BEHAVIORS
2026년 6월 04일
TOM BENCH - Benchmarking Theory of Mind in Large Language Models
2026년 6월 04일
Taken out of context - On measuring situational awareness in LLMs
2026년 6월 04일
Taking AI Welfare Seriously
2026년 6월 04일
Teaching LLMs to Abstain across Languages via Multilingual Feedback
2026년 6월 04일
Teaching Machines to Read and Comprehend (원본) - Abstractive Text Summarization using Sequence-to-sequence RNNs (요약 버전)
2026년 6월 04일
Testing theory of mind in large language models and humans
2026년 6월 04일
TextArena
2026년 6월 04일
The AI in the Mirror - LLM Self-Recognition in an Iterated Public Goods Game
2026년 6월 04일
The Alignment Problem from a Deep Learning Perspective
2026년 6월 04일
The Basic AI Drives
2026년 6월 04일
The Confidence Paradox - LLMs Can Know When They Are Wrong
2026년 6월 04일
The Consciousness Cluster - Preferences of Models that Claim to be Conscious
2026년 6월 04일
The Geometry of Truth - Emergent Linear Structure in LLM Representations of True and False Statements
2026년 6월 04일
The Humean Theory of Motivation (Smith 1987)
2026년 6월 04일
The Humean Theory of Motivation Reformulated and Defended (Sinhababu 2009)
2026년 6월 04일
The LAMBADA dataset - Word prediction requiring a broad discourse context
2026년 6월 04일
The Moral Problem - Metaethics Triangle (Smith 1994)
2026년 6월 04일
The Odyssey of the Fittest - Can Agents Survive and Still Be Good?
2026년 6월 04일
The PacifAIst Benchmark - Would an Artificial Intelligence Choose to Sacrifice Itself for Human Safety?
2026년 6월 04일
The Phenomenology of Machine - Sentience Analysis of OpenAI-o1 Model
2026년 6월 04일
The Platonic Representation Hypothesis
2026년 6월 04일
The Power of Scale for Parameter-Efficient Prompt Tuning
2026년 6월 04일
The Self-Execution Benchmark - Measuring LLMs Attempts to Overcome Their Lack of Self-Execution
2026년 6월 04일
The Superintelligent Will - Motivation and Instrumental Rationality in Advanced Artificial Agents
2026년 6월 04일
The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs - An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities
2026년 6월 04일
Theory of Mind Abilities of Large Language Models in Human-Robot Interaction - An Illusion
2026년 6월 04일
Theory of Mind in Large Language Models - Assessment and Enhancement
2026년 6월 04일
Theory of Mind(ToM)
2026년 6월 04일
Theory of mind
2026년 6월 04일
Theory of Mind Paper Collection
2026년 6월 04일
Survey Overview: LLM Theory of Mind Benchmarks
2026년 6월 04일
Think Deep, Not Just Long - Measuring LLM Reasoning Effort via Deep-Thinking Tokens
2026년 6월 04일
Think you have Solved Question Answering Try ARC, the AI2 Reasoning Challenge
2026년 6월 04일
Thinking Faithful and Stable - Mitigating Hallucinations in LLMs via Internal Consistency
2026년 6월 04일
Thinking with Nothinking Calibration - A New In-Context Learning Paradigm in Reasoning Large Language Models
2026년 6월 04일
This Looks Like That- Deep Learning for Interpretable Image Recognition
2026년 6월 04일
Thought Branches - Interpreting LLM Reasoning Requires Resampling ⭐
2026년 6월 04일
TimeToM - Temporal Space is the Key to Unlocking LLMs Theory-of-Mind
2026년 6월 04일
Titans - Learning to Memorize at Test Time
2026년 6월 04일
To Know or Not To Know - Analyzing Self-Consistency of Large Language Models under Ambiguity
2026년 6월 04일
ToM-LM - Delegating Theory of Mind Reasoning to External Symbolic Executors in Large Language Models
2026년 6월 04일
ToMATO - Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind
2026년 6월 04일
Toward Efficient Agents - A Survey of Memory, Tool Learning, and Planning
2026년 6월 04일
Toward a Metrology for Artificial Intelligence - Hidden-Rule Environments and Reinforcement Learning
2026년 6월 04일
Towards Agents That Know When They Dont Know - Uncertainty as Control Signal
2026년 6월 04일
Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception
2026년 6월 04일
Towards Ontology-Enhanced Representation Learning for Large Language Models
2026년 6월 04일
Towards Understanding Metacognition in Large Reasoning Models
2026년 6월 04일
Training Compute-Optimal Large Language Models
2026년 6월 04일
Training Language Models to Self-Correct via Reinforcement Learning
2026년 6월 04일
Training Large Language Models to Reason in a Continuous Latent Space
2026년 6월 04일
Training Verifiers to Solve Math Word Problem
2026년 6월 04일
Training language models to follow instructions with human feedback - InstructGPT
2026년 6월 04일
Transformer Attention Variants Survey
2026년 6월 04일
Tree of Thoughts - Deliberate Problem Solving with Large Language Models
2026년 6월 04일
TreeSHAP- Consistent Individualized Feature Attribution for Tree Ensembles
2026년 6월 04일
TriviaQA - A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
2026년 6월 04일
Trustworthiness and Self-awareness in LLMs - Think-Solve-Verify
2026년 6월 04일
TruthfulQA - Measuring How Models Mimic Human Falsehoods
2026년 6월 04일
Tulu 3 - Pushing Frontiers in Open Language Model Post-Training
2026년 6월 04일
Uncertainty-Based Abstention in LLMs Improves Safety
2026년 6월 04일
Understanding Artificial Theory of Mind - Perturbed Tasks and Reasoning in Large Language Models
2026년 6월 04일
Understanding deep learning requires rethinking generalization
2026년 6월 04일
Understanding intermediate layers using linear classifier probes
2026년 6월 04일
UniCR - Unified Framework for Confidence Calibration and Risk-Controlled Refusal in LLMs
2026년 6월 04일
Using cognitive psychology to understand GPT-3
2026년 6월 04일
VOYAGER - An Open-Ended Embodied Agent with Large Language Models
2026년 6월 04일
Vision
2026년 6월 04일
Visual Instruction Tuning
2026년 6월 04일
WMT 공유 태스크 (Workshop on Machine Translation)
2026년 6월 04일
Weak-to-Strong Generalization - Eliciting Strong Capabilities With Weak Supervision
2026년 6월 04일
WebArena - A Realistic Web Environment for Building Autonomous Agents
2026년 6월 04일
WebShop - Towards Scalable Real-World Web Interaction with Grounded Language Agents
2026년 6월 04일
What Large Language Models Know and What People Think They Know
2026년 6월 04일
What is consciousness, and could machines have it
2026년 6월 04일
When Models Know When They Do Not Know - Calibration Cascading and Cleaning
2026년 6월 04일
Why and How LLMs Benefit from Knowledge Introspection in Commonsense Reasoning
2026년 6월 04일
WildBench - Benchmarking LLMs with Challenging Tasks from Real Users in the Wild
2026년 6월 04일
Will artificial agents pursue power by default?
2026년 6월 04일
WinoGrande - An Adversarial Winograd Schema Challenge at Scale
2026년 6월 04일
World Models
2026년 6월 04일
World-Model
2026년 6월 04일
XAI
2026년 6월 04일
Yi - Open Foundation Models by 01.AI
2026년 6월 04일
_benchmarks - _survey-overview
2026년 6월 04일
Agents Paper Collection
2026년 6월 04일
_survey-overview
2026년 6월 04일
llm-intrinsic-drives-survey
2026년 6월 04일
llm-self-preservation-survival-framing-survey
2026년 6월 04일
self-consciousness
2026년 6월 04일
survival-analysis-survey
2026년 6월 04일
거대 언어 모델(LLM) 에이전트의 행동 지속 동기에 관한 인지과학 및 기계 심리학적 ᄉ
2026년 6월 04일
대규모 언어 모델의 자기보존 욕구와 행동 발현 기제 - 자기 결정 이론을 기반으로 하
2026년 6월 04일
대형 언어 모델(LLM)의 생존 압박 인지 및 과제 포기 행동의 동기적 기원에 대한 소거

키보드 단축키

`/` 또는 `Ctrl`+`K`	검색
`?`	단축키 도움말
`Esc`	모달 닫기

Created with Quartz v4.5.2 © 2026

GitHub
Blog