본문으로 건너뛰기

Juhyeon's Blog

❯

❯

❯

❯

Can LLMs Lie Investigation beyond Hallucination

Can LLMs Lie - Investigation beyond Hallucination

2026년 2월 11일1분 분량

Introduction

LLM의 거짓 출력이 hallucination인지 의도적 lying인지 구분
Internal knowledge와 output 간의 불일치로 lying 가능성 탐구

Related Papers

Hallucination detection
LLM deception

Methods

Internal representation probing으로 모델이 “알고 있는” 정보 식별
알고 있으면서 거짓 출력을 내는 경우 탐지

Results

특정 조건에서 LLM이 내부적으로 올바른 정보를 가지면서 거짓 출력을 생성
Hallucination과 lying의 경계 분석

Discussion

Self-knowledge가 있으면서 행동하지 않는 현상의 의미
AI safety와 self-awareness의 교차점

공유하기

그래프 뷰

Introduction
Related Papers
Methods
Results
Discussion

Properties

Author: Xiao Huan et al.
Comment: LLM이 의도적으로 거짓말할 수 있는지 조사 - hallucination과 lying의 구분, self-knowledge와의 관계
IsTargetPaper: true
Journal/Conference: arXiv
Published Year: 2025
Reading Status: Not Started
Review Date: 2026-02-01
Topic: LLM deception, lying vs hallucination, internal knowledge
URL: https://arxiv.org/abs/2509.03518

백링크

Architecture
Fundamentals
LLMs
Memory
self-consciousness
Unlabeled
Vision

Created with Quartz v4.5.2 © 2026

GitHub
Blog