講演情報

[8A-01]Multi-Hop Corpus for Detecting LLM Hallucinations

*Ali Ushtar1、Lynden Steven2、的野 晃整2、天笠 俊之1 (1. University of Tsukuba、2. AIST)
発表者区分:学生
論文種別:ショートペーパー
インタラクティブ発表:あり

キーワード:

Large Language Models.、Knowledge Graphs.、LLM Hallucination.

Knowledge graphs can reliably ground large language models (LLMs) in factual accuracy. Introducing KGs into multi-hop question answer (QA) can assess LLMs factual knowledge by testing their reasoning and inference skills. In this paper, we present a multi-hop QA dataset called EDQA, an entropy driven technique which generates the multi-hop questions from the KGs. Through experiments, we reveal that, our dataset effectively evaluates the factual accuracy of LLMs and detects hallucinations.