Consistent Responses to Paraphrased Questions as Evidence Against Hallucination: A Study on Hallucinations in LLMs

Tara Zare; Mehrnoush Shamsfard

Published in: International Journal of Web Research, Vol. 8, Issue 4

Publication year: 1404 (Solar Hijri calendar)

Document type: Journal article

Language: English

Length: 9 pages (PDF)

Permanent link to this paper:

https://civilica.com/doc/2408270

National scientific document ID:

JR_IJWR-8-4_002

Indexing date: 3 Aban 1404

Abstract:

The increasing adoption of large language models (LLMs) has intensified concerns about hallucinations, that is, outputs that are syntactically fluent but factually incorrect. In this paper, we propose a method for detecting such hallucinations by evaluating the consistency of model responses to paraphrased versions of the same question. The underlying assumption is that if a model produces consistent answers across different paraphrases, the output is more likely to be accurate. To test this method, we developed a system that generates multiple paraphrases of each question and analyzes the consistency of the corresponding responses. Experiments were conducted using two LLMs, GPT-4o and LLaMA 3-70B Chat, on both Persian and English datasets. The method achieved an average accuracy of 99.5% for GPT-4o and 98% for LLaMA 3-70B, indicating the effectiveness of our approach in identifying hallucination-free outputs across languages. Furthermore, by automating the consistency evaluation with an instruction-tuned language model, we enabled scalable and unbiased detection of semantic agreement across paraphrased responses.
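
The consistency check described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`consistency_score`, `naive_agree`, `is_consistent`), the callables `paraphrase` and `ask`, the comparison against the answer to the original question, and the threshold are all assumptions made for illustration. In particular, `naive_agree` is a simple string-matching stand-in for the instruction-tuned judge model that the paper uses to decide semantic agreement.

```python
from typing import Callable, List


def normalize(text: str) -> str:
    """Lowercase and drop punctuation so trivially different strings compare equal."""
    kept = "".join(ch for ch in text.lower() if ch.isalnum() or ch.isspace())
    return " ".join(kept.split())


def naive_agree(a: str, b: str) -> bool:
    """Hypothetical stand-in judge: exact match after normalization.

    The paper automates this step with an instruction-tuned LLM instead.
    """
    return normalize(a) == normalize(b)


def consistency_score(
    question: str,
    paraphrase: Callable[[str], List[str]],
    ask: Callable[[str], str],
    agree: Callable[[str, str], bool] = naive_agree,
) -> float:
    """Fraction of paraphrase answers that agree with the original answer.

    `paraphrase` and `ask` are placeholders for a paraphrase generator and
    the LLM under test; both are assumptions of this sketch.
    """
    reference = ask(question)
    variants = paraphrase(question)
    if not variants:
        raise ValueError("need at least one paraphrase")
    matches = sum(agree(reference, ask(v)) for v in variants)
    return matches / len(variants)


def is_consistent(score: float, threshold: float = 1.0) -> bool:
    """Treat an answer as consistent when the score reaches the (assumed) threshold."""
    return score >= threshold


if __name__ == "__main__":
    # Canned answers stand in for real model calls.
    canned = {
        "What is the capital of France?": "Paris.",
        "Which city is France's capital?": "paris",
        "France's capital city is?": "Lyon",
    }
    variants = list(canned)[1:]
    score = consistency_score(
        "What is the capital of France?",
        paraphrase=lambda q: variants,
        ask=lambda q: canned[q],
    )
    print(score, is_consistent(score))
```

Passing the model and the paraphraser as callables keeps the sketch independent of any particular LLM API; a real judge model could be plugged in through the `agree` parameter.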

Keywords:

Large Language Models, Hallucination of Large Language Models, Inconsistency Detection, Paraphrasing

Authors

Tara Zare

Faculty of Computer Science and Engineering, Shahid Beheshti University, Tehran, Iran.

Mehrnoush Shamsfard

Faculty of Computer Science and Engineering, Shahid Beheshti University, Tehran, Iran.
