高级检索

基于事实验证的五链式机器翻译幻觉检测智能体

A Fact-verification-based Five-link Chain Machine Translation Hallucination Detection Agent

  • 摘要: 机器翻译幻觉现象指机器译文中包含与源语言句子完全无关的翻译信息。当前基于模型内部状态的幻觉检测方法与基于外部工具的幻觉检测方法均存在显著不足,前者依赖翻译模型内部推理过程产生的额外信息,而后者使用的底层模型优化目标与幻觉检测任务存在偏差。根据机器翻译幻觉现象的特点,该文提出了一种基于事实验证的五链式机器翻译幻觉检测智能体,该智能体通过构建的五条链式流程,预测机器译文中的事实与源语言中的事实是否一致,从而进一步检测机器翻译幻觉现象。在公开数据集HalOmi句子级高资源自然幻觉检测子任务上的实验结果表明,基于事实验证的五链式机器翻译幻觉检测智能体显著提高了机器翻译幻觉检测的性能,且在大多数语言对上的表现超过了参与对比的最优方法。

     

    Abstract: Machine translation hallucination refers to the phenomenon that the translated text contains information unrelated to the source language sentence. This paper proposes a fact-verification-based five-chain machine translation hallucination detection agent. This agent uses five sequential processes to determine whether the facts in the machine translation align with those in the source language, thus enabling the detection of hallucination phenomena in machine translation. Experimental results on the HalOmi sentence-level high-resource natural hallucination detection subtask show that the proposed method significantly improves the performance of hallucination detection and outperforms the best competing methods for most language pairs.

     

/

返回文章
返回