高级检索

基于持续知识蒸馏的零代词消解和非零共指消解

Zero-pronoun Resolution and Non-zero Based Coreference Resolution on Continuous Knowledge Distillation

  • 摘要: 零代词消解旨在识别省略的代词及其指代对象,而非零共指消解是指对文本中所有指向同一实体的表达或提及进行聚类和归并。尽管基于深度神经网络的零代词解析和非零共指消解模型能够有效学习零代词和候选先行词的语义信息,但因高质量数据集和平行语料的稀缺,模型往往做出局部决策,忽视了当前决策对未来预测的影响。该文提出一种基于持续知识蒸馏的联合处理零代词消解和非零共指消解方法,将公开模型的知识传递至本地模型以提高解析效率,其核心思想是通过批次化的持续蒸馏,将公共模型的知识传递给本地模型,以提升零代词解析的效率。同时结合端到端神经网络模型捕获词元间的有用信息,并且保持词元的原始顺序不受干扰,使得模型能够有效利用零代词与提及间的互斥关系,从而在后续共指预测中做出更准确的判断。实验结果表明,该方法在多个指标上优于基线模型,具有良好的鲁棒性。

     

    Abstract: Zero-pronoun resolution identifies omitted pronouns and their referents, while non-zero coreference resolution clusters and merges expressions that refer to the same entity. This study proposes a continuous knowledge distillation method for jointly handling zero-pronoun resolution and non-zero coreference resolution. The method uses batch-based continuous distillation, where knowledge is passed from the public model to the local model. An end-to-end neural model captures token relationships while preserving the original sequence. This approach leverages the mutually exclusive relationship between zero pronouns and mentions, resulting in more accurate coreference predictions. Experimental results show that the proposed method outperforms baseline models in multiple metrics.

     

/

返回文章
返回