第七届全国机器翻译研讨会机器翻译评测总结

赵红梅,吕雅娟,贲国生,黄 云,刘 群

PDF(4260 KB)
PDF(4260 KB)
中文信息学报 ›› 2012, Vol. 26 ›› Issue (1) : 22-31.
综述

第七届全国机器翻译研讨会机器翻译评测总结

  • 赵红梅,吕雅娟,贲国生,黄 云,刘 群
作者信息 +

Summary on CWMT2011 MT Translation Evaluation

  • ZHAO Hongmei, LV Yajuan, BEN Guosheng, HUANG Yun, LIU Qun
Author information +
History +

摘要

该文介绍了第七届全国机器翻译研讨会(CWMT2011)机器翻译评测的具体情况。本次评测重点关注各种语言到汉语的翻译,除了汉英、英汉、日汉三个语言对以外,评测还新增了五种民族语言(藏语、蒙古语、维吾尔语、哈萨克语、柯尔克孜语)到汉语的翻译评测。共有19家国内外单位的165个系统参加此次评测。除了介绍评测项目的设置、评测数据的准备、评测流程、参评单位等,本文还重点介绍了CWMT2011的评测结果,并对评测结果进行了分析,用实例说明了与评测结果相关的几个因素 源语言与目标语言是否相似、评测领域是否集中、测试集与训练及开发集语料是否相似、训练语料的规模、参评系统的技术和成熟度等。

Abstract

The 7th China Workshop on Machine Translation(CWMT2011)Evaluation continues the ongoing series of evaluation of machine translation technology in China. This paper presents an overall introduction to CWMT2011 evaluation. This evaluation focuses on the evaluation of MT translation from other languages to Chinese, especially, from ethnic languages (including Mongolian, Tibetan, Uyghur, Kazakh and Kirghiz). 165 systems of 19 participants from home and aboard have taken part in the evaluation. The paper introduces the evaluation tasks, the evaluation data, the evaluation procedure and the participants. We also discuss the evaluation results in details. The examples from this evaluation show that the evaluation result depends on the following factorsthe similarity between the source language and the target language, the range of the field which the evaluation task involves, the similarity between the test data and the training/development data, the size of the training data, the technology and the maturity of the participating system, and etc.
Key wordsmachine translation; machine translation evaluation; BLEU-SBP; WoodPecker evaluation

关键词

机器翻译 / 机器翻译评测 / BLEU-SBP / WoodPecker评测

Key words

machine translation / machine translation evaluation / BLEU-SBP / WoodPecker evaluation

引用本文

导出引用
赵红梅,吕雅娟,贲国生,黄 云,刘 群. 第七届全国机器翻译研讨会机器翻译评测总结. 中文信息学报. 2012, 26(1): 22-31
ZHAO Hongmei, LV Yajuan, BEN Guosheng, HUANG Yun, LIU Qun. Summary on CWMT2011 MT Translation Evaluation. Journal of Chinese Information Processing. 2012, 26(1): 22-31

参考文献

[1] 刘群,赵红梅.第五届全国机器翻译研讨会(CWMT2009)评测报告[R].第五届全国机器翻译研讨会(CWMT2009),2009年10月16~17日,南京.
[2] 赵红梅,吕雅娟,贲国生,等.第七届全国机器翻译研讨会(CWMT2011)评测报告[R].第七届全国机器翻译研讨会(CWMT2011),2011年9月23~24日,厦门.
[3] David Chiang, Steve DeNeefe, Yee Seng Chan, et al. 2008. Decomposability of translation metrics for improved evaluation and efficient algorithms[C]//Proc. EMNLP 2008, pages 610-619.
[4] Michael Collins, Philipp Koehn, Ivona KuDcˇerová. 2005. Clause restructuring for statistical machine translation[C]//Proc. ACL 2005, pages 531-540.
[5] Ming Zhou, Bo Wang, Shujie Liu, et al. 2008. Diagnostic Evaluation of Machine Translation Systems Using Automatically Constructed Linguistic Check-Points[C]//Proc. Coling 2008, pages 1121-1128.

基金

国家自然科学基金项目(60873167);国家青年科学基金项目(61100082)
PDF(4260 KB)

646

Accesses

0

Citation

Detail

段落导航
相关文章

/