Review
ZHANG Gui-ping, YAO Tian-shun, YIN Bao-sheng, CAI Dong-feng, SONG Yan
2007, 21(3): 34-39.
Bilingual corpus is one of the most important parts in translation memory system. To extract more association examples which meet the present needs of users from limited scale of bilingual corpus is the main content of the research of translation memory technology. First of all, this paper analyzes the limits of the current example search method. Based on the knowledge representation of the bilingual corpus, this paper proposes multi-strategy based association example extraction mechanism, that is, to extract association example by using comprehensively the methods of tree matching, sentence edit-distance calculating, phrase chunk matching, lexicon semantic generalization, extended information based optimization (for instance, the information on sentence source, major belonged to, application frequency, etc.). Experimental results indicate that the method effectively improved the recall quantity and quality of association example and the assistant effect to users.