Abstract:Aiming at the task of cross-domain few-shot relation classification, a Piecewise Attention Matching Network (PAMN) is proposed. To improve the sentence similarity algorithm for the task of relation extraction, two sentences are matched with their segmentations in PAMN, which can better estimate the similarity between relation classification instances. PAMN consists of encoding layer and sentence matching layer. At the encoding layer, PAMN uses the pre-trained model BERT to encode the sentence pair, divides the sentence into three segments according to the location of entity, and adapts the different domain through dynamic segmentation length. At the sentence matching layer, PAMN uses a text matching method based on the segmental attention mechanism to calculate the similarity between the query instance and each instance in the support set, and the average is taken as the similarity between the query instance and the support set. The experimental results show that PAMN has achieved the best results on the evaluation list in the field of FewRel 2.0 adaptation tasks.
[1] Snell J, Swersky K, Zemel R.Prototypical networks for few-shot learning[C]//Proceedings of Advances in Neural Information Processing Systems. Cambridge, MA, USA, MIT Press: Curran Associates, 2017: 4077-4087. [2] Garcia V, Estrach J B. Few-shot learning with graph neural networks[C]//Proceedings of the 6th International Conference on Learning Representations, 2018. [3] Ye Z X, Ling Z H. Multi-level matching and aggregation network for few-shot relation classification[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Florence, Italy: Association for Computational Linguistics, 2019: 2872-2881. [4] Han X, Zhu H, Yu P, et al. FewRel: A large-scale supervised few-shot relation classification dataset with state-of-the-art evaluation[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing. Brussels, Belgium: Association for Computational Linguistics, 2018: 4803-4809. [5] Gao T, Han X, Zhu H, et al. FewRel 2.0: Towards more challenging few-shot relation classification[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. Hong Kong, China: Association for Computational Linguistics, 2019: 6251-6256. [6] Devlin J, Chang M W, Lee K, et al. BERT: Pretraining of deep bidirectional transformers for language understanding[C]//Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Minneapolis, USA, 2019: 4171-4186. [7] Koch G, Zemel R, Salakhutdinov R. Siamese neural networks for one-shot image recognition[C]//Proceedings of ICML Deep Learning Workshop, 2015. [8] Vinyals O, Blundell C, Lillicrap T, et al. Matching networks for one shot learning[C]//Proceedings of Advances in Neural Information Processing Systems. Cambridge, MA, MIT Press: Curran Associates, 2016: 3630-3638. [9] Geng R, Li B, Li Y, et al. Induction networks for few-shot text classification[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. Hong Kong, China: Association for Computational Linguistics, 2019: 3895-3904. [10] Soares L B, Fitz Gerald N, Ling J, et al. Matching the blanks: Distributional similarity for relation learning[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Florence, Italy, 2019: 2895-2905. [11] Zeng D, Liu K, Chen Y, et al. Distant supervision for relation extraction via piecewise convolutional neural networks[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing. Lisbon, Portugal: Association for Computational Linguistics, 2015: 1753-1762. [12] Glorot X, Bengio Y. Understanding the difficulty of training deep feedforward neural networks[C]//Proceedings of the 13th International Conference on Artificial Intelligence and Statistics. Sardinia, Italy, 2010: 249-256. [13] Loshchilov I, Hutter F. Fixing weight decay regularization in Adam[C]//Proceedings of the ICLR, Conference Blind Submission, Vancouver, BC, Canada, 2018: 30.