A Comparative Study on the Language Networks Based on Co-occurrence, Syntax, Semantics

ZHAO Yiyi, LIU Haitao

Journal of Chinese Information Processing, 2014, Vol. 28, Issue 5: 24-31.
Language Analysis and Cognitive Computing


Abstract

Applying network methods to language research is a new trend in the big-data era. Language is a multi-level system of symbols, so the choice of linguistic unit for the network nodes, and of the relation between units for the network links, affects the structure and function of the resulting language network. This paper surveys methods for constructing networks that take Chinese words as nodes and link them by one of three relations: co-occurrence (adjacency of words), syntax (dependency grammar), and semantics (conceptual relations). Building all three networks from the same text shows that the diameter and average path length of the syntactic network are much smaller than those of the co-occurrence network, and that content words occupy the central node positions in the semantic network. This suggests that network analysis should still be guided by sound linguistic theory, and that the differences among the various language networks are best explained from within linguistics.
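As a rough illustration of the quantities the abstract compares, the following is a minimal sketch of a word co-occurrence network built from adjacent words, together with its diameter and average path length. It is not the authors' procedure: the function names, the toy English sentences, and the choice to treat links as undirected and unweighted are all assumptions made for this example. The same `path_stats` function could in principle be applied to an edge list derived from dependency or semantic relations.

```python
from collections import defaultdict, deque


def cooccurrence_edges(sentences):
    """Yield an undirected edge for every pair of adjacent words in a sentence."""
    for tokens in sentences:
        for left, right in zip(tokens, tokens[1:]):
            if left != right:  # skip self-loops from immediately repeated words
                yield left, right


def build_graph(edges):
    """Build an undirected adjacency map; repeated edges collapse into one."""
    graph = defaultdict(set)
    for a, b in edges:
        graph[a].add(b)
        graph[b].add(a)
    return dict(graph)


def bfs_distances(graph, source):
    """Return shortest path lengths (in edges) from source to each reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbour in graph[node]:
            if neighbour not in dist:
                dist[neighbour] = dist[node] + 1
                queue.append(neighbour)
    return dist


def path_stats(graph):
    """Return (diameter, average path length) over all connected node pairs."""
    lengths = []
    for node in graph:
        dist = bfs_distances(graph, node)
        lengths.extend(d for other, d in dist.items() if other != node)
    if not lengths:
        return 0, 0.0
    return max(lengths), sum(lengths) / len(lengths)


# Toy, pre-segmented input (hypothetical; not the text used in the paper).
sentences = [
    ["the", "cat", "chased", "the", "mouse"],
    ["the", "mouse", "ate", "the", "cheese"],
]
graph = build_graph(cooccurrence_edges(sentences))
diameter, avg_path = path_stats(graph)
print(diameter, round(avg_path, 3))
```

In this toy network the function word "the" is adjacent to every other word, so it acts as a hub and keeps all paths short. Chinese text would first need word segmentation, which this sketch assumes has already been done.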

Key words

co-occurrence network / syntactic network / semantic network

Cite this article

ZHAO Yiyi, LIU Haitao. A Comparative Study on the Language Networks Based on Co-occurrence, Syntax, Semantics. Journal of Chinese Information Processing. 2014, 28(5): 24-31


Funding

National Social Science Fund of China (11&ZD188, 14CYY046)