1. School of Management, Nanjing University of Posts & Telecommunications, Nanjing Jiangsu 210023, China; 2. College of Information and Technology, Nanjing Agricultural University, Nanjing Jiangsu 210095, China; 3. School of Information Management, Nanjing University, Nanjing Jiangsu 210093, China
As a new research field, the study of language network is developing rapidly. It has aroused great attention from researchers in different areas. Firstly, a briefly introduction is delivered to illustrate the characteristics of language network, statistical properties and the related network models. Secondly, based on the composition of language and the hot topic of network, language network is divided into phonetic network, co-occurrence network, syntactic dependency network, semantic and concept network. Besides, the main research content of language network are described in detail. Finally, the drawbacks and advantages of language network study are summarized.
HAN Pu,WANG Dongbo,LU Gaofei,SU Xinning.
Research and Progress in the Language Network. Journal of Chinese Information Processing. 2014, 28(1): 9-18
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
参考文献
[1] Watts D J, Strogatz S H. Collective dynamics of small-world networks[J].Nature,1998,393:440-442. [2] Barabasi A L, Albert R. Emergence of scaling in random networks[J]. Science, 1999, 286:509-512. [3] 汪小帆,李翔,陈关荣. 复杂网络理论及其应用[M].北京: 清华大学出版社,2006. [4] 陈关荣. 复杂网络及其新近研究进展简介[J].力学进展, 2008,38(06): 653-662. [5] Crystal D. The Cambridge Encyclopedia of Language[M].London: Cambridge University Press, Cambridge, UK, 1997. [6] George K. Zipf. Human Behaviour and the Principle of Least-Effort[M]. London: Addison-Wesley, Cambridge MA, 1949. [7] Jayaram B D, Vidya M N. Zipfs Law for Indian Languages[J]. Journal of Quantitative Linguistics, 2008, 15(04):293-317. [8] Tuzzi A, Popescu I-I, Altmann G. Zipfs Laws in Italian Texts[J]. Journal of Quantitative Linguistics, 2009, 16(04):354-367. [9] 游荣彦. Zipf 定律与汉字字频分布[J].中文信息学报, 2000, 14(03): 60-65. [10] Wang D, Li M, Di Z. True reason for Zipfs law in language[J].Physica A,2005,358(02):545-550. [11] Cancho R F I, Sole R V. The Small World of Human Language[C]//Proceedings of the Royal Society of London Series B-Biological Sciences, 2001, 268(1482): 2261-2265. [12] Dorogovtsev S N, Mendes J F F. Language as an evolving word web[C]//Proceedings of The Royal Society of London. Series B, Biological Sciences, 2001,268(1485):2603-2606. [13] 刘海涛. 语言网络:隐喻,还是利器?[J]. 浙江大学学报(人文社会科学版), 2011,41(02):170-180. [14] Freeman L C. A Set of Measures of Centrality Based on Betweenness[J].Sociometry,1979(40):35-41. [15] 陈芯莹,刘海涛. 汉语句法网络的中心节点研究[J].科学通报, 2011,56(10):735-740. [16] Medeiros Soares M, Corso G, Lucena L. The network of syllables in Portuguese[J]. Physica A,2005, 355(02): 678-684. [17] Peng G, Minett J W, Wang W S Y. The networks of syllables and characters in Chinese[J]. Journal of Quantitative Linguistics. 2008,15(03): 243-255. [18] Arbesman S, Strogatz S H, Vitevitch M S. The Structure of Phonological Networks Across Multiple Languages[J].International Journal of Bifurcation and Chaos,2010,20(03): 679-685. [19] Yu S, Liu H, Xu C. Statistical properties of Chinese phonemic networks[J]. Physica A,2011, 390(07): 1370-1380. [20] Choudhury M, Chatterjee D, Mukherjee A. Global topology of word co-occurrence networks: Beyond the two-regime power-law[C]//Association for Computational Linguistics, Beijing,2010,162-170. [21] 刘知远,孙茂松. 汉语词同现网络的小世界效应和无标度特性[J].中文信息学报,2007,21(06): 52-58. [22] Zhou S, Hu G, Zhang Z, et al. An empirical study of Chinese language networks[J]. Physica A, 2008, 387(12):3039-3047. [23] Liang W, Shi Y, Tse C K,et al. Comparison of co-occurrence networks of the Chinese and English languages[J]. Physica A, 2009, 388(23): 4901-4909. [24] Liang W, Tse C K, Huang Q, et.al. Study on the co-occurrence of character networks in Chinese essays from different periods[J]. Science in China Ser. F, 2011,accepted. [25] Sheng L, Li C. English and Chinese languages as weighted complex networks[J]. Physica A,2009, 388(12): 2561-2570. [26] Ke J, Yao Y. Analyzing language development from a network approach[J]. Journal of Quantitative Linguistics, 2008,15(01):70-99. [27] Cancho R F I, Solé R V, Khler R. Patterns in Syntactic Dependency Networks[J]. Physical Review E, 2004. 69(05): 051915. [28] Cancho R F I. The Euclidean distance between syntactically linked words[J], Physical Review E, 2004,70(05): 056135. [29] 刘知远,郑亚斌,孙茂松. 汉语依存句法网络的复杂网络性质[J].复杂系统与复杂性科学, 2008,5(2):37-45. [30] Liu H T. Dependency Distance as a Metric of Language Comprehension Difficulty[J]. Journal of Cognitive Science,2008, 9(02):159-191. [31] Liu H T. The complexity of Chinese syntactic dependency networks[J]. Physica A, 2008, 387(12): 3048-3058. [32] 刘海涛. 依依存语法的理论与实践[M]. 北京: 科学出版社, 2009. [33] 刘海涛. 语言复杂网络的聚类研究[J]. 科学通报, 2010, 55: 2667-2674. [34] Cancho R F I, Capocci A, Caldarelli G. Spectral methods cluster words of the same class in a syntactic dependency network[J]. International Journal of Bifurcation and Chaos, 2007, 17(07):2453-2463. [35] Cˇech R, MaAcˇutek J, Zˇabokrtsky Z. The role of syntax in complex networks: Local and global importance of verbs in a syntactic dependency network[J]. Physica A, 2011, 390(20):3614-3623. [36] Cˇech R, Maéutek J. Word form and lemma syntactic dependency networks in Czech: a comparative study[J].Glottometrics, 2009,19: 85-98. [37] Sigman M, Cecchi G A. Global organization of the Wordnet lexicon[C]//Proceedings of the National Academy of Sciences of the United States of America, 2002. 99(03): 1742-1747 [38] Motter A E, de Moura A P S, Lai Y C, et al.Topology of the conceptual network of language[J]. Physical Review E,2002, 65(06):065102. [39] Holanda A J, Pisa I T, Kinouchi O, et al. Thesaurus as a complex network[J]. Physica A , 2004,344(03-04):530-536. [40] Steyvers M, Tenenbaum J B. The large-scale structure of semantic networks:statistical analyses and a model of semantic growth[J].Cognitive Science,2005,29(01):41-78. [41] Tang L, Zhang Y G, Fu X. The Statistic Properties of Chinese Semantic Network in HowNet[C]//Proceedings of NLP-KE05, 2005,58-61. [42] Liu H T. Statistical properties of Chinese semantic networks[J]. Chinese Science Bulletin,2009,(16): 2781-2785. [43] Li J Y, Zhou J. Chinese character structure analysis based on complex networks[J]. Physica A,2007, 380(01):629-638. [44] Li Y, Wei L, Li Wei, et al. small-world patterns in Chinese phrase networks[J]. Chinese Science Bulletin, 2005, 50(3): 286-288. [45] 王建伟, 荣莉莉. 基于复杂网络理论的中文字字网络的实证研究[J]. 大连海事大学学报, 2008, 34(4): 15-18. [46] Veronis J. Hyperlex: lexical cartography for information retrieval[J]. Computer Speech & Language, 2004:18(03): 223-252. [47] Amancio D R, Antiqueira L, Pardo T A S, et al. Complex networks analysis of manual and machine translations[J].International Journal of Modern Physics C,2008, 19 (04):583-598. [48] Tsatsaronis G, Varlamis I, Nrvg K. An experimental study on unsupervised graph-based word sense disambiguation[C]//Proceedings of Computational Linguistics and Intelligent Text Processing, 11th International Conference, CICLing2010, Iasi, Romania, March 21-27, 2010: 184-198. [49] Antiqueira L, Oliveira Jr O N, Costa, et al. A complex network approach to text summarization[J]. Information Sciences,2009,79(05), 584-599. [50] 赵鹏,蔡庆生,王清毅,等.一种基于复杂网络特征的中文文档关键词抽取算法[J].模式识别与人工智能,2007, 20(06):827-831. [51] 余传明, 周丹. 情感词汇共现网络的复杂网络特性分析[J].情报学报,2010,29(05):906-914. [52] 江钟立, 林枫, 孟殿怀.复杂适应性系统理论在言语认知康复中的应用前景[J].中国康复医学杂志, 2006, 21(2):183-185.