语言网络研究进展

韩普,王东波,路高飞,苏新宁

PDF(1390 KB)
PDF(1390 KB)
中文信息学报 ›› 2014, Vol. 28 ›› Issue (1) : 9-18.
综述与前瞻

语言网络研究进展

  • 韩普1,王东波2,路高飞3,苏新宁3
作者信息 +

Research and Progress in the Language Network

  • HAN Pu1,WANG Dongbo2,LU Gaofei3,SU Xinning3
Author information +
History +

摘要

语言网络作为一个新的研究领域,其研究正在迅速崛起,目前已经吸引了不少领域的研究者们的关注。该文首先简要介绍了语言网络的特点、常用的统计特征以及相关的网络模型;其次,根据语言构成单位以及当前语言网络研究热点,将语言网络分为语音网络、共现网络、依存句法网络、概念语义网络,并详细介绍了各类语言网络研究的主要进展。最后总结了语言网络研究的现状并给出了展望。

Abstract

As a new research field, the study of language network is developing rapidly. It has aroused great attention from researchers in different areas. Firstly, a briefly introduction is delivered to illustrate the characteristics of language network, statistical properties and the related network models. Secondly, based on the composition of language and the hot topic of network, language network is divided into phonetic network, co-occurrence network, syntactic dependency network, semantic and concept network. Besides, the main research content of language network are described in detail. Finally, the drawbacks and advantages of language network study are summarized.

关键词

语言网络 / 小世界现象 / 无尺度分布

Key words

language network / small-world phenomenon / scale-free distribution

引用本文

导出引用
韩普,王东波,路高飞,苏新宁. 语言网络研究进展. 中文信息学报. 2014, 28(1): 9-18
HAN Pu,WANG Dongbo,LU Gaofei,SU Xinning. Research and Progress in the Language Network. Journal of Chinese Information Processing. 2014, 28(1): 9-18

参考文献

[1] Watts D J, Strogatz S H. Collective dynamics of small-world networks[J].Nature,1998,393:440-442.
[2] Barabasi A L, Albert R. Emergence of scaling in random networks[J]. Science, 1999, 286:509-512.
[3] 汪小帆,李翔,陈关荣. 复杂网络理论及其应用[M].北京: 清华大学出版社,2006.
[4] 陈关荣. 复杂网络及其新近研究进展简介[J].力学进展, 2008,38(06): 653-662.
[5] Crystal D. The Cambridge Encyclopedia of Language[M].London: Cambridge University Press, Cambridge, UK, 1997.
[6] George K. Zipf. Human Behaviour and the Principle of Least-Effort[M]. London: Addison-Wesley, Cambridge MA, 1949.
[7] Jayaram B D, Vidya M N. Zipfs Law for Indian Languages[J]. Journal of Quantitative Linguistics, 2008, 15(04):293-317.
[8] Tuzzi A, Popescu I-I, Altmann G. Zipfs Laws in Italian Texts[J]. Journal of Quantitative Linguistics, 2009, 16(04):354-367.
[9] 游荣彦. Zipf 定律与汉字字频分布[J].中文信息学报, 2000, 14(03): 60-65.
[10] Wang D, Li M, Di Z. True reason for Zipfs law in language[J].Physica A,2005,358(02):545-550.
[11] Cancho R F I, Sole R V. The Small World of Human Language[C]//Proceedings of the Royal Society of London Series B-Biological Sciences, 2001, 268(1482): 2261-2265.
[12] Dorogovtsev S N, Mendes J F F. Language as an evolving word web[C]//Proceedings of The Royal Society of London. Series B, Biological Sciences, 2001,268(1485):2603-2606.
[13] 刘海涛. 语言网络:隐喻,还是利器?[J]. 浙江大学学报(人文社会科学版), 2011,41(02):170-180.
[14] Freeman L C. A Set of Measures of Centrality Based on Betweenness[J].Sociometry,1979(40):35-41.
[15] 陈芯莹,刘海涛. 汉语句法网络的中心节点研究[J].科学通报, 2011,56(10):735-740.
[16] Medeiros Soares M, Corso G, Lucena L. The network of syllables in Portuguese[J]. Physica A,2005, 355(02): 678-684.
[17] Peng G, Minett J W, Wang W S Y. The networks of syllables and characters in Chinese[J]. Journal of Quantitative Linguistics. 2008,15(03): 243-255.
[18] Arbesman S, Strogatz S H, Vitevitch M S. The Structure of Phonological Networks Across Multiple Languages[J].International Journal of Bifurcation and Chaos,2010,20(03): 679-685.
[19] Yu S, Liu H, Xu C. Statistical properties of Chinese phonemic networks[J]. Physica A,2011, 390(07): 1370-1380.
[20] Choudhury M, Chatterjee D, Mukherjee A. Global topology of word co-occurrence networks: Beyond the two-regime power-law[C]//Association for Computational Linguistics, Beijing,2010,162-170.
[21] 刘知远,孙茂松. 汉语词同现网络的小世界效应和无标度特性[J].中文信息学报,2007,21(06): 52-58.
[22] Zhou S, Hu G, Zhang Z, et al. An empirical study of Chinese language networks[J]. Physica A, 2008, 387(12):3039-3047.
[23] Liang W, Shi Y, Tse C K,et al. Comparison of co-occurrence networks of the Chinese and English languages[J]. Physica A, 2009, 388(23): 4901-4909.
[24] Liang W, Tse C K, Huang Q, et.al. Study on the co-occurrence of character networks in Chinese essays from different periods[J]. Science in China Ser. F, 2011,accepted.
[25] Sheng L, Li C. English and Chinese languages as weighted complex networks[J]. Physica A,2009, 388(12): 2561-2570.
[26] Ke J, Yao Y. Analyzing language development from a network approach[J]. Journal of Quantitative Linguistics, 2008,15(01):70-99.
[27] Cancho R F I, Solé R V, Khler R. Patterns in Syntactic Dependency Networks[J]. Physical Review E, 2004. 69(05): 051915.
[28] Cancho R F I. The Euclidean distance between syntactically linked words[J], Physical Review E, 2004,70(05): 056135.
[29] 刘知远,郑亚斌,孙茂松. 汉语依存句法网络的复杂网络性质[J].复杂系统与复杂性科学, 2008,5(2):37-45.
[30] Liu H T. Dependency Distance as a Metric of Language Comprehension Difficulty[J]. Journal of Cognitive Science,2008, 9(02):159-191.
[31] Liu H T. The complexity of Chinese syntactic dependency networks[J]. Physica A, 2008, 387(12):
3048-3058.
[32] 刘海涛. 依依存语法的理论与实践[M]. 北京: 科学出版社, 2009.
[33] 刘海涛. 语言复杂网络的聚类研究[J]. 科学通报, 2010, 55: 2667-2674.
[34] Cancho R F I, Capocci A, Caldarelli G. Spectral methods cluster words of the same class in a syntactic dependency network[J]. International Journal of Bifurcation and Chaos, 2007, 17(07):2453-2463.
[35] Cˇech R, MaAcˇutek J, Zˇabokrtsky Z. The role of syntax in complex networks: Local and global importance of verbs in a syntactic dependency network[J]. Physica A, 2011, 390(20):3614-3623.
[36] Cˇech R, Maéutek J. Word form and lemma syntactic dependency networks in Czech: a comparative study[J].Glottometrics, 2009,19: 85-98.
[37] Sigman M, Cecchi G A. Global organization of the Wordnet lexicon[C]//Proceedings of the National Academy of Sciences of the United States of America, 2002. 99(03): 1742-1747
[38] Motter A E, de Moura A P S, Lai Y C, et al.Topology of the conceptual network of language[J]. Physical Review E,2002, 65(06):065102.
[39] Holanda A J, Pisa I T, Kinouchi O, et al. Thesaurus as a complex network[J]. Physica A , 2004,344(03-04):530-536.
[40] Steyvers M, Tenenbaum J B. The large-scale structure of semantic networks:statistical analyses and a model of semantic growth[J].Cognitive Science,2005,29(01):41-78.
[41] Tang L, Zhang Y G, Fu X. The Statistic Properties of Chinese Semantic Network in HowNet[C]//Proceedings of NLP-KE05, 2005,58-61.
[42] Liu H T. Statistical properties of Chinese semantic networks[J]. Chinese Science Bulletin,2009,(16): 2781-2785.
[43] Li J Y, Zhou J. Chinese character structure analysis based on complex networks[J]. Physica A,2007, 380(01):629-638.
[44] Li Y, Wei L, Li Wei, et al. small-world patterns in Chinese phrase networks[J]. Chinese Science Bulletin, 2005, 50(3): 286-288.
[45] 王建伟, 荣莉莉. 基于复杂网络理论的中文字字网络的实证研究[J]. 大连海事大学学报, 2008, 34(4): 15-18.
[46] Veronis J. Hyperlex: lexical cartography for information retrieval[J]. Computer Speech & Language, 2004:18(03): 223-252.
[47] Amancio D R, Antiqueira L, Pardo T A S, et al. Complex networks analysis of manual and machine translations[J].International Journal of Modern Physics C,2008, 19 (04):583-598.
[48] Tsatsaronis G, Varlamis I, Nrvg K. An experimental study on unsupervised graph-based word sense disambiguation[C]//Proceedings of Computational Linguistics and Intelligent Text Processing, 11th International Conference, CICLing2010, Iasi, Romania, March 21-27, 2010: 184-198.
[49] Antiqueira L, Oliveira Jr O N, Costa, et al. A complex network approach to text summarization[J]. Information Sciences,2009,79(05), 584-599.
[50] 赵鹏,蔡庆生,王清毅,等.一种基于复杂网络特征的中文文档关键词抽取算法[J].模式识别与人工智能,2007, 20(06):827-831.
[51] 余传明, 周丹. 情感词汇共现网络的复杂网络特性分析[J].情报学报,2010,29(05):906-914.
[52] 江钟立, 林枫, 孟殿怀.复杂适应性系统理论在言语认知康复中的应用前景[J].中国康复医学杂志, 2006, 21(2):183-185.

基金

863计划项目(2011AA01A206);国家自然科学基金(71273126)
PDF(1390 KB)

585

Accesses

0

Citation

Detail

段落导航
相关文章

/