Abstract
Using error estimation, this paper derives the lower limit of the sample size for frequency statistics of Chinese characters under a given error bound and confidence probability. It also presents a new statistical method for Chinese character frequencies. The statistical frequency of a character defined by this method is unbiased and has a smaller variance than that of earlier methods, and is therefore a more precise estimate of the character's utility frequency.
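The abstract does not reproduce the sample-size formula, so the following is only a minimal sketch of the textbook normal-approximation bound for estimating a proportion, not the paper's own derivation. It assumes each sampled character is an independent Bernoulli trial with success probability p (the character's utility frequency); the function name `min_sample_size` and its parameters are hypothetical.

```python
import math
from statistics import NormalDist


def min_sample_size(error_bound, confidence, p=None):
    """Smallest n such that P(|f_hat - p| <= error_bound) >= confidence,
    under the normal approximation to the binomial distribution.

    Assumption (not from the paper): sampled characters are independent
    Bernoulli trials. If p is unknown, the worst case p = 0.5 is used,
    which maximizes the variance p * (1 - p).
    """
    if not 0 < error_bound < 1:
        raise ValueError("error_bound must lie in (0, 1)")
    if not 0 < confidence < 1:
        raise ValueError("confidence must lie in (0, 1)")
    # Two-sided standard normal quantile for the given confidence level.
    z = NormalDist().inv_cdf((1 + confidence) / 2)
    variance = 0.25 if p is None else p * (1 - p)
    return math.ceil(z * z * variance / error_bound ** 2)


if __name__ == "__main__":
    # Worst-case bound for an error of 0.01 at 95% confidence.
    print(min_sample_size(0.01, 0.95))
    # A rare character (p = 0.001) needs far fewer samples for the same
    # absolute error, because its variance is much smaller.
    print(min_sample_size(0.01, 0.95, p=0.001))
```

Because individual character frequencies are small, an absolute error bound that is useful in practice would itself be small, which drives the required sample size up quadratically in 1 / error_bound.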
Key words
utility frequency /
statistical frequency /
sample size /
confidence probability /
unbiasedness /
efficiency