LIU Shao-hui,DONG Ming-kai,ZHANG Hai-jun,LI Rong,SHI Zhong-zhi
2002, 16(3): 9-15,27.
This paper does research and improves on the classical approach of calculating the term weight in Vector Space Model. Furthermore ,an approach of multi-hierarchy text classification based on Vector Space Model is proposed. In this approach ,all classes are organized as a tree according to some given hierarchical relations ,and all the training documents in a class are combined into a class-document . In order to construct the class models ,it is just only to compare among the class-documents attached to the same node of the same layer. When it is going to classify the documents ,one matching process is hierarchically performed from the root node to the leaf nodes until a corresponding subclass is found. The experiment and real systems indicate that the approach is of high classification Precision and Recall.