Information Retrieval and Question Answering
LIU Dexi, FU Qi, WEI Yaxiong, WAN Changxuan, LIU Xiping, ZHONG Minjuan, QIU Jiahong
2018, 32(3): 110-119.
Social short texts, coming from Twitter, Sina Microblog, etc., are limited in length but bear diversified topics, complex social relationships, as well as strong correlation with Web pages. Therefore, the traditional information retrieval methods are not suitable for the socialized short texts. The paper proposes a social short text retrieval method, SSTR, based on multiple-enhanced graph. The multiple-enhanced graph algorithm is based on Markov chain theory, where three types of relationships between short texts, authors, and tokens are considered. In SSTR, topic model based on LDA is employed when computing the similarity between short texts, which could overcome the disadvantages of TF-IDF feature. Experimental results show that, compared to cosine similarity based and LDA based re-ranking, SSTR produces better re-ranking result.