Review
WANG Shan; LIU Rui
2016, 30(6): 140-146.
The construction of a speech corpus is the foundation of research on spoken language. In this paper, a small-scale corpus is constructed from two representative talk shows, QiangqiangSanrenxing and LuYuYouyue. An annotation system consisting of 5 primary categories and 16 subtypes is developed to annotate conversational structures. The statistics show 309 interrupted structures, 141 inserted structures, 111 repetitive structures, 653/589 question and answer structures, and 51/21 obstruction-correction structures, which reflects an unbalanced distribution of conversational structures. The form, nature and communicative tasks of the talk shows are the main factors influencing this distribution. In addition, conversational structures show certain patterns, so trigram analysis is carried out to explore how they combine. The highest-frequency combination in the corpus is the question-answer adjacency pair, alongside a large number of contingency combinations. The combination patterns of conversational structures not only reflect the style of the talk shows but also help to analyze the functional modules in the conversation and the formation of conversation strategies, and thus help us understand the operational mechanisms of conversation more deeply.
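The trigram analysis mentioned in the abstract can be illustrated with a minimal sketch. It assumes each conversation has already been reduced to an ordered list of structure labels. The label names ("Q", "A", "INT") and the sample sequence are hypothetical and are not taken from the corpus or from the paper's annotation scheme.

```python
from collections import Counter


def count_trigrams(labels):
    """Count every run of three consecutive structure labels."""
    # zip over three offset views of the list yields each sliding window of size 3.
    return Counter(zip(labels, labels[1:], labels[2:]))


# Hypothetical label sequence for one conversation (illustrative only):
# Q = question, A = answer, INT = interruption.
sequence = ["Q", "A", "Q", "A", "Q", "A", "INT", "Q", "A"]

trigrams = count_trigrams(sequence)

# List the combinations from most to least frequent.
for combo, freq in trigrams.most_common():
    print(" -> ".join(combo), freq)
```

A sequence of n labels yields n - 2 trigrams, so frequent combinations such as alternating question and answer labels stand out once counts are aggregated over the whole corpus.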