Knowledge Representation and Acquisition
LIU Xuli, JIN Jihao, RUAN Tong, GAO Daqi, YIN Yichao, GE Xiaoling
2020, 34(11): 37-48.
Clinical research based on observational data of electronic medical records has become a hot topic. In this paper, a new representation model of medical observation data based on RDF is proposed. The model can clearly represent multiple event types such as clinical examination, diagnosis, treatment as well as temporal relationships between events. Base on electronic medical records from hospitals, clinical event graphs are constructed by four steps: data preprocessing, RDF format conversion, time sequence construction and knowledge fusion. Specifically, using the electronic medical records of three first-class hospitals in Shanghai, we constructed a medical dataset including three specialties, 173 395 medical events, 501 335 temporal relationships of events, and linked with 5 313 concepts in the knowledge base. This paper further provides 40 sample queries for clinical retrospective research including etiology analysis and treatment analysis, with demonstration in contrast to the traditional database in terms of query formulation and retrieval process. The dataset follows the Open Link Standard and is published on OpenKG with online SPARQL site (https://peg.ecustnlplab.com/dataset.html).