An information retrieval approach for text mining of medical records based on graph descriptor

Alexander Dudko, Tatiana Endrjukaite, Yasushi Kiyoki

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

This paper describes a new method of data retrieval from free text documents in medical domain. Proposed approach creates the document summary and highlights most important keywords in the text. To achieve this result we process the document natural language text and build a descriptor as an internal representation of the document. This descriptor is a graph with concepts, relations between them, and concept points as a metric of relevance. By means of points in the descriptor the approach performs ambiguity resolution, selects most relevant concepts to display in the summary, and votes for keywords highlighting in the text. Besides the direct representation of identified information in the summary, this work proposes a way to provide extended summary by using additional knowledge about relations between medications, procedures, diseases and anatomy. The described approach helps to speed up analysis and decision making processes by means of providing aggregated summary for a document and highlighting most meaningful parts of the document's text. Experiment results demonstrate that automatic summary generation and keywords highlighting can be successfully performed by the proposed approach to achieve meaningful and highly relevant results.

Original languageEnglish
Title of host publicationInformation Modelling and Knowledge Bases XXX
EditorsTatiana Endrjukaite, Hannu Jaakkola, Alexander Dudko, Yasushi Kiyoki, Bernhard Thalheim, Naofumi Yoshida
PublisherIOS Press
Pages334-352
Number of pages19
ISBN (Electronic)9781614999324
DOIs
Publication statusPublished - 2019 Jan 1

Publication series

NameFrontiers in Artificial Intelligence and Applications
Volume312
ISSN (Print)0922-6389

    Fingerprint

Keywords

  • ambiguity resolution
  • graph descriptor
  • information retrieval
  • summary generation
  • text mining

ASJC Scopus subject areas

  • Artificial Intelligence

Cite this

Dudko, A., Endrjukaite, T., & Kiyoki, Y. (2019). An information retrieval approach for text mining of medical records based on graph descriptor. In T. Endrjukaite, H. Jaakkola, A. Dudko, Y. Kiyoki, B. Thalheim, & N. Yoshida (Eds.), Information Modelling and Knowledge Bases XXX (pp. 334-352). (Frontiers in Artificial Intelligence and Applications; Vol. 312). IOS Press. https://doi.org/10.3233/978-1-61499-933-1-334