Informatics and Applications

2015, Volume 9, Issue 2, pp 93-110

ASSOCIATIVE PORTRAITS OF SUBJECT AREAS AS A TOOL FOR AUTOMATED CONSTRUCTION OF BIG DATA SYSTEMS FOR KNOWLEDGE EXTRACTION: THEORY, METHODS, VISUALIZATION, AND APPLICATION

  • I. V. Galina
  • E. B. Kozerenko
  • Yu. I. Morozova
  • N. V. Somin
  • M. M. Charnine

Abstract

The paper presents the technique of developing systems for extraction of knowledge which employs the approach of automated association portrait of a subject area (APSA) formation and building a semantic context space (SCS). The ideology of the APSA is based on the distributional hypothesis claiming that semantically equal (or related) lexemes have a similar context and, vice versa, in a similar context, the lexemes are semantically close. The model uses an extended hypothesis that consists in the investigation of similarities and differences in contexts not only of individual words, but of arbitrary multilexeme fragments of meaningful word-combinations. The examples of implemented projects for different subject domains are given.

[+] References (33)

[+] About this article